I replaced Google Home with Home Assistant and a local LLM, and I'm not looking back
(www.xda-developers.com)
This was the greatest thing for my home.
Privacy concerns abound with Google/Amazon/Apple, obviously, but the bigger thing from a day-to-day perspective was the lack of control.
I used Siri for a while and it was frustrating because it lagged behind tremendously. I tried Alexa and Google at friends' and family's houses, and while they were notably better they were still awful. Ads were obviously the worst part; with Alexa especially there was so much fluff to try and sell shit. A voice assistant should be as unobtrusive as possible, and the second it says extra bullshit it’s doing a shitty job. Poor query handling happened with all three to varying degrees, but the worse issue was that queries weren't processed locally.
This meant three major deal-breaking issues. First, the assistants needed a constant internet connection, so an outage meant your “smart” home was now objectively worse than before. Second, the quality of that connection was a key factor: with bandwidth saturation, DNS issues, etc., latency would notably increase or queries would time out. Third, because the processing is server-side and not controlled by you, query responses can suddenly change: a macro works one day and stops the next, or a question gets answered one week and not the next.
With Whisper and Qwen I am entirely local and have control of the entire process. If a query is processed poorly or not at all, I can create intent scripts to shift the behavior as I please. I can have the entire smart home cordoned into an isolated VLAN or a separate switch with only intranet access, so data collection isn't possible and external intrusion is extremely unlikely. I can bridge key components around this with something like Headscale so I can still check my cameras or whatever remotely.
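To illustrate the "intent scripts first, LLM second" idea, here is a rough Python sketch. It is not Home Assistant code: in HA this role is played by custom sentences and `intent_script` entries in YAML. The phrases, the `OVERRIDES` table, the action strings, and the `route` function are all made up for the example.

```python
import re
from typing import Callable

# Hypothetical overrides: each regex maps a spoken phrase to a fixed action.
# The phrases and action strings are invented for illustration only.
OVERRIDES: list[tuple[re.Pattern[str], Callable[[re.Match[str]], str]]] = [
    (re.compile(r"^(?:turn|switch) (on|off) the (.+?) lights?$"),
     lambda m: f"light.turn_{m.group(1)}:{m.group(2)}"),
    (re.compile(r"^movie time$"),
     lambda m: "scene.turn_on:movie"),
]


def route(transcript: str, llm: Callable[[str], str]) -> str:
    """Return a fixed action if an override matches, else defer to the LLM."""
    # Normalize the transcript: trim, lowercase, drop trailing punctuation.
    text = transcript.strip().lower().rstrip(".!?")
    for pattern, handler in OVERRIDES:
        match = pattern.match(text)
        if match:
            return handler(match)
    # Nothing matched, so hand the normalized text to the (slower) model.
    return llm(text)
```

The point is that anything the model handles badly can be pinned to a deterministic rule, and everything else still goes to the LLM.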
It’s not all roses: latency can still occur depending on demand and hardware. If you’re like me and have 100+ entities that the LLM can interact with, things can slow down a bit. VRAM matters more than raw GPU speed in my experience. The speech transcription is fairly lightweight and can happen in under a second, but the query processing can take a few moments if the hardware isn’t beefy enough. This also means a server that is more of a power hog, sometimes significantly so (relative to something like an RPi 4).
The biggest hurdle is that because it is so controllable, so open, and relatively new, setup can be a pain. There are several guides on GitHub and the HA forums that are very helpful, but this is definitely in the space of “tinkerer who is comfortable with YAML and Docker”. Additionally, the other parts of the hardware can be a pain. Where in other ecosystems you can just spend $80-100 on an Echo or HomePod or whatever, here you have to figure out a speaker and microphone. This can be very cheap (literally an ESP32 with a cheap mic and speaker attached), but that typically doesn’t give you the “smart speaker” functionality, which can be nice to have. If you like the idea of music wherever you are in your house, those commercial devices have really improved the concept of “multi room audio”, which can be an absurdly expensive nightmare otherwise.
The other pain point is that if you truly want local-only for IoT devices, you either have to accept that there will be some unknowns (mainly new and time-sensitive data, e.g. “what was the score of the game”, stock prices, etc.) or use another system as a bridge to run something like SearXNG, then set it up so that the LLM invokes this bridge upon unknowns. This adds several seconds of latency, but honestly 95% of queries exist within the LLM's model, so asking for general info is usually entirely local (e.g. “who wrote flowers for algernon”).
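For a sense of what such a bridge might look like, here is a minimal Python sketch. It assumes a SearXNG instance with the JSON output format enabled in its settings; the `searxng.lan` address is a placeholder. Note that the keyword check in `needs_search` is a crude stand-in for illustration, not the LLM-driven invocation described above, and all function names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: a SearXNG instance reachable from the bridge host, with the
# JSON output format enabled. The address below is a placeholder.
SEARXNG_URL = "http://searxng.lan:8080"

# Hypothetical keyword heuristic for "new or time-sensitive" queries.
FRESH_HINTS = ("score", "stock", "price", "weather", "today", "latest", "news")


def needs_search(query: str) -> bool:
    """Guess whether a query needs fresh data the local model can't know."""
    words = query.lower().split()
    return any(hint in words for hint in FRESH_HINTS)


def build_search_url(query: str, base: str = SEARXNG_URL) -> str:
    """Build a SearXNG search URL that asks for JSON results."""
    return f"{base}/search?{urlencode({'q': query, 'format': 'json'})}"


def top_snippets(payload: dict, limit: int = 3) -> list[str]:
    """Reduce a SearXNG JSON response to a few 'title: content' lines."""
    results = payload.get("results", [])[:limit]
    return [f"{r.get('title', '')}: {r.get('content', '')}" for r in results]


def search(query: str) -> list[str]:
    """Query the bridge over the network and return snippets for the LLM."""
    with urlopen(build_search_url(query), timeout=10) as resp:
        return top_snippets(json.load(resp))
```

The snippets would then be fed back to the model as context, which is where the extra seconds of latency come from.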
I know there can be a lot of hate for smart home stuff, which is often justified, but if you set it up correctly there’s no internet access and it just kind of works. With presence sensors I don’t even do much voice command of devices. Lighting and HVAC are all automated. The voice stuff is for incidentals: playing music, searching for info, setting timers, etc. It’s handy most of the time, and it’s especially helpful when my hands are full, when my elderly family members visit, when I’m working on something and can’t move away to adjust lighting, etc. But it is frivolous for sure.
That was an interesting read. I've been against personal assistants/smarthome/etc., but your comment might have brought me around. Thank you for sharing your setup.