Google Chrome silently installs a 4 GB AI model on your device without consent. At a billion-device scale the climate costs are insane. — That Privacy Guy! (www.thatprivacyguy.com)

submitted 3 weeks ago by ooli3@sopuli.xyz to c/FightForPrivacy@sopuli.xyz

turdas@suppo.fi 1 points 3 weeks ago

> but I'd assume for real world energy usage the KV cache would be very impactful if not eventually dominant.

They talk about this in the appendix where they go over the (estimated) effects of large amounts of input tokens (up to 100k). This isn't really relevant for Gemini Nano because it only has a max 32k context window, and the deployment in Chrome probably caps it at far less than that.

I'm inclined to believe the main analysis is reasonably accurate. The numbers are similar to what I get on my own machine with local models. Granted, I tested with a smaller model (Mistral 7B in this case) on weaker hardware (AMD RX 6700 XT), but in a quick test I get about 50 tok/s at 180 W power draw, which works out to about 0.5 Wh for 500 tokens (10 seconds at 180 W). AMD GPUs suck for AI, so I think it's plausible that dedicated compute hardware would get basically the same energy efficiency on a frontier model.
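For anyone who wants to check the arithmetic, here is a minimal sketch. The function name `energy_wh` is just something made up for illustration, and it assumes the power draw stays constant for the whole generation, which a quick test like mine can't really guarantee.

```python
def energy_wh(power_w: float, tokens_per_s: float, tokens: int) -> float:
    """Energy in watt-hours to generate `tokens` at a constant power draw."""
    seconds = tokens / tokens_per_s      # generation time
    joules = power_w * seconds           # 1 W sustained for 1 s = 1 J
    return joules / 3600.0               # 3600 J = 1 Wh


# Figures from the quick test above: 180 W, about 50 tok/s, 500 tokens.
gpu_energy = energy_wh(power_w=180.0, tokens_per_s=50.0, tokens=500)
print(f"{gpu_energy:.2f} Wh")
```

Plugging in your own wattage and tok/s gives a comparable rough figure for other hardware.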

Gemini Nano on a phone NPU is obviously going to be far more efficient -- by all accounts it gets the same or better tok/s I am getting at like 1/50th the TDP.
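As a rough sketch of what that implies, taking "1/50th the TDP" and "same tok/s" at face value (both are hand-wavy assumptions, not measurements, and TDP is only a stand-in for actual power draw):

```python
# Desktop GPU figures from my quick test.
GPU_POWER_W = 180.0
TOKENS_PER_S = 50.0
TOKENS = 500

# Assumption: the NPU draws 1/50th of the GPU's power at the same tok/s.
npu_power_w = GPU_POWER_W / 50.0

seconds = TOKENS / TOKENS_PER_S
gpu_wh = GPU_POWER_W * seconds / 3600.0   # joules to watt-hours
npu_wh = npu_power_w * seconds / 3600.0

print(f"NPU power: {npu_power_w:.1f} W")
print(f"GPU: {gpu_wh:.2f} Wh, NPU: {npu_wh:.2f} Wh per {TOKENS} tokens")
```

So under those assumptions the phone would be at a few watts and roughly a hundredth of a watt-hour per 500 tokens.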

this post was submitted on 05 May 2026

137 points (100.0% liked)
