565
top 50 comments
sorted by: hot top controversial new old
[-] SirEDCaLot@lemmy.today 32 points 12 hours ago

Ever run an AI model locally? If you want the most capability you need a fast GPU with 32-48gb RAM. And that's all for you, ONE user.

Copilot has millions of users, with tens or hundreds of thousands of them hitting the AI all at once. Each one needs $thousands worth of GPU and RAM dedicated to them for the length of their query processing.

Where do you think the money to buy all that hardware comes from? You see OpenAI buying a double digit percentage of the world's RAM production, you think they got it on clearance sale?

No, there are investors. Investors who are pouring hundreds of billions into this AI stuff. And they don't do this because it's fun, they do it because they expect a BIG return.

So what's going on is just like your neighborhood drug pusher, only the drug pusher is more honest. He says 'first hit's free, man'. AI company says 'AI models are an easy and cost effective way to modernize your workflow!'; they don't tell you that once you've integrated them and fired all the humans who know how to do the work, the price is gonna go way up.

Because the fact is, there IS a real cost of AI compute. GPU time, or at the large scale, datacenter space, power, cooling, etc.

In another few months to few years, the C-suites will stop huffing the koolaid and will start doing cost-benefit analysis on where AI is and isn't cost-effective vs. humans. With any luck (for the AI people) by that time the AIs will be good enough that it's a clear benefit. If not this bubble's gonna pop.

[-] Rooster326@programming.dev 9 points 2 hours ago

Has anyone ever actually had a drug dealer giving out free hits?

I feel like that's the biggest lie DARE ever told.

[-] MIDItheKID@lemmy.world 2 points 56 minutes ago

DARE told so many lies. I was told that if I smoked weed, it would ruin my life and I would become a helpless drug addict. Then I smoked weed and that didn't happen. Then they got me thinking "what else did they lie to me about?".

[-] BlackPenguins@lemmy.world 2 points 2 hours ago

That's why I would rather run locally. I control my data. Better for the environment. And if I ask a programming question, sure ChatGPT will come back in seconds, but I'm fine with waiting 30-60 seconds for my own AI. How impatient are we.

[-] T156@lemmy.world 6 points 11 hours ago

Ever run an AI model locally? If you want the most capability you need a fast GPU with 32-48gb RAM. And that's all for you, ONE user.

Even then, that's quite small. Top of the line frontier models would be looking at hundreds of gigabytes of video memory, and just as much RAM.

A terabyte of VRAM/RAM needed for something like CoPilot is probably a fairly sensible estimate.

[-] phx@lemmy.world 7 points 11 hours ago

Depends on what you want to do, the model, and optimization or quantization.

A lot of LLM stuff that seemed pretty amazing a few years ago - chatbots and the like that respond to questions in plain language - can run in comparatively light hardware. Coding agents can take more, but could also be optimized against a particular language and spit out useful snippets.

Image stuff can be pretty complex especially at higher resolutions and detail, and creating seamless video segments gets expensive on hardware, fast.

[-] SirEDCaLot@lemmy.today 2 points 2 hours ago

Quite true. The thing is, there aren't billions and billions of dollars in chatbots. The billions are for the creative stuff and the code.

And that is where the reckoning / correction will come from, the bill has to come due eventually. When top end generative AI starts to have a real cost associated with it, then it's no longer a blanket 'everyone start using this immediately' mandate, it prompts some consideration of cost versus output quality.

[-] heartSagan5@lemmy.zip 6 points 12 hours ago

So, what you’re saying…is the AI Bubble is going to pop once the pencil pushers do the math? But they’re asking their local LLM for that… so it isn’t happening?

[-] SirEDCaLot@lemmy.today 4 points 11 hours ago

Not pop. Correct.

A lot of the managers aggressively pushing AI have little or no understanding of it themselves. They just hear of a technology that can make a human more productive by doing most of the work for them. So absolutely that's worth a ton of money. It's why many companies are encouraging if not demanding employees to start using AI- because in their mind, one employee fully utilizing AI can do the work of two standard employees. Of course they believe this because they've never actually had to use the damn thing themselves and thus don't realize it doesn't do all the work for you. Or worse they think it does and your wonderful code base turns into spaghetti.

Side note- A few companies even had leaderboards for who was using the most AI tokens. This led to 'tokenmaxxing', trying to consume as many tokens as possible to prove you are adopting AI. Things like 'Write unit tests for our company code base, then refactor the code base. Spin up an instance of Claude and another of ChatGPT to each generate unit tests of the old code and run them against the new code, then run the tests against each other to check each other's work, submit full debug output to another instance of gpt 5.5 that will check for hallucinations... Keep that query going for a few paragraphs and you'll have an army of AI workers all checking each other's work while producing zero productive output but costing a fortune to run.

[-] sudoer777@lemmy.ml 5 points 11 hours ago

Deepseek V4 Flash exists as a small open-weight model and is crazy cheap for what it is capable of. ClosedAI/Anthropic and other non-free models are sketchy AF with their pricing and basically everything else about them.

[-] spaceracoon@lemmy.zip 22 points 19 hours ago

The squeeze could not happen soon enough 😌

[-] heartSagan5@lemmy.zip 7 points 12 hours ago

For real. This whole “AI/LLM” nonsense could’ve been a nothing burger if they correctly charged it at the get go… Ah, well. Venture capital is suckers, I guess?

[-] Sinthesis@lemmy.today 3 points 11 hours ago* (last edited 11 hours ago)

From what I've seen, the AI companies don't even know what to charge. For example, claude.ai finally made an API for user costs and we get this fun fact:

Values for a given date may be revised for up to 30 days as late events arrive and reconciliation runs. For invoicing-grade totals, query dates at least 30 days in the past.

[-] underscores@lemmy.zip 3 points 14 hours ago

btw soon GitHub actions will no longer be free to self host

[-] garbage_world@lemmy.world 2 points 11 hours ago

For private repos, unless something changed since I last heard about that

[-] mazzilius_marsti@lemmy.world 2 points 15 hours ago

what is a "usage" btw? They said 1 cent = 1 usage.

So 1 message sent to Copilot? Or usage = number of response Copilot gives back to the user?

[-] SirEDCaLot@lemmy.today 7 points 13 hours ago

I think that was before. Now it's based on the compute cost of the request. Like if you ask a large model to perform a long and difficult task you'll pay through the nose. If you ask a simple model to perform an easy task it'll be dirt cheap. And that's what's got people pissed, is that what used to be 'one query' now may consume a lot of credits and it may not be clear how many.

[-] kazerniel@lemmy.world 19 points 1 day ago

*plays the world's tiniest violin*

[-] qaz@lemmy.world 58 points 1 day ago

Everyone could've seen it coming from mile away

[-] nbsp@programming.dev 20 points 1 day ago

anyone who didnt is a fucking idiot

[-] zbyte64@awful.systems 2 points 1 hour ago* (last edited 1 hour ago)

I've got some bad news about our managers and leaders....

[-] pHr34kY@lemmy.world 118 points 1 day ago

It wasn't profitable. This bait and switch was a long time coming.

Yup, the “first one is free” deal is always a trap.

[-] SCmSTR 13 points 1 day ago* (last edited 1 day ago)

Literal drug dealer strategy

[-] Clutter@sh.itjust.works 20 points 1 day ago

In just glad it happened relatively soon. Should have happened sooner :-)

[-] replicat@lemmy.world 38 points 1 day ago

I've been using it for well over a year now.

Hit my limit in 1 day and canceled.

load more comments (4 replies)
[-] Trilogy3452@lemmy.world 23 points 1 day ago

I think this is "stupid money', again.

I hope this makes companies stop being "AI native"

[-] bigbangdangler@reddthat.com 100 points 1 day ago

Sigh. None of this is surprising in the least. The cost of AI infrastructure and compute (coupled with the complexity of the chain) makes it prohibitively expensive.

It only has appeared cheap because of investor money flowing like Niagara on the off-chance that it could be made cheap enough to be profitable after getting everyone addicted to using it. I really don't think it's there, and it's definitely not cheap enough to continue flying for free much longer.

load more comments (8 replies)
[-] ryantown@lemmy.world 179 points 1 day ago

Yeah, what's the surprise here? Turns out it's expensive.

[-] paraphrand@lemmy.world 79 points 1 day ago

Microsoft was deceptive here and never made it clear exactly what sort of deal you were getting with the flat rate. There was no indication of the actual magnitude.

[-] W98BSoD@lemmy.dbzer0.com 1 points 2 hours ago
load more comments (13 replies)
load more comments (1 replies)
[-] teft@piefed.social 115 points 1 day ago

Man, enshittification is happening so fast for ai. Imagine the next big thing. It'll be enshittified prior to release.

[-] kieron115@startrek.website 4 points 16 hours ago* (last edited 16 hours ago)

This is the same business model that tech "startups" use, just at a vastly accelerated time table. Upend some existing market (in this case, several markets) by burning through cash at an unsustainable rate. Once your customers have been hooked, and/or the alternatives eliminated as competitors, you crank up the prices to try and make the business cashflow positive.

[-] porous_grey_matter@lemmy.ml 69 points 1 day ago

This isn't enshittification in the traditional sense, they haven't captured the market enough for that. They're just panicking because they're burning cash way too fast.

load more comments (12 replies)
load more comments (8 replies)
[-] vk6flab@lemmy.radio 80 points 1 day ago

~~GitHub~~ Microsoft just switched Copilot to metered billing, and developers are watching months of credits vanish in a single day

load more comments (3 replies)
load more comments
view more: next ›
this post was submitted on 04 Jun 2026
565 points (100.0% liked)

Technology

85169 readers
3555 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 3 years ago
MODERATORS