337
you are viewing a single comment's thread
view the rest of the comments
[-] brucethemoose@lemmy.world 5 points 1 day ago* (last edited 1 day ago)

I wouldn’t use the word “desperate.”

Scaling is inefficient.

For training, it takes a ton of work to even get half-decent utilization across a bunch of servers, and it makes any sort of experimentation with architectures immensely more difficult.

Hence allegations that some GPUs are assigned “busywork” just to meet utilization quotas from the hardware seller.

For inference, scale isn’t so important. But the demand for tokens is self inflicted: from Meta shoving chatbots in ramdom places in software, and from their architecture being archaic and inefficient.


In other words, none of this has to be. It’s just the whims of one insecure man, surrounded by sycophantic tech bros, who’s feeling FOMO but doesn’t understand transformers LLMs at all.

If he had half a brain, he wouldn’t have fired the team that literally founded the open weights LLM space.

But he’s also too rich to ever feel the consequences of bad decisions now.

this post was submitted on 09 Jun 2026
337 points (100.0% liked)

Technology

85297 readers
3839 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 3 years ago
MODERATORS