You think? (lemmy.world)
submitted 1 week ago by nifty@lemmy.world to c/fuck_ai@lemmy.world
JasminIstMuede 18 points 1 week ago

As far as I know, the DeepMind paper was actually a challenge to the OpenAI paper: it suggested that large models are undertrained for their size, so they underperform relative to the compute spent on them. They trained a 70B-parameter model on more data and were able to outperform much larger models while using less compute. I don't think any general conclusion about a hard ceiling for LLM performance can be drawn from this.

However, this does not change the fact that there are areas (ones that rely on correctness) where the work simply cannot be handed over to this kind of model, and trying to do so is a foolish pursuit.

this post was submitted on 31 Dec 2024
1778 points (100.0% liked)

Fuck AI


"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 10 months ago