You think? (lemmy.world)
submitted 1 month ago by nifty@lemmy.world to c/fuck_ai@lemmy.world
JasminIstMuede 18 points 1 month ago

As far as I know, the DeepMind paper was actually a challenge to the OpenAI paper: it argued that large models are undertrained for their size, so they underperform relative to the compute spent on them. They trained a model with 70B params on far more data and it outperformed much larger models while needing less compute to run. I don't think any general conclusion about a hard ceiling on LLM performance can be drawn from this.
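
For a rough sense of scale, here is a back-of-the-envelope sketch using the common rule of thumb that training compute is about 6 × parameters × tokens. The parameter and token counts are the figures usually quoted for DeepMind's 70B model and their earlier 280B model; I'm filling them in from memory as assumptions, and `training_flops` is just a helper name made up for this example.

```python
def training_flops(params: float, tokens: float) -> float:
    """Approximate training compute with the rule of thumb C ~ 6 * N * D."""
    return 6.0 * params * tokens


# Assumed figures (not from this thread): 70B params on 1.4T tokens
# versus 280B params on 300B tokens.
small_model = training_flops(70e9, 1.4e12)
large_model = training_flops(280e9, 300e9)

# Ratio of the two training budgets under this approximation.
ratio = small_model / large_model

print(f"70B model:  {small_model:.2e} FLOPs")
print(f"280B model: {large_model:.2e} FLOPs")
print(f"ratio:      {ratio:.2f}")
```

If those figures are right, the two training budgets come out in the same ballpark, so the difference is mainly in how the budget was split between model size and data rather than in how much compute was spent.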

However, this does not change the fact that there are areas (ones that rely on correctness) where this kind of model simply cannot be a replacement, and pursuing it there is foolish.

this post was submitted on 31 Dec 2024
1801 points (100.0% liked)
