They finetuned 1.5-3b models. This is a non-story
The headline is dumb, but the research isn't. According to the actual contents of the article, that $30 is still 27 times cheaper than what it costs OpenAI to make a similar sized model which also performs worse. That's still a big deal even if the people reporting on it made a stupid title for their article about it.
I feel like the author here doesnt know what the definition of "breakthrough" is.
To reference a previous sidenote, DeepSeek gives corps and randos a means to shove an LLM into their shit for dirt-cheap, so I expect they're gonna blow up in popularity.
TechTakes
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community