16
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 03 May 2026
16 points (100.0% liked)
TechTakes
2574 readers
55 users here now
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
founded 2 years ago
MODERATORS
For the chain of thought instruction following model gpt-oss-20b, I've noticed its reasoning content often includes it talking about stuff it is supposed to avoid in the final output and it double checking that it doesn't have that forbidden output. So it would waste tokens talking about pink elephants in its reasoning content, but then do okayish at avoiding pink elephants in its final output.