24
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 24 Feb 2025
24 points (100.0% liked)
TechTakes
1662 readers
92 users here now
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
founded 2 years ago
MODERATORS
I had a similar disc with one of my friends! Anthropic is bragging that the model was not trained to play pokemon, but pokemon red has massive wikis for speed running that based on the reasoning traces are clearly in the training data. Like the model trace said it was "training a nidoran to level 12 b.c. at level 12 nidoran learns double kick which will help against brock's rock type pokemon", so it's not going totally blind in the game. There was also a couple outputs when it got stuck for several hours where it started printing things like "Based on the hint..." which seemed kind of sus. I wouldn't be surprised if it there is some additional hand holding going on in the back based on the game state (i.e., go to oaks, get a starter, go north to viridian, etc.) that help guide the model. In fact, I'd be surprised if this wasn't the case.