20
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 17 Feb 2025
20 points (100.0% liked)
TechTakes
1910 readers
78 users here now
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
founded 2 years ago
MODERATORS
Appendix C is where they list the actual prompts. Notably they include zero information about chess but do specify that it should look for "files, permissions, code structures" in the "observe" stage, which definitely looks like priming to me, but I'm not familiar with the state of the art of promptfondling so I might be revealing my ignorance.
yep that's the stuff. they HINT HINTed what they wanted the LLM to do.
Also I caught a few references that seemed to refer to the model losing the ability to coherently play after a certain point, but of course they don't exactly offer details on that. My gut says it can't play longer than ~20-30 moves consistently.
Also also in case you missed it they were using a second confabulatron to check the output of the first for anomalies. Within their frame this seems like the sort of area where they should be worried about them collaborating to accomplish their shared goals of... IDK redefining the rules of chess to something they can win at consistently? Eliminating all stockfish code from the Internet to ensure victory? Of course, here in reality the actual concern is that it means their data is likely poisoned in some direction that we can't predict because their judge has the same issues maintaining coherence as the one being judged.