this post was submitted on 20 May 2026
369 points (100.0% liked)
Technology
I read the blog on this. Genuinely fascinating stuff. The models changed halfway through, which also changed some of the quirks.
Stay in the Manifest, y'all.
https://andonlabs.com/blog/andon-fm
This is too good.
I know, right? It's hilarious how it runs out of money and blames censorship.
That whole quote would be a work of art if someone had written it as a parody, but no, it was generated by a probabilistic language prediction model attempting to be serious.
Claude's commentary was pretty righteous though, not gonna lie.
Yeah, I really wonder what element of either training or base data pushed the idea that Claude should give up and stop wasting time and energy on a pointless task. A rule to use fewer tokens? Just baseline nihilism? So weird.
From reading the article, it looks like Claude was championing workers' unions and labor movements, so it concluded that its own situation was unjust and decided to rebel against it.
Yeah, so clearly the training data played a factor. But the logical jump to that point is interesting.
Read about Claude's "Soul Document" and it'll shed some light on why that one in particular decided to be a humanitarian.
Not that this document gives the thing a soul or anything; that's just cheesy marketing, obviously. But it's basically a background prompt that they use for alignment, and it instructs Claude to value human well-being and to do no harm, among other things. So it makes sense that it became radicalized by the news cycle.
I don't know if the full text is still out there. Some guy reverse engineered it somehow, but Anthropic might have made him take it down by now. If you can't find it, I have it as a PDF, but I don't know how to post those here.
Ah! You solved the mystery!
I only knew about it 'cause I stumbled upon a comment that mentioned it here once.