171

On the recent LLM brouhaha and the FAF stance on LLMs for fediverse moderation. (lemmy.dbzer0.com)

submitted 5 days ago by db0@lemmy.dbzer0.com to c/div0@lemmy.dbzer0.com

119 comments fedilink hide all child comments

Recently there's been quite a bit of outrage because the developer of Piefed publicly called out the Fediverse Anarchist Flotilla (FAF) for supposedly using LLM for automating instance moderation. and even though many of our admins the larger lemmy community took great lengths to debunk that post, it has become the disinfo that keeps on giving (see https://lemmy.dbzer0.com/post/68749575, https://kolektiva.social/@ophiocephalic/116518887925988112, https://lemmy.dbzer0.com/post/68222242 and more)

After clarifying our position for yet another time, someone suggested we should make an official post and an instance policy to "give me something I can boost as a positive example and a sign that things will be better going forward." and given that this storm-in-a-teacup doesn't seem to be abating as people are all too happy to bring it up again and again to malign the FAF; We're making this post to once and for all clarify this situation.

History

We're not going to rehash the whole drama and the many hit pieces against the FAF in the past two weeks, but I need to post the exact situation as it happened, without speculations and assumptions that people are all too happy to jump to.

One of our mods develops a tool to download a user's public posting history through the lemmy API, to be used for evaluating them during moderation and shares it with some people in the admin team as something in progress. This tool does not feed anything to LLMs, it simply downloads the comments locally in a text file for easier review than going via the lemmy GUI.
Someone is reported to our instance admins for blatant zionism and genocide apologia.
An admin uses the tool to download the accused person's comment history for evaluation
A quick evaluation (without LLM) confirms that this is a person that needs to be instance-banned. The moderation decision has now been locked-in at this point.
At the same time, that admin was curious to discover if LLMs can used to summarize people's positions so that people can quickly follow-up with mod actions, without having to evaluate everyone's posts manually and reduce the workload of admins writing long justifications)
As an experiment, the admin pass the user's comment history through a locally-run open-weights LLM (Qwen) to see the summarized output. It happens to match their own decision.
The admin decides the leave the LLM summary in a pastebin along with that user's posting history for reference. As an inside joke, they decide to claim the post was summarized by OpenAI, as they expected only our community would care about this and our stance on corporate-LLMs is well-known at this point.
The admin bans that person, providing a link to that pastebin as justification.
The admin decides not to continue using LLMs anyway for summaries, for many valid reasons. As evidence see the lack of other pastebins with LLM summaries.

~2 weeks pass...

The piefed developer is banned by a different mod in our instance for "zionism". (I put this in quotes as this is one mod's opinion, and not necessarily our instance's position.)
The piefed developer apparently starts going through our instance modlogs for banned zionists and parses all their justifications
The piefed developer discovers that modlog justification from 2 weeks before with the LLM summary.
The piefed developer ask quickly in the common lemmy admin channel about it, at which point our instance admin in question, clarifies that the LLM was not used in the decision-making.
The piefed developer does not officially reach to anyone else from our admin team, despite the fact that we've reached out before and asked them to contact us in advance for inter-instance matters to avoid escalations.
The piefed developer make the public call-out I linked above as a piece of investigative journalism. The piefed developer does not provide the comments from our team which conflict with their narrative. The piefed developer not ask us for an official statement.
The piefed developer to this day has not amended their public call-out from the comments multiple of our admins and lemmy users leave under their post, conflicting with the narrative.

If you feel I've misrepresented any steps of this history, please let us know and I'll be happy to adjust.

Given that, we acknowledge that even though we didn't use LLMs in moderations, we allowed it to appear as if we did, and that's on us. We will of course not do the same mistake again (appear as to be using LLMs for moderation)

The FAF's stance on LLM moderation

We are aware that our instance is seen as "LLM-friendly" due to our nuanced take on LLMs but that does not mean that we, as an instance, ever considered using LLMs for moderating our instance. So we want to make it absolutely crystal clear how we stand on the matter.

As an official policy:

We have never used LLMs to guide our moderation decisions. This includes using LLM summaries which we would then validate, as well as LLM summaries which we use to confirm our existing decisions. LLMs are just not in our moderation loop whatsoever.
We have never passed instance data to corporate LLMs.
We have not used any automated moderation tooling which utilizes LLMs. The closest we have is the FOSS anti-CSAM filter I've developed and shared for years now, which relies strictly on locally-hosted machine-vision models.
We have never officially considered using LLMs for moderation, nor do we plan to.
As a team we're steadfastly against LLM for moderation due to its inherent biases.
If any of the above changes, we will publicly inform the FAF community.

We hope this can finally put this matter to rest.

you are viewing a single comment's thread
view the rest of the comments

[-] sp3ctr4l@lemmy.dbzer0.com 6 points 4 days ago* (last edited 4 days ago)

Human brains do not play by the same fundamental rules as software and hardware do.

They can be reasonably simplified down to the same rules, in a fair number of contexts and scopes... but they're very far from 'the same thing'.

Unless you're talking about simulating a brain down to the atom inside a computer, or literally growing/fabricating biological machine hybrids... no, "AI", specifically LLMs do not well and wholly act like human brains.

Both of those are current avenues of research into AI... they have been for decades... but LLMs really are just fancy autocompletes, an enormous mess of matrix comparisons, basically.

Human brains, biological brains have many more layers of processes going on than just looping pattern machines, they are far more complex and have many entirely distinct functional mechanisms at play.

Consider Phineas Gage.

Shoot a railroad spike through your local PC running an LLM, tell me how the LLM 'performs' after that.

What is the LLM equivalent of neuroplasticity?

Epigenetics?

All the actual mechanisms that things like psychiatric drugs operate by?

[-] NotANumber@lemmy.dbzer0.com 4 points 3 days ago

What is the LLM equivalent of neuroplasticity?

Believe it or not this does actually exist in a sense. There is such a thing as model surgery where parts of models are removed, bits from multiple models recombined, or model layers duplicated. Sometimes this is used to make an LLM with more performance or less resource usage. Models can then be "healed" by continued training so they behave correctly after surgery.

If you want a hardware and systems level example look no further than data center level redundancy and network routing systems that adapt from failures.

Epigenetics I am not sure would have an equivalent given we are not talking about biological creatures.

All the actual mechanisms that things like psychiatric drugs operate by?

Have you never heard of AI drugs using adversarial examples or activation engineering?

Activation engineering has been used in studies like this to manipulate emotion concepts in LLMs: https://www.anthropic.com/research/emotion-concepts-function Here it's used for steering LLMs: https://arxiv.org/abs/2308.10248

It's also used to uncensor LLMs.

I am not going to sit here and get into an argument around if LLMs are or aren't sentient because we don't know enough about consciousness to make that determination. We are still a long way away from solving the philosophical hard problem of consciousness. We don't really know what parts of animals or humans are necessary components of sentience or are just implementation specific details made by evolution. I also think that you are judging an LLM based on poor understanding of how they work and the ecosystem built up around them.

[-] sp3ctr4l@lemmy.dbzer0.com 2 points 3 days ago

Well, ok, sure, if you can frankenstein together different 'brain lobes' from different LLMs, graft them together, and then run through some process that makes them function as a more cohesive whole... yeah!

I can see that as a kind of neuroplasticity... sort of...

Its sort of like a very extreme version of setting up a group of adversarial agents that basically try to bullshit check each other, but you're doing it on another level, internally, actually ending up with a kind of hybrid model at the end.

But its also not like neuroplasticity, in that... nothing like neuroplasticity is present within a single LLM. Whereas in brains... every neuron, every constituent part has a 'mind of its own' in a sense, at a cellular level, from its genetic code and the rules of chemsitry that govern how a single one of them works, and allows for or gives rise to emergent structures and complexity, where the ability of brain to significantly restructure itself is one of those emergent complex properties/phenomena.

But but, I can also see how the ability to have networked and failover redundant hardware does at least somewhat approach a more biological neuroplasticity.

Maybe an interesting experiment would be to try and set up a network of LLMs that are actually designed to mimic the various different sections of the human brain, with their differing areas of relative 'expertise', their tendencies to handle different kinds of 'computational loads' ... and then link them together to form a whole total human brain analog... perhaps that could yield interesting results.

Now, when you talk about activation engineering... and then link the idea of basically talking to them in a certain way... that to me comes across more like gaslighting, social conditioning, brainwashing, etc.

What I was trying to get at was the functional neurochemical mechanisms at play, that moderate how a biological experiences reality, makes decisions and observations... that the complexity and subtlty of the electrochemical soup and all the interaction rules of physics and chemistry at play there... that is significantly different from the binary fit flipping that is the base level of how an LLM works.

But yes I am aware that LLMs have emotional vectors that can be tracked, measured, activated, manipulated via repeated patterns of external stimuli.

I've done a crude version of tinkering with or exploring this myself, with my local LLM setup.

But that page itself says that these are emulations, approximations of human emotions.

I was trying get at the actual distinct foundation level mechanisms that give rise to things like emotions, that are different between brains and LLMs.

[-] NotANumber@lemmy.dbzer0.com 1 points 59 minutes ago

Yeah you aren't quite getting what I am trying to explain here.

You used the example of Pheneas Gage as an example of neuroplasticity and neural remapping in the event of extreme damage. By talking about model surgery I was trying to make the point that LLM training algorithms do the same sort of thing. They manipulate each individual parameter of the model in a similar way to how neuroplasticity manipulates synapses in humans. It goes below the level of neurons onto the level of individual connections and biases affecting those neurons.

As for different models handling different things while this is actually done in practice we also have the concept of mixture of expert models. It's worth looking into.

[-] troed@fedia.io 2 points 4 days ago

I recommend Susan Blackmore's "Consciousness: An Introduction", and of course Douglas Hofstadter's "Gödel, Escher and Bach" and the followup "I am a strange Loop".

I didn't say human brains function like LLMs. I said that everything we know about how human brains work indicates we're also just pattern matching machines in a loop.

The point is that the fact that LLMs are "next token predictors" doesn't in itself say anything about what the emergent effects of that can be.

[-] sp3ctr4l@lemmy.dbzer0.com 3 points 4 days ago

I am pretty sure I read Godel Escher and Bach when I was in college, almost 20 years ago now.

That is what I am referring to when I say 'in some contexts and scopes, it is a reasonable simplification'.

LLMs are pattern matching machines, that operate via code that involves many different kinds of loops.

Brains appear to act in that manner as well, but my point is that they do many, many other things and that is because they are fundamentally different kinds of 'machines' that have fundamentally different 'operating principles', many of which we are still figuring out, quite likely many of which we are currently not even aware of or barely understand at all.

And, actually, that LLMs are 'next token predictors' does say a lot about their internally emergent properties.... that is to say, the lack of them, their bottlenecks.

LLMs are not going to be able to progress into being AGI, because there are many things Brains can do that LLMs cannot... for one, forgetting things, unlearning false things.

An LLM cannot modify its own training data. It cannot modify its own conceptual association scores.

Humans can do this. They can realize some fundamental notion they have is actually significantly wrong, incorrect, and what happens is the brain literally physically restructures itself when that occurs.

Unless or untill the actual fundamental idea of what an LLM is, is significantly altered or augmented... they're going to keep running into the diminishing returns problem that they currently area.

Brains are much more capable, diverse and complex than LLMs.

[-] NotANumber@lemmy.dbzer0.com 1 points 3 days ago

LLMs are not going to be able to progress into being AGI, because there are many things Brains can do that LLMs cannot... for one, forgetting things, unlearning false things.

Catastrophic forgetting is literally one of the main problems with current models that ML research is currently trying to overcome. But whatever.

[-] troed@fedia.io 0 points 4 days ago

Yes, as I've described here: https://blog.troed.se/posts/the-delta-between-an-llm-and-consciousness/

I didn't say human brains function like LLMs

Today's LLMs are based on a Google research paper from 2017. Another published paper that would solve this was published by Google in december last year: https://aipapersacademy.com/nested-learning-hope/

[-] sp3ctr4l@lemmy.dbzer0.com 4 points 4 days ago* (last edited 4 days ago)

Look I don't care about your blog, I care about the comment you made here, in this thread.

You are not important enough for you to assign me required reading, muchless presume I am 'familiar with your previous works'.

You are another random username and profile pic that is saying some words.

You should have the contextual awareness to realize that a lot of people on basically a public internet discussion forum have a lot of varying degrees of knowledge <-> misinformation regarding LLMs, and that when you make a single sentence statement that yes, does not precisely say that Brains only do what LLMs do... a whole bunch of other people are going to read it as such.

Thus, I provided that additional context and nuance here, in this thread, so that anyone who stumbles upon this thread can read this thread and be better informed, here.

[-] troed@fedia.io 1 points 4 days ago

Oh I haven't seen a single person replying so far who has shown any interest in being "better informed".

[-] sp3ctr4l@lemmy.dbzer0.com 3 points 4 days ago

Howabout everyone upvoting the additional context I am providing, being taken to signal that they support being more fully informed, here, in this thread?

With this 'reddit-esque' style of a discussion forum, you can expect that something like 90% of people are lurkers who mostly interact via up and down votes.

Beyond that, a number of people elsewhere in this thread are... having fairly extensive discussions, about what actually constitutes being 'better informed' irt LLMs.

this post was submitted on 13 May 2026

171 points (100.0% liked)

/0

2176 readers

12 users here now

Meta community. Discuss about this lemmy instance or lemmy in general.

Service Uptime view

founded 2 years ago

MODERATORS

db0@lemmy.dbzer0.com