22
submitted 1 day ago* (last edited 1 day ago) by CyberSage@piefed.social to c/piefed_meta@piefed.social

I was thinking about moderation in PieFed after reading @rimu@piefed.social mention he doesn’t want NSFW content because it creates more work to moderate. But if done right, moderation shouldn’t fall heavily on admins at all.

One of the biggest flaws of Reddit is the imbalance between users and moderators—it leads to endless reliance on automods, AI filters, and the usual complaints about power-mods. Most federated platforms just copy that model instead of proven alternatives like Discourse’s trust level system.

On Discourse, moderation power gets distributed across active, trusted users. You don’t see the same tension between "users vs. mods," and it scales much better without requiring admins to constantly police content. That sort of system feels like a much healthier direction for PieFed.

Implementing this system could involve establishing trust levels based on user engagement within each community. Users could earn these trust levels by spending time reading discussions. This could either be community-specific—allowing users to build trust in different communities—or instance-wide, giving a broader trust recognition based on overall activity across the instance. However, if not executed carefully, this could lead to issues such as overmoderation similar to Stack Overflow, where genuine contributions may be stifled, or it might encourage karma farming akin to Reddit, where users attempt to game the system using bots to repost popular content repeatedly.

Worth checking out this related discussion:
Rethinking Moderation: A Call for Trust Level Systems in the Fediverse.

top 11 comments
sorted by: hot top controversial new old
[-] wiki_me@lemmy.ml 1 points 9 hours ago

With all the reports on countries manipulating online content. that's like leaving your house in a crime ridden Neighbourhood with the door wide option. as far as i know there is no way to prevent bots when dealing with a highly sophisticated actor like china (which meta reported manipulated online content).

One thing that can help is keeping states about reports user made. if say 90% of the reports are good and the number of reports is say over 50 the person can become a mod. you could also have chart scoring who is the best reporter. The number can be tweaked and you could do some statistical analysis finding the optimal numbers.

[-] julian@activitypub.space 3 points 1 day ago* (last edited 1 day ago)

Discourse's Trust Levels are an interesting idea, but not one that is novel. It was lifted almost entirely from Stack Overflow. At the time, Discourse and Stack Overflow had a common founder, Jeff Atwood.

There's a reason Stack Overflow is rapidly fading into obscurity... its moderation team (built off of trust levels) destroyed the very foundation of what made Stack Overflow good.

I am also not saying that what we have now (first-mover moderation or top-down moderation granting) is better... merely that should you look into this, tread lightly.

[-] CyberSage@piefed.social 1 points 1 day ago

I mentioned the problem with StackOverflow earlier; it relates to gaining reputation from performing moderation actions.

[-] Skavau@piefed.social 5 points 1 day ago* (last edited 1 day ago)

Most federated platforms just copy that model instead of proven alternatives like Discourse’s trust level system.

Speaking for myself - I'm not opposed to taking some elements of this level-up system that gives users more rights as they show that they're not a troll. To what extent would vary though. Discourse seems to be somewhat different type of forum to Lemmy or Piefed.

One of the biggest flaws of Reddit is the imbalance between users and moderators—it leads to endless reliance on automods, AI filters, and the same complaints about power-hungry mods.

I cannot imagine any reddit-clone not, at some point, needing to rely on automoderation tools.

[-] OpenStars@piefed.social 1 points 6 hours ago

And when used correctly, them being a great boon - e.g. users can set up automated keyword filters (allowing None, All, or even just Some of the content through), which is a decision made by oneself not someone else.

Community mods likewise choose whatever they are comfortable with - e.g. if you dislike people who downvote literally every post in the community, then it helps to have tools to detect and put a stop to that. The main thing there is (it seems to me) to remain on top of understanding and making appropriate use of the tool so that it isn't doing something that you do not approve of - e.g. in the aforementioned example perhaps downvoting two posts in the community should not trigger a ban (however short in duration) whereas downvoting twenty should (so then what about 10? 5?).

[-] hendrik@palaver.p3x.de 4 points 1 day ago* (last edited 1 day ago)

Concerning NSFW in specific: I'm not sure, there might be more issues. I've had a look at lemmynsfw and a few others and had a short run-in with moderation. Most content is copied there by random people without consent of the original creators. So we predominantly got copyright issues and some ethical woes if it's amateur content that gets taken and spread without the depicted people having any sort of control over it. If I were in charge of that, I'd remove >90% of the content, plus I couldn't federate content without proper age-restriction with how law works where I live.

But that doesn't take away from the broader argument. I think an automatic trust level system and maybe even a web of trust between users could help with some things. It's probably a bit tricky to get it right not to introduce stupid hierarchies. Or incentivise Karma-farming and these things.

[-] eleijeep@piefed.social 1 points 1 day ago

I haven't used Discourse, but what you describe sounds like the way that Slashdot has been doing moderation since the late 90s, by randomly selecting users with positive karma to perform a limited number of moderation actions, including meta-moderation where users can rate other moderation decisions.

I always thought that this was the ideal way to do moderation to avoid the powermod problem that reddit and lemmy have, although I acknowledge the other comments here about neglecting minorities being a result of random sampling of the userbase, but it is likely that this also happens with self-selected moderation teams.

Within minority communities though, a plurality of members of that community will belong to that minority and so moderating their own community should result in fair selections. Another way to mitigate the exclusion of minorities might be to use a weighted sortition process, where users declare their minority statuses, and the selection method attempts to weight selections to boost representation of minority users.

A larger problem would be that people wanting to have strong influence on community moderation could create sock-puppet accounts to increase their chance of selection. This already happens with up/downvotes no doubt, but for moderation perhaps the incentive is even higher to cheat in this way.

I think a successful system based on this idea at least needs some strong backend support for detecting sock-puppetry, and this is going to be a constant cat and mouse game that requires intrusive fingerprinting of the user's browser and behaviour, and this type of tracking probably isn't welcome in the fediverse which limits the tools available to try to track bad actors. It is also difficult in an open source project to keep these systems secret so that bad actors cannot find ways to work around them.

[-] WillStealYourUsername@piefed.blahaj.zone 1 points 1 day ago* (last edited 1 day ago)

I'm always concerned with these kinds of systems and how minorities would be treated within them. Plenty of anti trans stuff gets upvoted by non trans people from a number of other instances, both on and off trans instances. Any such system would favor the most popular opinions, disallowing anything else, at least from how I interpret them when they are explained to me.

There's also the issue that mods would still have to be a thing and they would need to be able to both ban and remove spam and unacceptable content, so how do you make sure these features aren't also used to just do moderation the old fashioned way?

And how do trusted users work in a federated system? Are users trusted on one server trusted on another? If so that makes things worse for minorities again and allows for abusive brigading. Are users only trusted on their home instance? If so that's better, but minorities are still at a disadvantage outside of their own instances.

There's also the issue with scale. Piefed/lemmy isn't large. What is the threshold to remove something? What happens when there's few reports on a racist post? How long does it get to stay up before enough time passes for it to accrue enough reports? Any such system would need to be scaled individually and automatically to the activity level of each community, which might be an issue in small comms. There are cases where non-marginalized people struggle to understand when something is marginalizing, so they defend it as free speech. What happens in these cases? Will there be enough minorities to remove it? I doubt it.

I'm sure there is some way to make some form of self-moderation, but it would need to be well thought out.

[-] OpenStars@piefed.social 1 points 6 hours ago

PieFed, unlike Lemmy, allows access to community-specific values, yes "karma" if you will. So if someone builds up a strong reputation and length of membership elsewhere, that will not help one iota within the specific community in question, if the mod chooses those settings (disclosure that I've only read of these but have no direct mod experience on a PieFed instance).

Also, at least at the instance level but it would probably be helpful to extend this model to a community one as well, votes can be differentially weighted from "trusted" instances, let's say those not known as spreaders of disinformation.

So someone could spin up 10 private instances and 10 accounts on each to attempt to influence the vote counts, and since Lemmy only allows "upvote" vs. "downvote", Lemmy will be susceptible to this kind of malicious interference, but PieFed offers multiple methods to limit and attempt to minimize this kind of behavior. e.g. each of those 100 alt accounts would need to be considered helpful members of the community and he upvoted often in order to karma farm sufficiently in order to then be able to influence voting patterns. Though let's face it, if someone is willing to go to all that amount of trouble, could they really be kept at bay by any automated - or even entirely manual - system? Generally the best that can be done is to raise the level of effort required so as to not be worth the reward, and PieFed certainly does that! (While Lemmy does little to nothing, at least directly although some instance admins have their own approaches, using a mixture of automated help and manual decision-making.)

[-] CyberSage@piefed.social 1 points 1 day ago

I appreciate your insights, but I see many issues raised without clear suggestions for how to enhance the moderation system effectively.

[-] Skavau@piefed.social 1 points 1 day ago

Well are you against the idea that an individual or a few people, whether or not they gain the position democratically or on a first-come-first server basis are allowed to moderate a community as they see fit?

this post was submitted on 03 Oct 2025
22 points (100.0% liked)

PieFed Meta

1658 readers
41 users here now

Discuss PieFed project direction, provide feedback, ask questions, suggest improvements, and engage in conversations related to the platform organization, policies, features, and community dynamics.

Wiki

founded 2 years ago
MODERATORS