view the rest of the comments
News and Discussions about Reddit
Welcome to !reddit. This is a community for all news and discussions about Reddit.
The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:
Rules
Rule 1- No brigading.
**You may not encourage brigading any communities or subreddits in any way. **
YSKs are about self-improvement on how to do things.
Rule 2- No illegal or NSFW or gore content.
**No illegal or NSFW or gore content. **
Rule 3- Do not seek mental, medical and professional help here.
Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.
Rule 4- No self promotion or upvote-farming of any kind.
That's it.
Rule 5- No baiting or sealioning or promoting an agenda.
Posts and comments which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.
Rule 6- Regarding META posts.
Provided it is about the community itself, you may post non-Reddit posts using the [META] tag on your post title.
Rule 7- You can't harass or disturb other members.
If you vocally harass or discriminate against any individual member, you will be removed.
Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.
Rule 8- All comments should try to stay relevant to their parent content.
Rule 9- Reposts from other platforms are not allowed.
Let everyone have their own content.
:::spoiler Rule 10- Majority of bots aren't allowed to participate here.
I'm telling myself it was terf and troll sock puppet accounts.
I'm very keenly waiting for captcha to be fixed, I hope most instances decide to make you fill out a captcha for EVERY post. That and paid instances will make a huge difference compared to Reddit. It will become way harder to spam and astroturf discussions, but I'm not sure how to handle legitimate bots.
AI can solve captcha easily, captcha only causes inconvenience for authentic users, it does nothing to prevent bots
That's provably false across many social media sites. No anti-bot solution will ever be perfect and it will always be a cat and mouse game. Captchas have a measurable effect on limiting registrations and comments from bots.
We don't say "deadbolt locks only cause inconvenience for homeowners, they do nothing to stop burglars breaking a window". We defense in depth. We use the deadbolt as one part of the security/defense plan.
Captcha is one part of the many actions and systems that would make up effective protections.
Magic happens when you only require captchas that a language model told you were inciting, hateful, or plain troll feeding. It even makes sense to make part of the score thread-global, as in "Someone already made a Hitler comparison, better throttle this thing". The worse the score, the more often claim that the user failed to solve the captcha.
Accusations of censorship will fall flat because you don't prevent anyone from posting, troll feeders won't bother posting because they don't care enough to bother, trolls get bored, trolls leave.
I've heard of PageRank based solutions, what sort of ai models should we be looking at?
As simple as SpamAssassin, that is a naive Bayes classifier, to running a full-fledged LLM which is very likely complete overkill. It really doesn't need to be particularly sophisticated as being inaccurate isn't really a problem, the whole scheme relies on the statistical impact it has on the whole forum, not the impact it has on a single post.
PageRank really only applies to analysing links.
Yeah I know we have an automated service at my work that automatically solves the captcha off some government site and then scrapes some data off of it every day (it's public data). The sucess rate is near 100% I believe.