402
submitted 2 months ago* (last edited 2 months ago) by geneva_convenience@lemmy.ml to c/fediverse@lemmy.ml

Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

you are viewing a single comment's thread
view the rest of the comments
[-] codexarcanum@lemmy.dbzer0.com 7 points 2 months ago

Hmmm... I don't see dbzer0 in the list, I wonder how we escaped? I think we're like the 3rd or 4th biggest instance, and positive leaning on AI. Maybe @db0@lemmy.dbzer0.com just has amazing sys admin skills?

[-] db0@lemmy.dbzer0.com 10 points 2 months ago

We do block several genai scrapers, so that could be related yes.

[-] tpyo@lemmy.world 3 points 2 months ago

Hey, I want to thank you for all you do. I'm not currently on db0 but I started there. I also recognized it when I was making the transition over to Lemmy. I see you pop up in comments all over and I'm taking this opportunity, because it's relevant, to let you know I appreciate your presence on here and your advocacy for the freedoms of information and communication

[-] db0@lemmy.dbzer0.com 3 points 2 months ago

Cheers mate. I appreciate it

[-] poVoq@slrpnk.net 8 points 2 months ago

Maybe they don't want to ingest AI generated content to prevent model decay and thus remove sites that promote AI use?

[-] LustyArgonianMana@lemmy.world 2 points 2 months ago

Maybe they just have enough incel misogyny already

this post was submitted on 08 Aug 2025
402 points (100.0% liked)

Fediverse

22088 readers
1 users here now

A community dedicated to fediverse news and discussion.

Fediverse is a portmanteau of "federation" and "universe".

Getting started on Fediverse;

founded 5 years ago
MODERATORS