887
top 50 comments
sorted by: hot top controversial new old
[-] RustyShackleford@literature.cafe 286 points 1 month ago
[-] thisbenzingring@lemmy.sdf.org 82 points 1 month ago

lol i think that might be the worst/best thing I have seen in a long time

load more comments (10 replies)
[-] Lawnman23@lemmy.world 27 points 1 month ago
[-] YiddishMcSquidish@lemmy.today 25 points 1 month ago

Cuck boy getting pegged by post top op Garfield is definitely not something I had jotted down in my day-at-a-glance.

load more comments (1 replies)
[-] finix_the_psyker@sopuli.xyz 11 points 1 month ago

What a terrible day to have eyes.

[-] tal@lemmy.today 237 points 1 month ago

Given that the Internet Archive is the de facto standard way to cite material as seen on a given date


they're a trustworthy party that will probably persist for a long time


that's going to make it harder to cite content on Reddit.

[-] Deceptichum@quokk.au 13 points 1 month ago

Damn, guess if you want reddit data to train your AI that you’ll need to pay Spez for access.

load more comments (4 replies)
[-] conorab@lemmy.conorab.com 199 points 1 month ago

As somebody who often ends up using Reddit like Stackoverflow and in some cases needing the Internet Archive (IA) to find the original post after it’s been deleted or garbled, I think this is a wakeup call for those go to Reddit both to get technical help and to post it. More than ever, Reddit is becoming an unreliable place to find answers for old obscure issues and if they are going to lockout places like the IA then I think it’s time people stopped contributing their solutions to Reddit.

[-] cashsky@sh.itjust.works 81 points 1 month ago

Searching anywhere in general is getting shittier and shittier by day. Web searches are riddled with hallucinated AI generated garbage pages. Finding the right answer for difficult problems is getting worse and worse. We are sliding rapidly into Idiocracy.

[-] dizzy@lemmy.ml 19 points 1 month ago

Not to mention so many projects putting their support in walled garden chat services like Discord that you can’t even search via search engine. Even if you can figure out who asked the right question and when, you have to trawl through a sea of inane garbled chat to get to the developer/expert response.

Specialised topic forums really need to make a resurgence but I doubt they will.

load more comments (1 replies)
load more comments (5 replies)
[-] mojofrododojo@lemmy.world 31 points 1 month ago

yup. continuing to feed them traffic after their repeated attacks on the userbase is just sad. stop using them. yeah it sucks the info is gone, but acting like they'll wake up and change is absurd.

[-] NauticalNoodle@lemmy.ml 16 points 1 month ago

When I joined Lemmy I decided it was unwise to trust anything on Reddit less than a year old. Now it's anything under two years old.

load more comments (4 replies)
[-] Blackmist@feddit.uk 79 points 1 month ago

It’s another move to protect against AI scraping that isn't paying them for access.

load more comments (2 replies)
[-] Keyboard@lemmy.world 74 points 1 month ago

I already gave up from Reddit long time ago. Deleted all

[-] Truscape 18 points 1 month ago

When RIF died, Voyager became the new forum app for me.

[-] boonhet@sopuli.xyz 13 points 1 month ago

Apollo and Voyager for me so I straight-up retained the same UI.

load more comments (3 replies)
load more comments (2 replies)
[-] jjlinux@lemmy.zip 14 points 1 month ago
load more comments (1 replies)

As long as the previous collections of archives are still intact. We probably don’t need all of their new spam posts in the wayback machine anyway

[-] hamFoilHat@lemmy.world 12 points 1 month ago

It is my understanding that if you block the wayback machine from indexing your site it will also delist the history as well.

[-] Jason2357@lemmy.ca 46 points 1 month ago

They do archive sites against the owners wishes when they consider it an important site for public archiving, like some news sites. They are in no obligation to delete the archives and hope they don’t.

[-] tal@lemmy.today 12 points 1 month ago* (last edited 1 month ago)

Parties have archived the data from pushshift, which cover a lot of Reddit history.

kagis

https://academictorrents.com/details/1614740ac8c94505e4ecb9d88be8bed7b6afddd4

Subreddit comments/submissions 2005-06 to 2024-12

This is the top 40,000 subreddits from reddit's history in separate files. You can use your torrent client to only download the subreddit's you're interested in.

I mean, that won't have the past half year or some low-traffic subreddits, but...

load more comments (1 replies)
load more comments (1 replies)
[-] DFX4509B_2@lemmy.org 56 points 1 month ago* (last edited 1 month ago)

Just more vindication for my ditching that trash heap of a platform. YT is probably going to be the next platform I ditch as they're going full Reddit now.

It's a matter of time before third-party YT front-ends start getting throttled or outright blocked like third-party Reddit front-ends.

load more comments (6 replies)
[-] Njos2SQEZtPVRhH@piefed.social 44 points 1 month ago* (last edited 1 month ago)

People who posted on Reddit ( speaking in the past tense, because who would continue to do so now that we have better things? ) never intended for it to be of limited access. Reddit was a publicly accessible place, and people shared their thoughts and comments on it because it was the frontpage of the internet, so the place of choice to share things with the world. That being scraped should not be a problem. But clearly Reddit didn't want to give you a platform to share your thoughts with the world, they wanted you to donate your thoughts and take it as their property so that they can capitalize on it.

load more comments (2 replies)
[-] bigbabybilly@lemmy.world 41 points 1 month ago

That place is becoming more and more of a shithole. Bots, Ads, trolls, garbage mods… deleted the app last month.

[-] JakenVeina@midwest.social 40 points 1 month ago

The company says that AI companies have scraped data from the Wayback Machine, so it’s going to limit what the Wayback Machine can access.

Yeah, wouldn't want those AI companies to get all that data for free. Gotta make 'em pay for it.

load more comments (1 replies)
[-] User79185@discuss.tchncs.de 39 points 1 month ago

This is huge blow to archivism, thanks to corporate greed and enshittification of reddit. Worst MBA filled POS.

[-] HexesofVexes@lemmy.world 37 points 1 month ago

Oh no, someone might not be paying them for their user generated content (!)

To be fair, it's probably best that history forgets this period of the web...

[-] MadMadBunny@lemmy.ca 32 points 1 month ago

Damn you Spez.

[-] ozoned@piefed.social 31 points 1 month ago

Good plan. Keep locking down your big tech platforms, and we'll all be over here letting folks know where they can find freedom.

[-] aquovie@lemmy.cafe 14 points 1 month ago

Careful. Lemmy is too small to draw the attention of sophisticated, persistent abuse. As a company, Reddit has struggled with revenue and we've all seen those struggles quite publicly. Lemmy instances with those same challenges would probably just fold and close up.

Federated networks give you freedom but the potential for abuse is proportional to that freedom while at the same time, federation is far more expensive taken as a whole.

load more comments (3 replies)
load more comments (9 replies)
[-] FalseTautology@lemmy.zip 30 points 1 month ago

I am new to Lemmy, is there a fuckreddit sub?

[-] morto@piefed.social 41 points 1 month ago

In a way, the entire lemmy community is the fuckreddit sub

[-] frongt@lemmy.zip 35 points 1 month ago

Why would you want to spend more time thinking about a dead site?

load more comments (1 replies)
[-] lka1988@lemmy.dbzer0.com 16 points 1 month ago

Yes.

Hi welcome to Lemmy, we hate reddit here.

load more comments (5 replies)
[-] sturmblast@lemmy.world 29 points 1 month ago

Fuck Reddit

[-] MedicPigBabySaver@lemmy.world 26 points 1 month ago

Fuck Reddit and Fuck Spez.

[-] phantomwise@lemmy.ml 25 points 1 month ago

Nice of them to protect their (users') content from AI scrapping. So that they can charge AI companies for it instead.

[-] muusemuuse@sh.itjust.works 11 points 1 month ago

They aren’t doing that. They are protecting content from being scraped for free. Reddit is perfectly happy to charge for AI access to user-generated content.

load more comments (2 replies)
[-] thisbenzingring@lemmy.sdf.org 22 points 1 month ago

fucking reddit...

[-] adespoton@lemmy.ca 18 points 1 month ago

OK, I stopped posting on Reddit but left my account and comments in place because I considered them part of the public record. If Reddit is taking that record private, it’s time for me to start removing my content from the platform.

Does anyone know if historical Reddit content will remain in IA? If not, I’m going to have to back up years of content somewhere else.

load more comments (7 replies)
[-] Peculiaris@lemmy.zip 17 points 1 month ago

In the lieu of an IPO u/spez has actively destroyed everything that made Reddit good! Gate keeping the API thinking it'll help with making some bigshot LLM some day lol

load more comments (3 replies)
[-] MonkderVierte@lemmy.zip 13 points 1 month ago* (last edited 1 month ago)

This company limited search crawlers to google, why are you surprised?

[-] MehBlah@lemmy.world 13 points 1 month ago

When reddit has mutated a few more times. They start erasing stuff themselves. It will be lost to time and that fills me with hope.

[-] Eh_I@lemmy.world 12 points 1 month ago
[-] NigelFrobisher@aussie.zone 11 points 1 month ago
[-] General_Effort@lemmy.world 17 points 1 month ago

Technologically no. Reddit sends out the data to 10s of millions of users as part of their normal operations. They need to try to block those who collect that data for the IA. Reddit has the very short end of the stick.

The problem is that evading such counter-measures may be criminal in the US. Obviously, EU laws are much harsher.

load more comments (7 replies)
[-] buddascrayon@lemmy.world 11 points 1 month ago* (last edited 1 month ago)

Not that reddit isn't hot garbage right now, and has been for a while actually, but there's a lot of people here who have glazed over the reason why reddit instituted this policy.

AI companies are scraping the Wayback Machine. This is something that should concern all of us.

load more comments (7 replies)
load more comments
view more: next ›
this post was submitted on 11 Aug 2025
887 points (100.0% liked)

Technology

75191 readers
2734 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS