1052
top 50 comments
sorted by: hot top controversial new old
[-] IchNichtenLichten@lemmy.world 129 points 7 months ago

A LLM that behaves like a typical Redditor?

What possible use is that?

[-] SonnyVabitch@lemmy.world 70 points 7 months ago

Air Canada offering a refund of tree fiddy.

[-] IchNichtenLichten@lemmy.world 22 points 7 months ago

You'll get your refund eventually but first it will try and gaslight you that Air Canada is a woke mind virus before calling you an asshole and then stalking you.

load more comments (1 replies)
load more comments (2 replies)
[-] honey_im_meat_grinding 30 points 7 months ago

What possible use is that?

I've noticed "has this sub gotten more right wing recently?" posts reaching the top post of the day in the last 6 months or so. r/norge and r/unitedkingdom being examples. You can automate bots that change a subreddit's consensus on certain topics by bot-spamming threads pertaining to those topics, especially in the first hour of a thread going up. I don't know if that's happening, or if it has more to do with the Reddit protest that saw mods abdicate their positions last June and new mods being responsible for the change... but it could also be a bit of both.

load more comments (1 replies)
load more comments (6 replies)
[-] garibaldi_biscuit@lemmy.world 113 points 7 months ago

This is what the 3rd party access to API was really all about.

When API access was allowed , all reddit content was effectively free: They needed to ban 3rd party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.

[-] bier@feddit.nl 15 points 7 months ago

Is it? Because when you build a bot and just scrape Reddit I don't think you can just use the content to train AI, just like the New York Times. The API change was definitely to sell more ads and get a higher IPO, but I don't think it was because of AI.

load more comments (2 replies)
[-] tigerjerusalem@lemmy.world 110 points 7 months ago* (last edited 7 months ago)

Reddit is a trove of user built content under the guise of community. What Spez did was to say "thanks for all the free work, suckers!", put a price sticker on it, and laughed all the way to the bank.

~~And this is why I'm not active on any Internet community anymore.~~ Nevermind, I guess I just can't help myself...

[-] Adulated_Aspersion@lemmy.world 23 points 7 months ago

And that is another unintended example of why all of my post history was purged before migration.

load more comments (6 replies)
load more comments (2 replies)
[-] Verserk@lemmy.dbzer0.com 81 points 7 months ago

Considering some of the very wrong and upvoted domain specific knowledge I've seen on Reddit over the years I'm not sure the training data is going to be useful for much beyond what every other model can do.

[-] aStonedSanta@lemm.ee 16 points 7 months ago

lol subreddits with troll names like trees vs marijuana enthusiasts. Good fun. John cena has one also but can’t recall which subreddit is actually about John cena though.

load more comments (1 replies)
load more comments (2 replies)
[-] Voyajer@lemmy.world 62 points 7 months ago

This is why I don't blame anyone for editing/deleting their post history on reddit.

[-] FaceDeer@kbin.social 18 points 7 months ago

I do. It's frankly selfish. Having an AI get training on my old comments costs me nothing and it results in the development of useful AI tools. Trying to sabotage that is petty and pointless. It's not like you could somehow collect the fraction of a pittance that you think you're owed retroactively. I never commented on Reddit thinking "awesome, I'm going to make bank on the content I'm generating here."

People complain about the capitalist mindset of the world and then they do this. Sigh.

[-] TORFdot0@lemmy.world 35 points 7 months ago

I had an 11 year old account that I deleted all my old comments and posts from because of the API debacle. Does that make me selfish that I felt like Reddit wasn’t holding up its end of the unwritten agreement?

Reddit doesn’t deserve my content anymore than I deserve access from the third party API.

load more comments (5 replies)
[-] Zellith@kbin.social 26 points 7 months ago

Selfish? Perhaps you forget why people deleted their content in the first place.

load more comments (1 replies)
[-] Voyajer@lemmy.world 20 points 7 months ago

It's their comment to do with as they see fit. I can't get mad at them for wanting to erase their presence on a site they don't use anymore.

load more comments (1 replies)
load more comments (11 replies)
[-] Strayce@lemmy.sdf.org 58 points 7 months ago

Considering how much of Reddit is already bots, I'm sure this will end fantastically.

load more comments (1 replies)
[-] KairuByte@lemmy.dbzer0.com 56 points 7 months ago
load more comments (2 replies)
[-] gedaliyah@lemmy.world 56 points 7 months ago

The AI:

"IANAL so could you ELI5, so AITA?

THIS."

[-] bigkahuna1986@lemmy.ml 26 points 7 months ago

Ann frankly, I did Nazi that coming.

[-] storcholus@feddit.de 22 points 7 months ago

Holy shit do I hate that comment

load more comments (1 replies)
load more comments (2 replies)
[-] ozoned@lemmy.world 44 points 7 months ago

"Reddit has given access to YOUR conversations and posts to AI companies.". FTFY

These were created by people, for peoole, and I will ALWAYS disagree that this data is Reddit's or any other platforms.

Don't forget your direct messages aren't end to end encrypted on Reddit, so now AI will be trained on your craziest "private" conversations

load more comments (5 replies)
[-] NutWrench@lemmy.world 43 points 7 months ago

Reddit is all bots, porn, ads and political shit posts. Good luck getting any useful training content out of that.

[-] ladicius@lemmy.world 21 points 7 months ago

Maybe that's the point? Training the AI to produce the blabbering bullshit that's preferred in social media?

load more comments (6 replies)
[-] etrotta@kbin.social 41 points 7 months ago

Out of all things to hate Reddit for, giving data to AI isn't something fediverse users can really criticize it for, though making money from it perhaps.
Remember: All data in federated platforms is available for free and likely already being compiled into datasets. Don't be surprised if this post and its comments end up in GPT5 or 6 training data.

[-] treadful@lemmy.zip 17 points 7 months ago

The problem isn't that AI is being trained on the data. The problem is that they locked down all third party data access so they could monetize our content. On a federated platform, everyone gets equal access and can do whatever they want with it.

We sure can criticize them for that.

load more comments (4 replies)
[-] Bobmighty@lemmy.world 32 points 7 months ago

With reddits severe bot problem, it'll be like training on unfiltered sewage. Garbage in, garbage out.

load more comments (1 replies)
[-] SVcross@lemmy.world 28 points 7 months ago

Damn it. I haven't deleted my account due to how many people I've supported and helped, I stopped using it while ago. It seems I'll have to.

[-] HowManyNimons@lemmy.world 18 points 7 months ago

I wouldn't bother. They'll just mark all your stuff DELETED=1 and feed it to their AI anyway.

load more comments (1 replies)
load more comments (3 replies)
[-] Yokozuna@lemmy.world 28 points 7 months ago

Good thing I scrubbed all of my posts and comments that I could. Fuck that site, straight up and down.

[-] ItsAFake@lemmus.org 21 points 7 months ago

You really think they don't have your original comments stored?

[-] EdibleFriend@lemmy.world 28 points 7 months ago

It's literally been proven that they do. A guy here on Lemmy was a very common poster on some tech support subreddit. He used one of those account scrubbers and deleted his account. He went back to look a few weeks later and all his comments were back.

load more comments (2 replies)
load more comments (4 replies)
[-] asymmetric@lemmy.ca 27 points 7 months ago

One of the original Reddit memes was quite prescient:

https://i.imgur.com/Fza1Cut.jpg

[-] DudeImMacGyver@sh.itjust.works 26 points 7 months ago
[-] Fake4000@lemmy.world 40 points 7 months ago

You signed it all away the moment you scrolled down that EULA 😂

[-] admiralteal@kbin.social 36 points 7 months ago* (last edited 7 months ago)

Can't wait for the day a major court declares EULAs universally nonbinding outside of the most common-sense terms. Even though I doubt it will ever happen.

"We can store and display your content and use stuff you publicly post as examples in advertisements for our platform" is pretty common sense.

"We can use the things you post to do complex data analytics to package and sell your identity to advertisers" is fucking sus.

"We can use the things you post to train ANN generative systems to build next-generation technologies to impersonate you and your peers" is simply nuts.

The idea that displaying an EULA with an "agree" button is informed consent is just preposterous. Even lawyers don't read them.

load more comments (1 replies)
load more comments (1 replies)
load more comments (10 replies)
[-] erAck@discuss.tchncs.de 25 points 7 months ago

It will get trained on some comment posts.

Let reddit die. Join Lemmy or /kbin. https://join-lemmy.org/ https://kbin.pub/

load more comments (11 replies)
[-] hansl@lemmy.world 23 points 7 months ago

In before poisoning your comments on Reddit turns into the new protest.

[-] 31337@sh.itjust.works 23 points 7 months ago

I wish there was a license for content like the GPL, that states if you use this content to train generative AI, the model must be open source. Not sure that would legally be enforceable though (due to fair-use).

[-] HowManyNimons@lemmy.world 22 points 7 months ago

Good. Maybe when it cogitates the things I've written it might start offering up some better ideas.

[-] aidan@lemmy.world 18 points 7 months ago

*laughs villainously* This is all going to plan, now there will be some chatbot spewing my insane beliefs

[-] DozensOfDonner@mander.xyz 16 points 7 months ago

Why does it sound like reddit trained AI will only get dumber.

[-] jol@discuss.tchncs.de 15 points 7 months ago

That would explain why GPT is often so confidently incorrect.

[-] BetaDoggo_@lemmy.world 15 points 7 months ago

Who's dumb enough to pay for that? Everyone else is just scraping it for free.

load more comments (3 replies)
[-] giddy@aussie.zone 15 points 7 months ago

Glad I nuked all my posts and comments and deleted my account last year

load more comments (3 replies)
[-] doingthestuff@lemmy.world 14 points 7 months ago

Good thing I had multiple bots overwrite my content before I deleted it all. Not that someone couldn't recover it, I'm not naive. But the AI bots should miss me.

load more comments (5 replies)
load more comments
view more: next ›
this post was submitted on 17 Feb 2024
1052 points (100.0% liked)

Technology

58160 readers
2773 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS