922
submitted 5 days ago* (last edited 5 days ago) by dwazou@lemm.ee to c/technology@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] FaceDeer@fedia.io 2 points 5 days ago

A lot of the griping about AI training involves data that's been freely published. Stable Diffusion, for example, trained on public images available on the internet for anyone to view, but led to all manner of ill-informed public outrage. LLMs train on public forums and news sites. But people have this notion that copyright gives them some kind of absolute control over the stuff they "own" and they suddenly see a way to demand a pound of flesh for what they previously posted in public. It's just not so.

I have the right to analyze what I see. I strongly oppose any move to restrict that right.

[-] kittenzrulz123 12 points 5 days ago

Publically available =/= freely published

Many images are made and published with anti AI licenses or are otherwise licensed in a way that requires attribution for derivative works.

[-] FaceDeer@fedia.io 2 points 5 days ago

The problem with those things is that the viewer doesn't need that license in order to analyze them. They can just refuse the license. Licenses don't automatically apply, you have to accept them. And since they're contracts they need to offer consideration, not just place restrictions.

An AI model is not a derivative work, it doesn't include any identifiable pieces of the training data.

it doesn't include any identifiable pieces of the training data.

It does. For example, Harry Potter books can be easily identified.

It's also pretty clear they used a lot of books and other material they didn't pay for, and obtained via illegal downloads. The practice of which I'm fine with, I just want it legalised for everyone.

[-] ferrule@sh.itjust.works 3 points 4 days ago

I'm wondering when i go to the library and read a book, does this mean i can never become an author as I'm tainted? Or am I only tainted if I stole the book?

To me this is only a theft case.

That's the whole problem with AI and artists complaining about theft. You can't draw a meaningful distinction between what people do and what the ai is doing.

[-] ferrule@sh.itjust.works 2 points 4 days ago

i think that is a very important observation. people want to gloss over that when it might be the most important thing to talk about.

[-] ILoveUnions@lemmy.world 5 points 5 days ago

And what of the massive amount of content paywalled that ai still used to train?

[-] FaceDeer@fedia.io 2 points 5 days ago

If it's paywalled how did they access it?

[-] ILoveUnions@lemmy.world 4 points 5 days ago

You are dull. Very dull. There is no shortage of ways to pirate content on the internet, including torrents. And they wasted no time doing so

this post was submitted on 10 May 2025
922 points (100.0% liked)

Technology

70031 readers
3646 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS