"because it's supposedly "impossible" for the company to train its artificial intelligence models — and continue growing its multi-billion-dollar-business — without them."
Oh no! The poor rich can't get richer fast enough :(
So I got a crazy idea - hear me out - how about we just abolish copyright completely, for everyone?
I mean, it works in China pretty well.
https://en.wikipedia.org/wiki/Intellectual_property_in_China
Looks like there are still copyright laws in China. What are you on about?
It's impossible for me to make money without robbing a bank, so please let me do that, parliament, it would be so funny.
Oh no. Anyway...
Copyright is a pain in the ass, but Sam Altman is a bigger pain in the ass. Send him to prison and let him rot. Then put his tears in a cup and I'll drink them.
What irks me most about this claim from OpenAI and others in the AI industry is that it's not based on any real evidence. Nobody has tested the counterfactual approach OpenAI claims wouldn't work, yet the experiments that came closest (the first StarCoder LLM and the CommonCanvas text-to-image model) suggest that, in fact, it would have been possible to produce something very nearly as useful, and in some ways better, with a more restrained training data curation approach than scraping outbound Reddit links.
All that aside, copyright clearly isn't the right framework for understanding why what OpenAI does bothers people so much. It's really about "data dignity", which is a relatively new moral principle not yet protected by any single law. Most people feel that they should have control over what data is gathered about their activities online, as well as what is done with that data after it's been collected. Even if they publish or post something under a Creative Commons license that permits derived uses of their work, they'll still get upset if it's used as an input to machine learning. This is true even if the resulting generative models are not created for commercial reasons, but only for personal or educational purposes that clearly constitute fair use. I'm not saying that OpenAI's use of copyrighted work is fair; I'm just saying that even in cases where the use is clearly fair, there's still a perceived moral injury, so I don't think it's wise to lean too heavily on copyright law if we want to find a path forward that feels just.
The internet has been primarily derivative content for a long time, as much as some haven't wanted to admit it. These fancy algorithms now take that to an exponential degree.
Original content had already become a rare sight as monetization ramped up. And then this generation of AI algorithms arrived.
In the several years prior to LLMs becoming a thing, the internet was basically just regurgitating data from API calls, or scraping someone else's content and re-presenting it in your own way.
We can't make money paying for "AI", going to theaters, or paying for streaming services.
So I guess everybody gets a piracy!
Aww poor shit company and their poor money problems.
well fuck you Sam Altman
What kind of a pathetic statement is that?
I feel we need a term for "copyright bros".
The more important point is that social media companies can claim to OWN all the content needed to train AI. Same for image sites. That means they get to own the AI models. That means the models will never be free. Which means they control the "means of generation". That means that forever and ever and ever most human labour will be worth nothing while we can't even legally use this power. Double fucked.
YOU, the user/product, will not gain anything from this copyright strong-arming.
And to the argument itself: just because AI is better at learning from existing works, faster, more complete, with better memory, doesn't mean that it's fundamentally different from humans learning from artwork. Almost EVERY artist arguing for this is "stealing" themselves, since they learned from and were inspired by existing works.
But I guess the worst possible outcome is inevitable now.
As written, the headline is pretty bad, but it seems their argument is that they should be able to train on publicly available copyrighted information, like blog posts and social media, and not on private copyrighted information like movies or books.
You can certainly argue that "downloading public copyrighted information for the purposes of model training" should be treated differently from "downloading public copyrighted information for the intended use of the copyright holder", but it feels disingenuous to put this comment itself, to which someone holds a copyright, into the same category as something not shared publicly, like a paid article or a book.
Personally, I think it's a lot like search engines. If you make something public, someone can analyze it, link to it, or take other derivative actions, but they can't copy it and share the copy with others.
Then go out of business.
Literally, "fuck you go die" situation.