Google search is over
(mastodon.social)
AI is its own worst enemy. If you can't identify AI output, AIs are going to end up training on AI-generated content, which really hurts the model.
It's literally in everyone's best interest, including AI's own, to start building some kind of identification into all output.
Those studies are flawed. By definition, once you can no longer tell the difference, the effect on training is nil.
It's more like successive generations of inbreeding. Unless you have perfect AI content, perfect meaning exactly mirroring the diversity of human content, the drivel will amplify over time.
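For anyone who wants to see the amplification effect in miniature, here is a rough toy sketch, not a claim about how any real model is trained. It assumes the "model" is just a Gaussian fitted to a handful of samples, and that every generation is fitted only to samples drawn from the previous generation's fit. The function name `simulate_collapse` and its parameters are made up for illustration.

```python
import random
import statistics


def simulate_collapse(generations=200, sample_size=10, seed=0):
    """Toy model: each generation fits a mean and std to samples
    drawn only from the previous generation's fitted Gaussian."""
    rng = random.Random(seed)
    mean, std = 0.0, 1.0  # generation 0 stands in for "human" data
    history = [std]
    for _ in range(generations):
        # "Generate content" from the current model.
        samples = [rng.gauss(mean, std) for _ in range(sample_size)]
        # "Train" the next model on that content alone.
        mean = statistics.fmean(samples)
        std = statistics.pstdev(samples)  # maximum-likelihood estimate
        history.append(std)
    return history


if __name__ == "__main__":
    history = simulate_collapse()
    for gen in (0, 10, 50, 100, 200):
        print(f"generation {gen:3d}: std = {history[gen]:.3g}")
```

In this toy setup the fitted spread tends to drift toward zero, because each refit slightly underestimates the variance and the errors compound. Mixing fresh generation-0 data back in at each step would be the obvious thing to try next.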
Given the Chinchilla scaling laws, nobody in their right mind trains models by shotgun-ingesting all available data anymore. At this point, gains come from data quality more than from volume.