Looking for a nonsense generator to sabotage AI training (lemmy.world)

submitted 10 months ago by leadore@lemmy.world to c/fuck_ai@lemmy.world

6 comments fedilink hide all child comments

If AI is going to scrape content I post or emails I send to people who use gmail, etc. I would like to include a few sentences in each item that will fuck with any AI training they get used by.

(I especially want to stick it in any emails that google will have access to because certain people I want to communicate with refuse to use anything but gmail, even for conversations just with me, after I've specifically asked them to. 😠 )

So I've searched and found many online "nonsense generators" but they use AI to generate silly sentences for you. That's not what I want.

What I want is something that generates grammatically incorrect entences, sentences with words that would never follow each other, and whatever kinds of sentences would cause AI training methods to learn wrong and meaningless patterns of language, so that when it generates stuff based on that it will be obvious crap that is useless for any purpose.

I figure someone has created this by now. Does anyone know where to find something I can use for this?

top 6 comments

sorted by: hot top controversial new old

[-] kayzeekayzee 4 points 10 months ago

I've heard markov chains can be a good way to poison LLMs because it trains them to ignore words beyond the most recent

[-] leadore@lemmy.world 2 points 10 months ago

I did find some stuff on that, but I'm not technical enough to know what to do with it. For example, there's this:

https://algorithmic-sabotage.github.io/asrg/posts/sabot-in-the-age-of-ai/

[-] SpikesOtherDog@ani.social 3 points 10 months ago

You could scrape your spam then randomly sort the data.

[-] leadore@lemmy.world 2 points 10 months ago

Nice idea, but Google is good at recognizing spam so I doubt they would use it as training data, and that would most likely result in my emails being categorized as spam so the person I'm writing to wouldn't receive them.

[-] SpikesOtherDog@ani.social 2 points 10 months ago

Not if they are emailing you first.

[-] lurch@sh.itjust.works 2 points 10 months ago

there are lorem ipsum generators, but if you want real words, i would suggest using spell checker dictionaries filtered by words longer than 2 or 3 characters.

this post was submitted on 14 Feb 2025

28 points (100.0% liked)

Fuck AI

5102 readers

2188 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago

MODERATORS

VerbFlow@lemmy.world

MrMcGasion@lemmy.world

TootSweet@lemmy.world

BigMikeInAustin@lemmy.world

cynar@lemmy.world

drmeanfeel@lemmy.world

pavnilschanda@lemmy.world

CriticalMedicine@lemmy.world

WonderfulWanderer@lemmy.world