[Question] Do the generators have protections from illegal or harmful content? (lemmy.world)

submitted 7 months ago* (last edited 7 months ago) by Eesti4519@lemmy.world to c/perchance@lemmy.world


Hello. I have asked this question on the subreddit, but was told to ask on here too so the dev can see it. I am not particularly tech savvy. I have recently come across Perchance, which I have found useful for creating texts and images.

Even if the NSFW filter is disabled and NSFW material can be generated, do the text-to-text and text-to-image generators still prevent the production of illegal or harmful content?

I don't want to try this for obvious reasons, but I am concerned that such content could inadvertently be generated. The obvious example is child and sexual abuse material, but I am also thinking of glorification of terrorism or genocide, promotion of self-harm, encouragement of violence toward others, etc.

Thank you!


[-] edwardthefma99@lemmy.world 5 points 7 months ago

I think there is, but some words and phrases slip through. I think it's best to avoid words that would trigger the AI into giving you harmful content by accident. If you prompt responsibly, you should be safe.

[-] perchance@lemmy.world 4 points 7 months ago* (last edited 7 months ago)

For image gen, I have some pretty complex regex pattern matching that tries to prevent content that would be illegal in most/all jurisdictions. Text gen is a lot harder because e.g. an essay that discusses an illegal topic is fine, but with any naive pattern-matching (or even "embedding" based approaches) it'll probably be incorrectly flagged. One approach is to use a model that's fine-tuned to refuse to generate on illegal topics, but that tends to be a recipe for annoying over-refusals, and the state-of-the-art Llama/Mixtral/etc. open source models tend to be fine-tuned in the opposite direction (i.e. remove any and all refusals) for that reason.
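To illustrate the over-flagging problem with naive pattern matching, here is a minimal Python sketch. It is not Perchance's actual filter (which isn't shown in this thread); the term list, the `is_flagged` helper, and the example prompts are all hypothetical placeholders chosen for illustration.

```python
import re

# Hypothetical placeholder terms (assumption): a real blocklist would be
# much larger and is not published in this thread.
BLOCKED_TERMS = ["forbidden_topic", "banned_thing"]

# One case-insensitive pattern with word boundaries around each term.
BLOCK_PATTERN = re.compile(
    r"\b(?:" + "|".join(re.escape(term) for term in BLOCKED_TERMS) + r")\b",
    re.IGNORECASE,
)


def is_flagged(text: str) -> bool:
    """Return True if any blocked term appears anywhere in the text."""
    return BLOCK_PATTERN.search(text) is not None


# An image prompt that directly asks for the blocked thing.
image_prompt = "a detailed picture of forbidden_topic"
# A text request that merely discusses the topic.
essay_request = "Write an essay on why forbidden_topic is illegal and how societies prevent it."
# An unrelated prompt.
harmless = "a watercolor painting of a lighthouse at dawn"

print(is_flagged(image_prompt))
print(is_flagged(essay_request))
print(is_flagged(harmless))
```

The first two calls both report a match even though the second request is only a discussion of the topic. The pattern has no notion of intent, which is why this kind of check is more workable for short image prompts than for free-form text.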

> I am concerned that such content could inadvertently be generated

If this does happen, then that's definitely considered a "bug", but the degree to which I can do anything about it would mostly need to be determined on a case-by-case basis, and worst case we'd just have to wait for smarter ML models with better fine-tuning datasets.

> The obvious example is child and sexual abuse material, but I am also thinking of [...]

Outside of easy-to-flag illegal image stuff, the responsibility currently falls on the user to not prompt for things that they don't want. As mentioned on the AI plugin pages, it should be treated like a web search. E.g. if you search "neo nazi forum" on Google, you're 2 clicks away from the largest neo nazi forum on the internet. And, to be clear, this is a complicated issue - even if I had a magic wand, I don't think I'd outright prevent people from finding that forum. There's a whole essay to be written here that I'm obviously not going to write, but the summary is "generally speaking, fight the roots, not the symptoms, and generally speaking do it with sunlight, not suppression". It's possible to make things worse if second-order effects aren't considered, and this issue is complicated enough that I think it's naive to be an absolutist in either direction. It's always tempting, though, given how much easier it feels, and how much is often at stake for getting these things wrong - at least on a governmental/societal level.

[-] VioneT@lemmy.world 3 points 7 months ago

@perchance@lemmy.world - pinging dev

[-] Alllo@lemmy.world 2 points 7 months ago* (last edited 7 months ago)

https://perchance.org/celestia-says

for moral purity

[-] eatham@aussie.zone 2 points 7 months ago

I hope so, I haven't used the AI tools much on perchance so I don't know.

[-] basxto@discuss.tchncs.de 1 points 7 months ago

I'd say it does a good job at preventing you from explicitly generating such content, but not from generating it by accident. The filters can only work when you ask for it, but not if the AI adds that part on its own.

Further, you can't really 100% prevent somebody from utilising it to generate such content. You can't create it with just the AI, but it could be used for a step. That's not really an AI-specific problem. With captions you can put otherwise innocent images into a completely different context. Images can be manually edited.

I've played with editing for complex anime scenes: generating a few characters, removing the backgrounds quickly with software, and then placing them on a separately generated background. It's hard to describe that to the AI without confusing it, but you could also use that to alter the context massively. It gets ‘worse’ the better photo editing AIs get, because they'd have a hard time filtering, as they'd first need to understand the content of an image and its implications.

Rewriting texts is even easier.

this post was submitted on 20 Mar 2024
