It's surprisingly difficult for AI to create just a plain white image (www.bleepingcomputer.com)

submitted 5 months ago by BrikoX@lemmy.zip to c/technology@lemmy.zip

15 comments fedilink hide all child comments

Generative AI services like Midjourney and OpenAI's DALL-E can deliver the unimaginable when it comes to stunning artifacts produced from simple text prompts. Sketching complex art imagery may be AI's specialty, yet some of the simplest tasks are evidently what AI struggles with the most.

top 15 comments

sorted by: hot top controversial new old

[-] kakes@sh.itjust.works 40 points 5 months ago

It's actually surprisingly difficult to ask for an absence of anything, since the training data doesn't normally include what isn't in the image.

I think they call it the Giraffe Problem or something like that. If you ask an AI for "an image containing no giraffes," you'll end up with a bunch of giraffes. It's all about how the training data is tagged.

[-] groet@feddit.de 26 points 5 months ago

That works for humans too:

"DON'T THINK ABOUT GIRAFFES!"

[-] RedditWanderer@lemmy.world 7 points 5 months ago

Or in sports when you're like "dont throw/kick/shoot the ball at the exact thing you're focusing on" before doing exactly that

[-] Steve@startrek.website 5 points 5 months ago

Or dont drunk drive into the back of a fire truck

[-] usualsuspect191@lemmy.ca 4 points 5 months ago

Gets me every time

[-] msage@programming.dev 3 points 5 months ago

Easy solution: pick any other animal and think about them.

Don't think about pink elephants? Sure, here comes fluffy bunnies.

[-] SchmidtGenetics@lemmy.world 4 points 5 months ago

That’s why they have negative and positive prompts.

[-] kakes@sh.itjust.works 2 points 5 months ago

Definitely worth noting, yeah.

[-] SchmidtGenetics@lemmy.world 4 points 5 months ago

I seriously think a lot of this is just people not knowing how it works. It’s a knew tech, people need to figure out the quirks and nail down techniques.

Hell getting people to google basic things can be a lesson in futility sometimes, and they want to talk this down? Come on.

[-] kakes@sh.itjust.works 2 points 5 months ago

Absolutely. Hearing the way people interact with LLMs as if they are an all-knowing AGI is honestly a bit terrifying at times. Hopefully the longer these systems are in the mainstream, the better people get a sense of the boundaries.

[-] Anticorp@lemmy.world 1 points 5 months ago

That's where most of the hate towards AI comes from on this platform. People here are always talking about how stupid it is, how it's not real AI, and how it's just a fancy autocorrect. I'm convinced that's all user error. Anyone who has used AI correctly, and understands how to prompt it, has seen how remarkably powerful and useful it can be.

[-] ryven@lemmy.dbzer0.com 8 points 5 months ago* (last edited 5 months ago)

I don't know much about image generators, but it's not surprising to me at all that ChatGPT can't fulfill the request to respond with nothing. Your prompt gets converted into tokens, tokens get processed by the model, it outputs a resulting set of tokens, and those get converted into text. Expecting the token-outputting-machine not to output any tokens is going to lead to disappointment.

[-] vithigar@lemmy.ca 4 points 5 months ago

This isn't asking for nothing though. It's asking for a very specific uniform thing. A better analogy to text generation would be asking for an uninterrupted string of nothing but repeated uppercase A's.

[-] agressivelyPassive@feddit.de 4 points 5 months ago

That's a really good way to break chatgpt BTW. Just ask it to repeat something over and over again it will create very interesting results.

[-] ryven@lemmy.dbzer0.com 2 points 5 months ago

The "asking for nothing" experiment is from later in the article, where they try with ChatGPT.

this post was submitted on 31 Mar 2024

60 points (100.0% liked)

Technology

1191 readers

116 users here now

Which posts fit here?

Anything that is at least tangentially connected to the technology, social media platforms, informational technologies and tech policy.

Rules

1. English only

Title and associated content has to be in English.

2. Use original link

Post URL should be the original link to the article (even if paywalled) and archived copies left in the body. It allows avoiding duplicate posts when cross-posting.

3. Respectful communication

All communication has to be respectful of differing opinions, viewpoints, and experiences.

4. Inclusivity

Everyone is welcome here regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, education, socio-economic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.

5. Ad hominem attacks

Any kind of personal attacks are expressly forbidden. If you can't argue your position without attacking a person's character, you already lost the argument.

6. Off-topic tangents

Stay on topic. Keep it relevant.

7. Instance rules may apply

If something is not covered by community rules, but are against lemmy.zip instance rules, they will be enforced.

Companion communities

!globalnews@lemmy.zip
!interestingshare@lemmy.zip

Icon attribution | Banner attribution

founded 10 months ago

MODERATORS

BrikoX@lemmy.zip