514

We have to stop ignoring AI’s hallucination problem (www.theverge.com)

submitted 1 year ago by misk@sopuli.xyz to c/technology@lemmy.world

185 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] ALostInquirer@lemm.ee 39 points 1 year ago

Why do tech journalists keep using the businesses' language about AI, such as "hallucination", instead of glitching/bugging/breaking?

[-] superminerJG@lemmy.world 41 points 1 year ago

hallucination refers to a specific bug (AI confidently BSing) rather than all bugs as a whole

[-] Blackmist@feddit.uk 16 points 1 year ago

Honestly, it's the most human you'll ever see it act.

It's got upper management written all over it.

[-] ALostInquirer@lemm.ee 5 points 1 year ago* (last edited 1 year ago)

(AI confidently BSing)

Isn't it more accurate to say it's outputting incorrect information from a poorly processed prompt/query?

[-] vithigar@lemmy.ca 30 points 1 year ago

No, because it's not poorly processing anything. It's not even really a bug. It's doing exactly what it's supposed to do, spit out words in the "shape" of an appropriate response to whatever was just said

[-] ALostInquirer@lemm.ee 3 points 1 year ago* (last edited 1 year ago)

When I wrote "processing", I meant it in the sense of getting to that "shape" of an appropriate response you describe. If I'd meant this in a conscious sense I would have written, "poorly understood prompt/query", for what it's worth, but I see where you were coming from.

[-] Danksy@lemmy.world 36 points 1 year ago

It's not a bug, it's a natural consequence of the methodology. A language model won't always be correct when it doesn't know what it is saying.

[-] vrighter@discuss.tchncs.de 13 points 1 year ago

it never knows what it's saying

[-] TheDarksteel94@sopuli.xyz 2 points 1 year ago

Oh, at some point it will lol

[-] Danksy@lemmy.world 2 points 1 year ago

That was what I was trying to say, I can see that the wording is ambiguous.

[-] ALostInquirer@lemm.ee 9 points 1 year ago

Yeah, on further thought and as I mention in other replies, my thoughts on this are shifting toward the real bug of this being how it's marketed in many cases (as a digital assistant/research aid) and in turn used, or attempted to be used (as it's marketed).

[-] Danksy@lemmy.world 8 points 1 year ago

I agree, it's a massive issue. It's a very complex topic that most people have no way of understanding. It is superb at generating text, and that makes it look smarter than it actually is, which is really dangerous. I think the creators of these models have a responsibility to communicate what these models can and can't do, but unfortunately that is not profitable.

[-] machinin@lemmy.world 20 points 1 year ago

https://en.m.wikipedia.org/wiki/Hallucination_(artificial_intelligence)

The term "hallucinations" originally came from computer researchers working with image producing AI systems. I think you might be hallucinating yourself 😉

[-] ALostInquirer@lemm.ee 1 points 1 year ago

Fun part is, that article cites a paper mentioning misgivings with the terminology: AI Hallucinations: A Misnomer Worth Clarifying. So at the very least I'm not alone on this.

[-] xthexder@l.sw0.com 20 points 1 year ago

Because hallucinations pretty much exactly describes what's happening? All of your suggested terms are less descriptive of what the issue is.

The definition of hallucination:

A hallucination is a perception in the absence of an external stimulus.

In the case of generative AI, it's generating output that doesn't match it's training data "stimulus". Or in other words, false statements, or "facts" that don't exist in reality.

[-] ALostInquirer@lemm.ee 5 points 1 year ago

perception

This is the problem I take with this, there's no perception in this software. It's faulty, misapplied software when one tries to employ it for generating reliable, factual summaries and responses.

[-] xthexder@l.sw0.com 1 points 1 year ago* (last edited 1 year ago)

I have adopted the philosophy that human brains might not be as special as we've thought, and that the untrained behavior emerging from LLMs and image generators is so similar to human behaviors that I can't help but think of it as an underdeveloped and handicapped mind.

I hypothesis that a human brain, who's only perception of the world is the training data force fed to it by a computer, would have all the same problems the LLMs do right now.

To put it another way... The line that determines what is sentient and not is getting blurrier and blurrier. LLMs have surpassed the Turing test a few years ago. We're simulating the level of intelligence of a small animal today.

[-] blazeknave@lemmy.world 8 points 1 year ago

Ty. As soon as I saw the headline, I knew I wouldn't be finding value in the article.

[-] ALostInquirer@lemm.ee 7 points 1 year ago

It's not a bad article, honestly, I'm just tired of journalists and academics echoing the language of businesses and their marketing. "Hallucinations" aren't accurate for this form of AI. These are sophisticated generative text tools, and in my opinion lack any qualities that justify all this fluff terminology personifying them.

Also frankly, I think students have one of the better applications for large-language model AIs than many adults, even those trying to deploy them. Students are using them to do their homework, to generate their papers, exactly one of the basic points of them. Too many adults are acting like these tools should be used in their present form as research aids, but the entire generative basis of them undermines their reliability for this. It's trying to use the wrong tool for the job.

You don't want any of the generative capacities of a large-language model AI for research help, you'd instead want whatever text-processing it may be able to do to assemble and provide accurate output.

this post was submitted on 17 May 2024

514 points (100.0% liked)

Technology

75597 readers

1782 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws