Google scrambles to manually remove weird AI answers in search : technology

[-] FlihpFlorp@lemm.ee 213 points 2 years ago

I remember seeing a comment on here that said something along the lines of “for every dangerous or wrong response that goes public there’s probably 5, 10 or even 100 of those responses that only one person saw and may have treated as fact”

[-] lvxferre@mander.xyz 49 points 2 years ago

Fuck. I'm stealing this comment - it's brilliant.

[-] FlihpFlorp@lemm.ee 55 points 2 years ago

It’s why I stole it

[-] recursive_recursion@programming.dev 61 points 2 years ago

"The Internet." by Anthony Clark

[-] FlihpFlorp@lemm.ee 46 points 2 years ago

What a nice original comic you made

[-] davidgro@lemmy.world 28 points 2 years ago

... I've never seen that attributed before. Wow.

load more comments (2 replies)

load more comments (1 replies)

[-] Nobody@lemmy.world 135 points 2 years ago

Tech company creates best search engine —-> world domination —> becomes VC company in tech trench coat —-> destroy search engine to prop up bad investments in ~~artificial intelligence~~ advanced chatbots

[-] stellargmite@lemmy.world 60 points 2 years ago

Then Hire cheap human intelligence to correct the AIs hallucinatory trash, trained from actual human generated content in the first place which the original intended audience did understand the nuanced context and meaning of in the first place. Wow more like theyve shovelled a bucket of horse manure on the pizza as well as the glue. Added value to the advertisers. AI my arse. I think calling these things language models is being generous. More like energy and data hungry vomitrons.

[-] WhatAmLemmy@lemmy.world 21 points 2 years ago* (last edited 2 years ago)

Calling these things Artificial Intelligence should be a crime. It's false advertising! Intelligence requires critical thought. They possess zero critical thought. They're stochastic parrots, whose only skill is mimicking human language, and they can only mimic convincingly when fed billions of examples.

load more comments (3 replies)

[-] cm0002@lemmy.world 17 points 2 years ago

You either die a hero or live long enough to become the villain

load more comments (1 replies)

[-] iAvicenna@lemmy.world 134 points 2 years ago

"Many of the examples we’ve seen have been uncommon queries,"

Ah the good old "the problem is with the user not with our code" argument. The sign of a truly successful software maker.

[-] voluble@lemmy.world 51 points 2 years ago

"We don't understand. Why aren't people simply searching for Taylor Swift"

load more comments (1 replies)

[-] calabast@lemm.ee 13 points 2 years ago

I mean...I guess you could parahrase it that way. I took it more as "Look, you probably aren't going to run into any weird answers.". Which seems like a valid thing for them to try to convey.

(That being said, fuck AI, fuck Google, fuck reddit.)

[-] radicalautonomy@lemmy.world 30 points 2 years ago* (last edited 2 years ago)

"I'm feeling depressed" is not an uncommon query under capitalism run amok. "One Reddit user recommends jumping off the Golden Gate Bridge" is not just a weird answer, it is a wholly irresponsible one.

So, no, their response is not valid. It is entirely user-blaming in order to avoid culpability.

load more comments (3 replies)

load more comments (1 replies)

[-] lvxferre@mander.xyz 82 points 2 years ago* (last edited 2 years ago)

The reason why Google is doing this is simply PR. It is not to improve its service.

The underlying tech is likely Gemini, a large language model (LLM). LLMs handle chunks of words, not what those words convey; so they have no way to tell accurate info apart from inaccurate info, jokes, "technical truths" etc. As a result their output is often garbage.

You might manually prevent the LLM from outputting a certain piece of garbage, perhaps a thousand. But in the big picture it won't matter, because it's outputting a million different pieces of garbage, it's like trying to empty the ocean with a small bucket.

I'm not making the above up, look at the article - it's basically what Gary Marcus is saying, under different words.

And I'm almost certain that the decision makers at Google know this. However they want to compete with other tendrils of the GAFAM cancer for a turf called "generative models" (that includes tech like LLMs). And if their search gets wrecked in the process, who cares? That turf is safe anyway, as long as you can keep it up with enough PR.

Google continues to say that its AI Overview product largely outputs “high quality information” to users.

There's a three letters word that accurately describes what Google said here: lie.

[-] lemmyvore@feddit.nl 21 points 2 years ago

At some point no amount of PR will hide the fact search has become useless. They know this but they're getting desperate and will try anything.

I'm waiting for Yahoo to revive their link directory or for Mozilla to revive DMOZ. That will be the sign that shit level is officially chin-height.

[-] TheDude@kbin.social 76 points 2 years ago

Bummer. I like weird Al.

load more comments (3 replies)

[-] ano_ba_to@sopuli.xyz 72 points 2 years ago

Now, instead of debugging the code, you have to debug the data. Sounds worse.

load more comments (1 replies)

[-] SlopppyEngineer@lemmy.world 71 points 2 years ago

Correcting over a decade of Reddit shitposting in what, a few weeks? They're pretty ambitious.

[-] atrielienz@lemmy.world 26 points 2 years ago

This is perhaps the most ironic thing about the whole reddit data scraping thing and Spez selling out the user data of reddit to LLM'S. Like. We spent so much time posting nonsense. And then a bunch of people became mods to course correct subreddits where that nonsense could be potentially fatal. And then they got rid of those mods because they protested. And now it's bots on bots on bots posting nonsense. And they want their LLM'S trained on that nonsense because reasons.

load more comments (1 replies)

[-] Juice@midwest.social 64 points 2 years ago

Good, remove all the weird reddit answers, leaving only the "14 year old neo-nazi" reddit answers, "cop pretending to be a leftist" reddit answers, and "39 year old pedophile" reddit answers. This should fix the problem and restore google back to its defaults

load more comments (1 replies)

[-] maxenmajs@lemmy.world 58 points 2 years ago

Isn't the model fundamentally flawed if it can't appropriately present arbitrary results? It is operating at a scale where human workers cannot catch every concerning result before users see them.

The ethical thing to do would be to discontinue this failed experiment. The way it presents results is demonstrably unsafe. It will continue to present satire and shitposts as suggested actions.

load more comments (1 replies)

[-] originalucifer@moist.catsweat.com 56 points 2 years ago* (last edited 2 years ago)

kinda reads like 'Weird Al' answers.. like, yankovic seems like a nice guy and i like his music, but how many answers could he have?

load more comments (5 replies)

[+] xorollo@leminal.space 51 points 2 years ago

[deleted]

[-] Wappen@lemmy.world 42 points 2 years ago

"Select the URL that answers the question most appropriately"

load more comments (2 replies)

[-] TheReturnOfPEB@reddthat.com 43 points 2 years ago

I was at first wondering what google had done to piss off Weird Al. He seems so chill.

[-] TachyonTele@lemm.ee 13 points 2 years ago

First Madonna kills Weird Al, and now Google.
WILL THE NIGHTMARE EVER END

load more comments (3 replies)

load more comments (2 replies)

[-] flop_leash_973@lemmy.world 42 points 2 years ago

If you have to constantly manually intervene in what your automated solutions are doing, then it is probably not doing a very good job and it might be a good idea to go back to the drawing board.

load more comments (1 replies)

[+] SavedKriss@lemmy.world 38 points 2 years ago

[deleted]

[-] Fridgeratr@lemmy.dbzer0.com 18 points 2 years ago

Yes. But that's not ✨AI✨

load more comments (2 replies)

[-] jeffw@lemmy.world 15 points 2 years ago* (last edited 2 years ago)

That cant answer most questions though. For example, I hung a door recently and had some questions that it answered (mostly) accurately. An encyclopedia can’t tell me how to hang a door

load more comments (6 replies)

load more comments (1 replies)

[-] JdW@lemmy.world 36 points 2 years ago

If only there was a way to show the whole world in one simple example how Enshitification works.

Google execs: Hold my beer!

[-] wildcardology@lemmy.world 35 points 2 years ago

Here's an idea google, why not set it back like it was 10-15 years ago

[-] ilinamorato@lemmy.world 12 points 2 years ago

The problem is, the internet has adapted to the Google of a year ago, which means that setting Google search back to 2009 just means that every "SEO hacker" gets to have a field day to get spam to the top of results without any controls to prevent them.

Google built a search engine optimized for the early internet. Bad actors adapted, to siphon money out of Google traffic. Google adapted to stop them. Bad actors adapted. So began a cat-and-mouse game which ended with the pre-AI Google search we all know and hate today. Through their success, Google has destroyed the internet that was; and all that's left is whatever this is. No matter what happens next, Google search is toast.

load more comments (1 replies)

[-] Luisp@lemmy.dbzer0.com 32 points 2 years ago

They are going back a century, to the manual teleoperator. When all started, it's the circle of tech

[-] trollbearpig@lemmy.world 31 points 2 years ago* (last edited 2 years ago)

I looove how the people at Google are so dumb that they forgot that anything resembling real intelligence in ChatGPT is just cheap labor in Africa (Kenya if I remember correctly) picking good training data. So OpenAI, using an army of smart humans and lots of data built a computer program that sometimes looks smart hahaha.

But the dumbasses in Google really drank the cool aid hahaha. They really believed that LLMs are magically smart so they feed it reddit garbage unfiltered hahahaha. Just from a PR perspective it must be a nigthmare for them, I really can't understand what they were thinking here hahaha, is so pathetically dumb. Just goes to show that money can't buy intelligence I guess.

load more comments (2 replies)

[-] Deebster@programming.dev 25 points 2 years ago

[...] a lot of AI companies are “selling dreams” that this tech will go from 80 percent correct to 100 percent.

In fact, Marcus thinks that last 20 percent might be the hardest thing of all.

Yeah, it's well known, e.g. people say "the last 20% takes 80% of the effort". All the most tedious and difficult stuff gets postponed to the end, which is why so many side projects never get completed.

[-] scrion@lemmy.world 14 points 2 years ago

It's not just the difficult stuff, but often the mundane, e. g. stability, user friendliness, polish, scalability etc. that takes something from working in a constrained environment to an actual product - it's a chore to work on and a lot less "sexy", with never enough resources allocated to it: We have done all the difficult stuff already, how much more work can this be?

Turns out, a fucking lot.

load more comments (1 replies)

[-] gedaliyah@lemmy.world 20 points 2 years ago

At this point, it seems like google is just a platform to message a google employee to go google it for you.

[-] acetanilide@lemmy.world 11 points 2 years ago

Is that employee named Jeeves?

load more comments (1 replies)

[-] uebquauntbez@lemmy.world 16 points 2 years ago

'I'm sorry, Google, I'm afraid, I cant do that!'

[-] werefreeatlast@lemmy.world 14 points 2 years ago

Okay Google... I'm about to go to sleep but I must know something before I go.... If I could get the perfect penis to attract my perfect female counterpart, describe my penis, where my wife put it and how many pieces did she cut it to. Most importantly, will the scars make ribbed for her pleasure?

[-] jeffw@lemmy.world 16 points 2 years ago

0_0

Go to sleep

load more comments (1 replies)

[-] Bobmighty@lemmy.world 11 points 2 years ago

I once had a Christmas day post blow up and become top of the day from a stupid pic I uploaded. I wonder if some of those comments or a weird version of that pic will pop up. Anyone that had similar things happen should keep their eye out. Anything that blew up probably gets a bit more weight.

Oh God, cumbox! All of cumbox is in there. I wonder what kind of unrelated search could summon up that bit of fuzzy fun?

[-] Aceticon@lemmy.world 11 points 2 years ago

Probably one of the shitstains in Google's C-suite after having signed a "wonderful" contract to get access to "all that great data from Reddit" forced the Techies to use it against their better judgement and advice.

It would certainly match the kind of thing I've seen more than once were some MBA makes a costly decision with technical implications without consulting the actual techies first, then the thing turns out to be a massive mistake and to save themselves they just double up and force the techies to use it anyway.

That said, that's normally about some kind of tooling or framework from a 3rd party supplier that just makes life miserable for those forced to use it or simply doesn't solve the problem and techies have to quietly use what they wanted to use all along and then make believe they're using the useless "sollution" that cost lots of $$$ in yearly licensing fees, and stuff like this that ends up directly and painfully torpedoing at the customer-facing end the strategical direction the company is betting on for the next decade, is pretty unusual.

load more comments (1 replies)

[-] randon31415@lemmy.world 10 points 2 years ago

Is Google not using a RAG (Retrieval Augmented Generation) system with their llms?

[-] powerofm@lemmy.ca 22 points 2 years ago

They are, but they're retrieving Reddit comments, often just straight shitposts

[-] dexa_scantron@lemmy.world 12 points 2 years ago

And Onion articles, and other satire.

load more comments (1 replies)

Technology

Our Rules

Approved Bots