1037

submitted 2 years ago by Gork@lemm.ee to c/196

87 comments

[-] Daxtron2@startrek.website 6 points 2 years ago

Wow another repost of incorrectly prompting an LLM to produce garbage output. What great content!

[-] Umbrias@beehaw.org 10 points 2 years ago

This is genuinely great content for demonstrating that AI search engines and chatbots are not at a point where you can trust them implicitly, though many people do.

[-] Daxtron2@startrek.website 1 points 2 years ago

Which is exactly why every LLM explicitly says this before you start.

[-] Umbrias@beehaw.org 12 points 2 years ago

"Why, we aren't at fault people are using the tool we are selling for the thing we marketed it for, we put a disclaimer!"

[-] Daxtron2@startrek.website 1 points 2 years ago

You've seen marketing for the big LLMs that's marketing them as search engines?

[-] Umbrias@beehaw.org 3 points 2 years ago

https://www.microsoft.com/en-us/edge/features/the-new-bing?form=MA13FJ

It would have been easier for you to just search it up yourself, you know.

[-] Daxtron2@startrek.website 1 points 2 years ago

Bing w/ LLM summarization of results is not an LLM being used as a search engine

[-] Umbrias@beehaw.org 3 points 2 years ago

Lol.

[-] ProgrammingSocks@pawb.social 1 points 1 year ago

Bing literally has a copilot frame pop up every time you search with it that tries to answer your question

[-] Daxtron2@startrek.website 1 points 1 year ago

Again, LLM summarization of search results is not using an LLM as a search engine

[-] fidodo@lemmy.world 8 points 2 years ago

They didn't ask it to produce incorrect output; the prompts are not leading it to an incorrect answer. It does highlight an important limitation of LLMs, which is that they don't think, they just produce words based on probability.

However, it's wrong to think that just because it's limited, it's useless. It's important to understand the flaws so we can make them less common through how we use the tool.

For example, you can ask it to think everything through step by step. By producing a more detailed context window for itself, it can reduce mistakes. In this case, it could write out the letters with each one numbered, which would give it enough context to answer the question properly, since the letters and their positions would appear together. You could even tell it to write programs to assist itself, such as a letter-counting program that counts accurately and produces the correct answer.
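
To make that concrete, here is a rough sketch of the kind of helper an LLM could be asked to write for itself. The function names and the example word are made up for illustration and are not from the original post: one function spells the word out with numbered letters, the other does the actual count.

```python
def numbered_letters(word: str) -> str:
    """Spell the word out with each letter numbered, one per line."""
    return "\n".join(f"{i}. {ch}" for i, ch in enumerate(word, start=1))


def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word."""
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    return word.lower().count(letter.lower())


if __name__ == "__main__":
    word = "banana"  # hypothetical example word, not the one from the post
    print(numbered_letters(word))
    print(f"'a' appears {count_letter(word, 'a')} times")
```

The numbered listing is roughly what "write out the letters" would look like in the model's own context, while the counting function hands the arithmetic to ordinary code instead of token prediction.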

People can point out flaws in the technology all they want but smarter people are going to see the potential and figure out how to work around the flaws.

[-] Daxtron2@startrek.website 2 points 2 years ago

Yeah, which is why I get so aggravated when someone says that prompt engineering is pointless or not a real skill. It's a rapidly evolving discipline with lots of active research.

[-] petrol_sniff_king 1 points 2 years ago

If all of your time is spent correcting the answers you know you want it to give you, what use is it to you exactly?

Like, I'll take your word for it: you can trick it into giving more correct answers.

You would only do this if you already know the correct answers.

I mean, you can use it for rubber-ducking, I suppose. I don't know if that's revolutionary, but I guess it's not useless.

[-] fidodo@lemmy.world 1 points 2 years ago

I pointed out general strategies to make it more accurate without supervision. Getting LLMs to be reliable enough to use without supervision will be a matter of adding multiple layers of safeguards.

this post was submitted on 10 Feb 2024

1037 points (100.0% liked)
