222

LLMs still don't understand the word "no", much like their creators (www.quantamagazine.org)

submitted 2 years ago by dgerard@awful.systems to c/techtakes@awful.systems

24 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] froztbyte@awful.systems 38 points 2 years ago

it's almost like this thing has no internal conceptual representation! I know this can't possibly be, millions of promptfans and prompfondlers have told me it can't be so, but it sure does look that way! wild!

[-] kogasa@programming.dev 4 points 2 years ago

It must have some internal models of some things, or else it wouldn't be possible to consistently make coherent and mostly reasonable statements. But the fact that it has a reasonable model of things like grammar and conversation doesn't imply that it has a good model of literally anything else, which is unlike a human for whom a basic set of cognitive skills is presumably transferable. Still, the success of LLMs in their actual language-modeling objective is a promising indication that it's feasible for a ML model to learn complex abstractions.

[-] sc_griffith@awful.systems 26 points 2 years ago

if I copy a coherent sentence into my clipboard, my clipboard becomes capable of consistently making coherent statements

[-] kogasa@programming.dev 5 points 2 years ago* (last edited 2 years ago)

Yes, but that's not how LLMs work. My statement depends heavily on the fact that a LLM like GPT is coaxed into coherence by unsupervised or semi-supervised training. That the training process works is the evidence of an internal model (of language/related concepts), not just the fact that something outputs coherent statements.

[-] sc_griffith@awful.systems 14 points 2 years ago* (last edited 2 years ago)

if I have a bot pick a random book and copy the first sentence into my clipboard, my clipboard becomes capable of consistently making coherent statements. unsupervised training 👍

[-] self@awful.systems 14 points 2 years ago

let me free up some of your time so you can go figure out how LLMs actually work

[-] ondoyant@beehaw.org 12 points 2 years ago

this isn't necessarily true. patterns in data aren't by nature proof of an underlying system of logic. if you run the line-fitting machine on any kind of data, its going to output a line. considering just how much data is encoded into these transformers, i don't think we can conclusively say that it has a underlying conception of how language works, much less an understanding of the concepts that language represents. it could really just be using the vast quantities of data it has to output approximately correct statements. there's absolutely structure there, but it doesn't have to have the kind of structured understanding humans have about language to produce language, in the same way a less sophisticated machine learning model doesn't have to know what kind of data its fitting a line to to make a line.

[-] mawhrin@awful.systems 16 points 2 years ago

it doesn't. that's why we're calling it “spicy autocompletion” .

[-] kogasa@programming.dev 2 points 2 years ago

It does, which is why it's autocompletion and not auto-gibberish.

[-] slopjockey@awful.systems 15 points 2 years ago

It must have some internal models of some things, or else it wouldn’t be possible to consistently make coherent and mostly reasonable statements.

Talk about begging the question

this post was submitted on 23 May 2024

222 points (100.0% liked)

TechTakes

2626 readers

108 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 3 years ago

MODERATORS

dgerard@awful.systems