[-] Zess@lemmy.world 9 points 10 hours ago

You asked a stupid question and got a stupid response, seems fine to me.

[-] Grabthar@lemmy.world 11 points 14 hours ago

Doc: That’s an interesting name, Mr…

Fletch: Babar.

Doc: Is that with one B or two?

Fletch: One. B-A-B-A-R.

Doc: That’s two.

Fletch: Yeah, but not right next to each other, that’s what I thought you meant.

Doc: Isn’t there a children’s book about an elephant named Babar?

Fletch: Ha, ha, ha. I wouldn’t know. I don’t have any.

Doc: No children?

Fletch: No elephant books.

[-] humorlessrepost@lemmy.world 6 points 13 hours ago* (last edited 13 hours ago)

Works fine for me in o3-mini-high:

Counting letters in “strawberry”

Alright, I’m checking: the word “strawberry” is spelled S T R A W B E R R Y. Let me count the letters: S (1), T (2), R (3), A (4), W (5), B (6), E (7), R (8), R (9), Y (10). There are three R’s: in positions 3, 8, and 9. So, the answer is 3. Even if we ignore case, the count still holds. Therefore, there are 3 r’s in “strawberry.”
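(The tally is easy to double-check outside the model; a trivial deterministic sketch in Python, nothing more:)

```python
# Trivial, deterministic check of the count the model arrived at.
print("strawberry".lower().count("r"))  # -> 3
```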

[-] sheogorath@lemmy.world 1 point 12 minutes ago

Finally! With one household's energy consumption for a day, we can count how many Rs are in strawberry.

[-] winkly@lemmy.world 4 points 13 hours ago

How many strawberries could a strawberry bury if a strawberry could bury strawberries 🍓

[-] ClusterBomb 26 points 1 day ago

"My hammer is not well suited to cut vegetables" 🤷

There is so much to say about AI, can we move on from "it can't count letters and do math"?

[-] Strykker@programming.dev 5 points 14 hours ago

But the problem is more "my do-it-all tool randomly fails at arbitrary tasks in an unpredictable fashion", making it hard to trust as a tool in any circumstances.

[-] desktop_user 2 points 14 hours ago

it would be like complaining that a water balloon isn't useful because it isn't accurate. LLMs are good at approximating language; numbers are too specific and have more objective answers.

[-] daniskarma@lemmy.dbzer0.com 26 points 1 day ago* (last edited 1 day ago)

That happens when you don't understand what an LLM is, or what its use cases are.

This is like not being impressed by a calculator because it cannot give a word synonym.

[-] Strykker@programming.dev 5 points 14 hours ago

But everyone selling LLMs sells them as being able to solve any problem, making it hard to know when they're going to fail and give you junk.

[-] daniskarma@lemmy.dbzer0.com 6 points 14 hours ago

And Red Bull gives you wings.

Marketing within a capitalist market be like that for every product.

[-] NikkiDimes@lemmy.world 1 point 14 hours ago

Is anyone really pitching AI as being able to solve every problem though?

[-] gerryflap@feddit.nl 31 points 1 day ago* (last edited 1 day ago)

These models don't get single characters but rather tokens representing multiple characters. While I also don't like the "AI" hype, this image is also very one-dimensional hate and misrepresents the usefulness of these models by picking one adversarial example.
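(As a rough illustration of that, here is a sketch assuming the tiktoken library and its cl100k_base encoding, which may not match whatever tokenizer a given model actually uses:)

```python
# Sketch: a BPE tokenizer splits "strawberry" into multi-character chunks,
# so the model never "sees" the individual letters it is asked to count.
# Assumes the tiktoken package; cl100k_base is just one example encoding.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
token_ids = enc.encode("strawberry")
print(token_ids)                             # a short list of token IDs
print([enc.decode([t]) for t in token_ids])  # the multi-character pieces behind them
```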

Today ChatGPT saved me a fuckton of time by linking me to the exact GitLab issue that discussed the problem I was having (full system freezes using Bottles installed with flatpak on Arch). This was the URL it came up with after I explained the problem and gave it the first error I found in dmesg: https://gitlab.archlinux.org/archlinux/packaging/packages/linux/-/issues/110

This issue is one day old. When I looked this shit up myself I found exactly nothing useful on either DDG or Google. After this, ChatGPT also told me that the LTS kernel exists and how to install it. Obviously I verified that stuff before using it, because these LLMs have their limits. Now my system works again, and figuring this out myself would've cost me hours because I had no idea what broke. Was it flatpak, Nvidia, the kernel, Wayland, Bottles, or some random shit I changed in a config file 2 years ago? Well, thanks to ChatGPT, I know.

They're tools, and they can provide new insights that can be very useful. Just don't expect them to always tell the truth, or to actually be human-like.

[-] lennivelkant@discuss.tchncs.de 5 points 1 day ago

Just don't expect them to always tell the truth, or to actually be human-like

I think the point of the post is to call out exactly that: people preaching AI as replacing humans

[-] desktop_user 0 points 14 hours ago

it can, in the same way a loom did, just for more language-y tasks. A multimodal system might be better at answering that type of question by first detecting that this is a question of fact, and that using a bucket sort algorithm on the word "strawberry" will answer the question better than its questionably obtained correlations.
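(A rough sketch of that deterministic fallback in Python, with a plain character tally standing in for the bucket sort mentioned above; the function name is illustrative only:)

```python
# Sketch of the deterministic path a router could hand this question to,
# instead of letting the language model guess. A plain character tally
# stands in for the bucket-sort idea; count_letter is an illustrative name.
from collections import Counter

def count_letter(word: str, letter: str) -> int:
    buckets = Counter(word.lower())      # one bucket per character
    return buckets[letter.lower()]

print(count_letter("strawberry", "r"))   # -> 3
```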

[-] eggymachus@sh.itjust.works 37 points 1 day ago

A guy is driving around the back woods of Montana and he sees a sign in front of a broken down shanty-style house: 'Talking Dog For Sale.'

He rings the bell and the owner appears and tells him the dog is in the backyard.

The guy goes into the backyard and sees a nice looking Labrador Retriever sitting there.

"You talk?" he asks.

"Yep" the Lab replies.

After the guy recovers from the shock of hearing a dog talk, he says, "So, what's your story?"

The Lab looks up and says, "Well, I discovered that I could talk when I was pretty young. I wanted to help the government, so I told the CIA. In no time at all they had me jetting from country to country, sitting in rooms with spies and world leaders, because no one figured a dog would be eavesdropping, I was one of their most valuable spies for eight years running... but the jetting around really tired me out, and I knew I wasn't getting any younger so I decided to settle down. I signed up for a job at the airport to do some undercover security, wandering near suspicious characters and listening in. I uncovered some incredible dealings and was awarded a batch of medals. I got married, had a mess of puppies, and now I'm just retired."

The guy is amazed. He goes back in and asks the owner what he wants for the dog.

"Ten dollars" the guy says.

"Ten dollars? This dog is amazing! Why on Earth are you selling him so cheap?"

"Because he's a liar. He's never been out of the yard."

[-] VintageGenious@sh.itjust.works 93 points 1 day ago

Because you're using it wrong. It's good for generative text and chains of thought, not symbolic calculations including math or linguistics

[-] Grandwolf319@sh.itjust.works 23 points 1 day ago

Because you're using it wrong.

No, I think you mean to say it’s because you’re using it for the wrong use case.

Well this tool has been marketed as if it would handle such use cases.

I don’t think I’ve actually seen any AI marketing that was honest about what it can do.

I personally think image recognition is the best use case as it pretty much does what it promises.

[-] scarabic@lemmy.world 1 point 7 hours ago

Really? AI has been marketed as being able to count the r’s in “strawberry?” Please link to this ad.

[-] joel1974@lemmy.world 17 points 1 day ago

Give me an example of how you use it.

[-] scarabic@lemmy.world 1 point 6 hours ago

We have one that indexes all the wikis and GDocs and such at my work and it’s incredibly useful for answering questions like “who’s in charge of project 123?” or “what’s the latest update from team XYZ?”

I even asked it to write my weekly update for MY team once and it did a fairly good job. The one thing I thought it had hallucinated turned out to be something I just hadn’t heard yet. So it was literally ahead of me at my own job.

I get really tired of all the automatic hate over stupid bullshit like this OP. These tools have their uses. It’s very popular to shit on them. So congratulations for whatever agreeable comments your post gets. Anyway.

[-] slaacaa@lemmy.world 2 points 14 hours ago

I have it write emails for me in German. I moved there not too long ago; it works wonders for getting doctor's appointments, car service, etc. I also have it explain the text, so I'm learning the language.

I also use it as an alternative to internet search, which is now terrible. It's not going to help you find something super location-specific, but I can ask it to tell me something about a game/movie without spoilers, or to list Metacritic scores in a table, etc.

It also works great in summarizing long texts.

An LLM is a tool; what matters is how you use it. It is stupid, it doesn't think, and it's mostly hype to call it AI. But it definitely has its benefits.

[-] L3s@lemmy.world 33 points 1 day ago* (last edited 1 day ago)

Writing customer/company-wide emails is a good example. "Make this sound better: we're aware of the outage at Site A, we are working as quick as possible to get things back online"

Dumbing down technical information: "word this so a non-technical person can understand: our DHCP scope filled up and there were no more addresses available for Site A, which caused the temporary outage for some users"

Another is feeding it an article and asking for a summary; https://hackingne.ws/ does that for its Bsky posts.

Coding is another good example, "write me a Python script that moves all files in /mydir to /newdir" (a sketch of the kind of result is below).

Asking it to summarize a theory or protocol, "explain to me why RIP was replaced with RIPv2, and what problems people have had since with RIPv2"
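(For the script prompt above, the kind of answer you'd want back is short enough to audit at a glance; a sketch of a reasonable result, where /mydir and /newdir are just the placeholders from the prompt:)

```python
# Sketch of the sort of script the "move all files" prompt might produce.
# /mydir and /newdir are the placeholder paths from the example prompt.
import shutil
from pathlib import Path

src = Path("/mydir")
dst = Path("/newdir")
dst.mkdir(parents=True, exist_ok=True)

for item in src.iterdir():
    if item.is_file():                          # move files only, skip subdirectories
        shutil.move(str(item), str(dst / item.name))
```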

[-] Corngood@lemmy.ml 25 points 1 day ago

Make this sound better: we’re aware of the outage at Site A, we are working as quick as possible to get things back online

How does this work in practice? I suspect you're just going to get an email that takes longer for everyone to read, and doesn't give any more information (or worse, gives incorrect information). Your prompt seems like what you should be sending in the email.

If the model (or context?) was good enough to actually add useful, accurate information, then maybe that would be different.

I think we'll get to the point really quickly where a nice concise message like in your prompt will be appreciated more than the bloated, normalised version, which people will find insulting.

[-] L3s@lemmy.world 18 points 1 day ago* (last edited 1 day ago)

Yeah, normally my "Make this sound better" or "summarize this for me" is a longer wall of text that I want to simplify; I was trying to keep my examples short. Talking to non-technical people about a technical issue is not the easiest for me; AI has helped me dumb it down when sending an email, and it helps correct my shitty grammar at times.

As for accuracy, you review what it gives you; you don't just copy and send it without review. Also, you'll have to tweak some pieces where it doesn't make the most sense, such as when it uses wording you wouldn't typically use. It is fairly accurate in my use cases, though.

Hallucinations are a thing, so validating what it spits out is definitely needed.

Another example: if you feel your email is too stern or gives the wrong tone, I've used it for that as well. "Make this sound more relaxed: well maybe if you didn't turn off the fucking server we wouldn't of had this outage!" (Just a silly example)

[-] otp@sh.itjust.works 20 points 1 day ago

As for accuracy, you review what it gives you; you don't just copy and send it without review.

Yeah, I don't get why so many people seem to not get that.

It's like people who were against Intellisense in IDEs because "What if it suggests the wrong function?"...you still need to know what the functions do. If you find something you're unfamiliar with, you check the documentation. You don't just blindly accept it as truth.

Just because it can't replace a person's job doesn't mean it's worthless as a tool.

[-] lime@feddit.nu 15 points 1 day ago* (last edited 1 day ago)

i'm still not entirely sold on them but since i'm currently using one that the company subscribes to i can give a quick opinion:

i had an idea for a code snippet that could save me some headache (a mock for primitives in lua, to be specific) but i foresaw some issues with commutativity (aka how to make sure that a + b == b + a). so i asked about this, and the llm created some boilerplate to test this code. i've been chatting with it for about half an hour and testing the code it produces, and had it expand the idea to all possible metamethods available on primitive types, together with about 50 test cases with descriptive assertions. i've now run into an issue where the __eq metamethod isn't firing correctly when one of the operands is a primitive rather than a mock, and after having the llm link me to the relevant part of the docs, that seems to be a feature of the language rather than a bug.

so in 30 minutes i've gone from a loose idea to a well-documented proof-of-concept to a roadblock that can't really be overcome. complete exploration and feasibility study, fully tested, in less than an hour.
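(For readers who don't know Lua: a very rough Python analogue of the mock idea and the commutativity check, with illustrative names only; Lua's __eq behaves differently from Python's __eq__, which is exactly the roadblock described above.)

```python
# Rough Python analogue of the Lua mock-for-primitives idea; MockNumber and
# the asserts are illustrative, not the code the commenter actually generated.
class MockNumber:
    def __init__(self, value):
        self.value = value

    def _unwrap(self, other):
        return other.value if isinstance(other, MockNumber) else other

    def __add__(self, other):
        return MockNumber(self.value + self._unwrap(other))

    __radd__ = __add__                    # keeps a + b and b + a symmetric

    def __eq__(self, other):
        return self.value == self._unwrap(other)

a, b = MockNumber(2), 3
assert a + b == b + a                     # commutativity holds for the mock
assert a == 2 and 2 == a                  # Python still consults __eq__ against a
                                          # plain int; Lua only fires __eq when both
                                          # operands are tables, hence the roadblock
```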

[-] rumba@lemmy.zip 13 points 1 day ago

Yeah, and you know, I always hated this: screwdrivers make really bad hammers.

[-] whotookkarl@lemmy.world 47 points 1 day ago

I've already had more than one conversation where people quote AI as if it were a source, like quoting Google as a source. When I show them how it can sometimes lie and explain that it's not a primary source for anything, I just get that blank stare like I have two heads.

[-] AA5B@lemmy.world 2 points 23 hours ago

I’ve been avoiding this question up until now, but here goes:

Hey Siri …

  • how many r’s in strawberry? 0
  • how many letter r’s in the word strawberry? 10
  • count the letters in strawberry. How many are r’s? (hands off to ChatGPT) … 2
[-] Grandwolf319@sh.itjust.works 35 points 1 day ago* (last edited 1 day ago)

There is an alternative reality out there where LLMs were never marketed as AI and were instead marketed as random text generators.

In that world, tech-savvy people would embrace this tech instead of having to constantly educate people that it is in fact not intelligence.

[-] daniskarma@lemmy.dbzer0.com 4 points 1 day ago

They are not random per se. They are just statistical, with some degree of randomization.

[-] Static_Rocket@lemmy.world 6 points 1 day ago

That was this reality. Very briefly. Remember AI Dungeon and the other clones that were popular prior to the mass ML marketing campaigns of the last 2 years?

[-] whynot_1@lemmy.world 37 points 1 day ago

I think I have seen this exact post word for word fifty times in the last year.

[-] clay_pidgin@sh.itjust.works 17 points 1 day ago

Has the number of "r"s changed over that time?

[-] Tgo_up@lemm.ee 30 points 1 day ago

This is a bad example. If I ask a friend "is strawberry spelled with one or two r's?" they would think I'm asking about the last part of the word.

The question seems to be specifically made to trip up LLMs. I've never heard anyone ask how many of a certain letter are in a word. I've heard people ask how you spell a word and whether it's with one or two of a specific letter, though.

If you think of LLMs as something with actual intelligence you're going to be very unimpressed. It's just a model to predict the next word.

If you think of LLMs as something with actual intelligence you're going to be very unimpressed. It's just a model to predict the next word.

This is exactly the problem, though. They don’t have “intelligence” or any actual reasoning, yet they are constantly being used in situations that require reasoning.

[-] Tgo_up@lemm.ee 1 point 1 day ago

What situations are you thinking of that require reasoning?

I've used LLMs to create software I needed but couldn't find online.

Creating software is a great example, actually. Coding absolutely requires reasoning. I’ve tried using code-focused LLMs to write blocks of code, or even some basic YAML files, but the output is often unusable.

It rarely makes syntax errors, but it will do things like reference libraries that haven’t been imported or hallucinate functions that don’t exist. It also constantly misunderstands the assignment and creates something that technically works but doesn’t accomplish the intended task.


It's predictive text on speed. The LLMs currently in vogue hardly qualify as A.I., tbh.

[-] artificialfish@programming.dev 13 points 1 day ago

This is literally just a tokenization artifact. If I asked you how many r's are in /0x5273/0x7183, you'd be confused too.
