Why I'm Betting Against AI Agents in 2025 (Despite Building Them) (utkarshkanwat.com)

submitted 1 week ago by rssbot@lemmy.bestiver.se to c/hackernews@lemmy.bestiver.se

8 comments fedilink hide all child comments

top 8 comments

sorted by: hot top controversial new old

[-] Xulai@mander.xyz 10 points 1 week ago

As someone who works in tech, currently testing AI integration into healthcare EHR- the current state of AI is simply not safe for anything outside transcription- and even that is error prone without strict re-reading (not scanning!) for error correction.

The errors can be subtle but life threatening. I highly recommended against integrating it - but the most lazy providers were already using AI illegally for their notes so this was seen as a middle road.

Medical care and provider training in the USA is not ok right now, and getting worse. AI and misinformation is accelerating the decline.

[-] hansolo@lemmy.today 4 points 1 week ago* (last edited 1 week ago)

TL;DR: Three Hard Truths About AI Agents After building 12+ production systems, here's what I've learned: -Error rates compound exponentially in multi-step workflows. 95% reliability per step = 36% success over 20 steps. Production needs 99.9%+. Context windows create quadratic token costs. -Long conversations become prohibitively expensive at scale. -The real challenge isn't AI capabilities, it's designing tools and feedback systems that agents can actually use effectively.

The TL;DR of the TL;DR is compounding expensive, error-prone results.

[-] jbloggs777@discuss.tchncs.de 3 points 1 week ago* (last edited 1 week ago)

It sounds like one should be building deliberate AI workflows with extra checks (automated or human in the loop) that make careful and cost efficient incremental progress toward a measurable goal.

Sounds like hard work... when we could just build 1,000,000 MCP servers instead. (raises pinkie to corner of mouth)

[-] DecaturNature@peachpie.theatl.social 2 points 1 week ago

Much of the hype around AI is based on the hope that we will no longer need to organize our own thoughts, express our own perspective, or understand the details or our own lives. These people want some hyper generalized, disembodied 'intelligence' to do everything for them ... it's no wonder that that they produce little more than hot air.

[-] Outwit1294@lemmy.today 4 points 1 week ago

Interesting. I have never seen the economic side of it being discussed outside of nvidia stock prices.

[-] Dadifer@lemmy.world 3 points 1 week ago

I keep having the same question: would it benefit to have a separate agent whose job was to error-check the first agent?

[-] scribbler@lemmy.world 3 points 1 week ago

The three stooges didn't seem any less likely to get into trouble despite their strength in numbers

[-] PattyMcB@lemmy.world 1 points 1 week ago

Also... how many security vulnerabilities have those agents introduced?

this post was submitted on 20 Jul 2025

18 points (100.0% liked)

Hacker News

2154 readers

405 users here now

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

founded 10 months ago

MODERATORS

patrick@lemmy.bestiver.se

rssbot@lemmy.bestiver.se