[-] FaceDeer@fedia.io 1 points 7 hours ago

Alright, so swap in some different words if you don't like those. The basic point is the same: a bunch of models from different sources can solve this, so it's not just some weird one-off fluke.

Your own argument is a bit all over the place too, by the way. You said this puzzle "wasn't tricky in the slightest" and yet that "it requires understanding what is being asked." So only 71.5% of humans can accomplish this "not tricky in the slightest" problem, but there are some AI models that are able to "understand what is being asked"? Is "understanding" things not "tricky"?

[-] FaceDeer@fedia.io 1 points 7 hours ago

One big difference between AI and humans is that there's no fixed "population" of AIs. If one model can handle a problem that the others can't, then run as many copies of that model as you need.

It doesn't matter how many models can't accomplish this. I could spend a bunch of time training up a bunch of useless models that can't do this, but that wouldn't make any difference. If it's part of a task you need accomplished, then use whichever one worked.

[-] FaceDeer@fedia.io 6 points 8 hours ago

It's nice in that it gives the various Discord competitors a longer period to work on getting their software spiffed up.

[-] FaceDeer@fedia.io 1 points 9 hours ago

Yes. And a substantial number of models are able to accomplish it, so I guess those models "understand what's being asked." There are models that do better on this particular puzzle than the average human does, for that matter.

[-] FaceDeer@fedia.io 1 points 10 hours ago

It's tricky in the sense that it requires abstract reasoning.

[-] FaceDeer@fedia.io 30 points 11 hours ago

Whenever you want a view that's closest to the "truth", look at what the insurance rates are like. Insurance companies' whole business hinges on turning objective reality into money.

[-] FaceDeer@fedia.io 5 points 11 hours ago

This could backfire on them. I recall reading a study several years back in which anti-abortion sentiment plummeted the moment subjects were asked whether the pregnant woman herself should be punished (not even executed, just punished) for getting an abortion.

It's an interesting bit of cognitive dissonance. The sentiment seemed to be "yeah, abortion is murder, the doctor's a murderer, but the person who hired him to do that murder is an innocent victim who must be protected."

[-] FaceDeer@fedia.io 1 points 12 hours ago

In fact, I prefer the use of local AIs and dislike how the field is being dominated by big companies like Google or OpenAI. Unfortunately personal preferences don't change reality.

[-] FaceDeer@fedia.io 1 points 12 hours ago

I'm not doing it on behalf of anyone. Should we ignore the technology because we don't like the specific people who are developing it?

[-] FaceDeer@fedia.io 3 points 12 hours ago

And yet the best models outdid humans at this "car wash test." Humans got it right only 71.5% of the time.

[-] FaceDeer@fedia.io 8 points 1 day ago

Yeah, "AI is getting pretty good" is a very unpopular opinion in these parts. Popularity doesn't change the results though.

[-] FaceDeer@fedia.io 8 points 1 day ago

And that score is matched by GPT-5. Humans are running out of "tricky" puzzles to retreat to.

FaceDeer

joined 2 years ago