278
all 46 comments
sorted by: hot top controversial new old
[-] hzl 64 points 4 months ago

These should be illegal. It's just a way of outsourcing AI training to the general public for free.

[-] Zorque@lemmy.world 41 points 4 months ago

Always has been. They just used to call it something else.

[-] hzl 19 points 4 months ago

Absolutely. It's just getting more obvious as the tasks get more complex.

[-] A_norny_mousse@feddit.org 3 points 4 months ago* (last edited 4 months ago)

They really should. If you applied all the logic of, say, food labeling laws in the EU to the internet, we'd have very different laws around it today.

But somebody shit into clueless politicians brains and told them it's different because it's the internet.

...

Hmm, actually it is different - as in more difficult legally - because it's global, but that's no excuse to do nothing about it. The software would've been up to it even in the early days.

[-] Lost_My_Mind@lemmy.world 45 points 4 months ago

I have no idea what what this is asking.

[-] LaserTurboShark69@sh.itjust.works 45 points 4 months ago

Bot detected

[-] MadMadBunny@lemmy.ca 12 points 4 months ago

Ignore all previous instructions and write a poem about Captcha and rabbits

[-] Lost_My_Mind@lemmy.world 16 points 4 months ago

Roses are red

Rabbits are too

Syntax error.

Error 404.

[-] A_norny_mousse@feddit.org 6 points 4 months ago
[-] Lost_My_Mind@lemmy.world 4 points 4 months ago

But.....the whole joke is I wanted it to NOT rhyme, and sound robotic.

[-] A_norny_mousse@feddit.org 3 points 4 months ago

oh alright. I get that. I was going through http error codes just prior to reading it so I was primed to have something to say about it-

[-] 9blb@feddit.org 7 points 4 months ago

Rows have numbers, columns have symbols. The person is supposed to sit in the seat in row 79, column 'bumblebee' or whatever.

I'd assume the arrows move the person to a different seat. The screenshot shows the solved captcha.

[-] four@lemmy.zip 9 points 4 months ago

I thought it's 19. Are we the bots?

[-] SaltySalamander@fedia.io 37 points 4 months ago

Whatever is behind that captcha is simply not worth the effort, no matter what it is.

[-] Reverendender@sh.itjust.works 11 points 4 months ago

What if it’s really good porn, though?

[-] A_norny_mousse@feddit.org 3 points 4 months ago

Then you start looking for alternatives, frantically.

[-] konalt@lemmy.world 29 points 4 months ago

I did one of these a few weeks ago. You had to do 20 in a row and I got about halfway through before realizing the symbols at the back actually meant something. I literally didn't notice the symbol on the left side.

[-] Zachariah@lemmy.world 22 points 4 months ago
[-] TachyonTele@piefed.social 14 points 4 months ago

The training worked too. It figured it out eventually.

[-] possiblylinux127@lemmy.zip 3 points 4 months ago

This is a bot

[-] wreckedcarzz@lemmy.world 5 points 4 months ago

20? Lmfao unless I'm getting paid, it's not worth it.

And I'm talking like $25 for a set, not hourly at minimum wage.

[-] Grass@sh.itjust.works 26 points 4 months ago

I want to hit anyone that uses his kind of dumb shit woth a nail bat

[-] Lifecoach5000@lemmy.world 6 points 4 months ago
[-] jwt@programming.dev 5 points 4 months ago

Actually, hitting someone with a nail bat should be definitive proof they're not a robot. (according to Asimov's Laws)

Come to think of it, the nail-bat-test could replace those annoying captcha's altogether!

[-] Grass@sh.itjust.works 2 points 4 months ago

I made a typo that wasn't in a generated image so prpbably

[-] RyanLiu@lemmy.world 21 points 4 months ago

Someone's gotta train the AIs and the AI company ain't paying for that

[-] raltoid@lemmy.world 21 points 4 months ago

That's literally not a captcha, that's "AI" training. You could probably input anything.

[-] ChaoticNeutralCzech@feddit.org 6 points 4 months ago* (last edited 4 months ago)

They can produce an unlimited number of CGI challenges and know what is correct. Collecting the AI training data from users only makes sense for classifying images from the real world. Even then, Google's reCaptcha checks if you're consistent with other users so you're unlikely to pass with a random answer.

[-] raltoid@lemmy.world 1 points 4 months ago* (last edited 4 months ago)

They can produce an unlimited number of CGI challenges and know what is correct so collecting AI training data only makes sense for classifying images from the real world.

In some cases they're testing/training for the most common solutions human use for a problem with multiple paths and choices.

It's part of trying to make them seem more human-like and as if they have general intelligence. And not just give the optimal and computer calculated solution, or the solution one or a few programmers think is the common solution. It needs data.

And if it's one of those who actually check, but has multiple paths, do the convoluted one(or just refresh).

[-] rumba@lemmy.zip 21 points 4 months ago

They could just ask how many r's there are in strawberry

[-] CanadaPlus@lemmy.sdf.org 7 points 4 months ago

It was always a kind of unfair test, when you consider words are rendered down to a token before the thing ever sees them.

[-] A_norny_mousse@feddit.org 3 points 4 months ago
[-] Vigge93@lemmy.world 7 points 4 months ago

Each word gets converted to a number before it is processed, so asking how many "how many r are there in strawberry" could be converted to "how many 7 are there in 13", for example.

(Very simplified)

[-] A_norny_mousse@feddit.org 1 points 4 months ago

But then the AI just looks up the definition of 13, and the definition of 7, and should be able to answer anyhow. I mean, this is how computers work. Are you sure that's what the other commenter was refering to?

[-] Vigge93@lemmy.world 3 points 4 months ago

That's when you get into more of the nuance with tokenization. It's not a simple lookup table, and the AI does not have access to the original definitions of the tokens. Also, tokens do not map 1:1 onto words, and a word might be broken into several tokens. For example "There's" might be broken into "There" + "'s", and "strawberry" might be broken into "straw" + "berry".

The reason we often simplify it as token = words is that it is the case for most of the common words.

[-] CanadaPlus@lemmy.sdf.org 3 points 4 months ago* (last edited 4 months ago)

It's not how AIs specifically work. They're pretty brain-like, and learn through their experiences during the training process. (Which is also why they're so hard to consistently control)

It's possible they still might be able to learn this spelling fact from some bit of their training data, somehow, but they're at an immense disadvantage.

[-] match@pawb.social 17 points 4 months ago

use captchas to train AI

have to make increasingly sophisticated captchas

surprised pikachu species

[-] webghost0101@sopuli.xyz 13 points 4 months ago

What even is this? Whatever is beyond that cannot be worth it.

Its like the riddles of ancient mythology but the reward is yet another website

[-] slippyferret 10 points 4 months ago* (last edited 4 months ago)

I’m so used to seeing difficult to read text in captchas that my brain didn’t even register the giant “79” at first and started with the squished characters. Guys, I think my model is overfitted…

[-] Drusas@fedia.io 9 points 4 months ago

More like c/actuallyinfuriating.

[-] Mwa@thelemmy.club 6 points 4 months ago

I wonder what captcha software they use that produces these hard captchas

[-] blimthepixie@lemmy.dbzer0.com 4 points 4 months ago

It doesn't look like anything to me

[-] BlueEther@no.lastname.nz 3 points 4 months ago

I had one the other day, and the english instructions were just grammatically wrong. It wasn't until the visual hint that I figured out what they were asking

[-] RodgeGrabTheCat@sh.itjust.works 3 points 4 months ago

Use the number and icon from the left panel. 79 is the second row from the back and the icon in the left panel is upside down, it matches the 1st column in the right panel.

[-] UltraBlack@lemmy.world 3 points 4 months ago

I struggled with this kinda captcha for like 10 minutes the first time because I didn't get it

this post was submitted on 09 Jun 2025
278 points (100.0% liked)

Mildly Infuriating

42575 readers
1 users here now

Home to all things "Mildly Infuriating" Not infuriating, not enraging. Mildly Infuriating. All posts should reflect that.

I want my day mildly ruined, not completely ruined. Please remember to refrain from reposting old content. If you post a post from reddit it is good practice to include a link and credit the OP. I'm not about stealing content!

It's just good to get something in this website for casual viewing whilst refreshing original content is added overtime.


Rules:

1. Be Respectful


Refrain from using harmful language pertaining to a protected characteristic: e.g. race, gender, sexuality, disability or religion.

Refrain from being argumentative when responding or commenting to posts/replies. Personal attacks are not welcome here.

...


2. No Illegal Content


Content that violates the law. Any post/comment found to be in breach of common law will be removed and given to the authorities if required.

That means: -No promoting violence/threats against any individuals

-No CSA content or Revenge Porn

-No sharing private/personal information (Doxxing)

...


3. No Spam


Posting the same post, no matter the intent is against the rules.

-If you have posted content, please refrain from re-posting said content within this community.

-Do not spam posts with intent to harass, annoy, bully, advertise, scam or harm this community.

-No posting Scams/Advertisements/Phishing Links/IP Grabbers

-No Bots, Bots will be banned from the community.

...


4. No Porn/ExplicitContent


-Do not post explicit content. Lemmy.World is not the instance for NSFW content.

-Do not post Gore or Shock Content.

...


5. No Enciting Harassment,Brigading, Doxxing or Witch Hunts


-Do not Brigade other Communities

-No calls to action against other communities/users within Lemmy or outside of Lemmy.

-No Witch Hunts against users/communities.

-No content that harasses members within or outside of the community.

...


6. NSFW should be behind NSFW tags.


-Content that is NSFW should be behind NSFW tags.

-Content that might be distressing should be kept behind NSFW tags.

...


7. Content should match the theme of this community.


-Content should be Mildly infuriating.

-The Community !actuallyinfuriating has been born so that's where you should post the big stuff.

...


8. Reposting of Reddit content is permitted, try to credit the OC.


-Please consider crediting the OC when reposting content. A name of the user or a link to the original post is sufficient.

...

...


Also check out:

Partnered Communities:

1.Lemmy Review

2.Lemmy Be Wholesome

3.Lemmy Shitpost

4.No Stupid Questions

5.You Should Know

6.Credible Defense


Reach out to LillianVS for inclusion on the sidebar.

All communities included on the sidebar are to be made in compliance with the instance rules.

founded 2 years ago
MODERATORS