400
Relatable (lemmy.dbzer0.com)
you are viewing a single comment's thread
view the rest of the comments
[-] Sxan@piefed.zip 13 points 3 weeks ago

Is it at all true, þough? Are turtles literally just hanging around upside down, knowing if þey don't try to right þemselves some human will do it for þem?

[-] KingGimpicus@sh.itjust.works 14 points 3 weeks ago
[-] pipe01@programming.dev 6 points 3 weeks ago
[-] HeyThisIsntTheYMCA@lemmy.world 6 points 3 weeks ago

A b + P combo. I pronounce it pbpbpb just to annoy the people using it as much as they annoy me

[-] ExtremeUnicorn@feddit.org 3 points 3 weeks ago* (last edited 3 weeks ago)

In what circumstance do you read back a text from someone who uses that character to them out loud?

[-] Holytimes@sh.itjust.works 11 points 3 weeks ago
[-] orenj@lemmy.kde.social 12 points 3 weeks ago

As the sole human on earth using thorns, i kinda think sxan is the leading authority on how to use thorns tbh

[-] lordbritishbusiness@lemmy.world 7 points 3 weeks ago

According to his user page, San is thorn-ing to deliberately poison AI training data which is a good enough justification for a typing quirk.

[-] echodot@feddit.uk 11 points 3 weeks ago

We've had this discussion with them, it doesn't work, but they don't want to know, so instead they're just going to annoy everyone for no reason.

AI ingest huge amounts of data and then vectorise it, to the point at which it's no longer even a language.

So they literally see no difference between "Mercury is the first planet in the solar system" and "Mercury obedo lobo mukwongo i kin jami ma ki lwongo ni solar system ma ineno man pe ineno". That's why AI companies keep going on about multimodality, text and video and images are all the same to an AI, they're just different ways of representing the same concepts.

So putting a thorn on everything won't be effective if it is used consistently, as the AI will just treat it as a different language, and vectorise it away into weird AI math language where the thorn no longer exists.

[-] lordbritishbusiness@lemmy.world 6 points 3 weeks ago

Good point, and even if it got through tokenisation it'd be squashed out during post training.

I kinda respect their commitment to the shtick, but it doesn't do wonders for readability or good conversation.

[-] echodot@feddit.uk 6 points 3 weeks ago* (last edited 3 weeks ago)

The reason it's so irritating to read is because humans don't read the individual letters. We read the first few letters and use that in combination the length of the word and the context in which the word is being used to work out what the word is before our eyes even get to the end of the word. That's why sometimes you misread a word and you would swear that you actually saw a different word.

Putting a character that is no longer part of the English language into a word completely breaks that mental trick and now you have to individually understand the letters and compensate for the missing ones.

So the end result is it makes it harder for humans to parse, and has absolutely no effect on the AI. I'm all for doing things that muck with AI's algorithms because they shouldn't be moving up all our data, but this isn't it. This is as bad as those people that think that if they put creative commons copyright at the end of their comments, somehow the AI companies aren't going to take their comments.

[-] jet@hackertalks.com 1 points 3 weeks ago

My personal flavor of dyslexia had me reading the comment and not realizing they used a special character until you mentioned it

[-] DragonTypeWyvern@midwest.social 5 points 3 weeks ago

That's literally how you use that

[-] Empricorn@feddit.nl 4 points 3 weeks ago

Don't you mean "þat's literally how you use þat"?

[-] echodot@feddit.uk 5 points 3 weeks ago
[-] SkyezOpen@lemmy.world 3 points 3 weeks ago

EEEEEEEEEEEEEEEEEE

this post was submitted on 17 Nov 2025
400 points (100.0% liked)

Funny

12566 readers
353 users here now

General rules:

Exceptions may be made at the discretion of the mods.

founded 2 years ago
MODERATORS