320
top 17 comments
sorted by: hot top controversial new old
[-] arschflugkoerper@feddit.org 67 points 5 days ago

vscode unfortunately highlights unexpected unicode characters

[-] Thedogdrinkscoffee@lemmy.ca 25 points 5 days ago
[-] Matty_r@programming.dev 12 points 5 days ago

Harder to prank someone when it highlights it

[-] Trail@lemmy.world 7 points 5 days ago

Ok, put it into an excel formula then.

[-] ChaoticNeutralCzech@feddit.org 1 points 2 hours ago

Non-native English speaker telling on themselves.
(Excel only uses semicolons instead of commas in locales where they would collide with the decimal comma)

[-] anton@piefed.blahaj.zone 21 points 4 days ago* (last edited 4 days ago)

My IDE says: '(', '+', '-', '.', ';', <operator>, '[' or '}' expected, got ';'
But the rust compiler explains

error: unknown start of token: \u{37e}  
help: Unicode character ';' (Greek Question Mark) looks like ';' (Semicolon), but it is not```   
what a killjoy.
[-] m_f@discuss.online 56 points 5 days ago

; and ; respectively, in case anyone wants to see how it renders on their machine and is also lazy.

[-] squaresinger@lemmy.world 6 points 4 days ago

It must really suck to work as a Java developer in Greece.

[-] 30p87@feddit.org 25 points 5 days ago

silently closes IntelliJ and vim, opens PyCharm

[-] anton 4 points 4 days ago

As if a white space sensitive language protects from this fuckery.

  • How many thin spaces are one level of indentation?
  • Will anyone notice a hair space?
  • Who can tell the difference between a space and a figure space? they are the same size in a mono spaced font
[-] lordnikon@lemmy.world 4 points 5 days ago

Vscode might but what about llms asking for a friend

[-] frezik 1 points 4 days ago

Since they're fundamentally predicting the next token, and there isn't a lot of training data out there that would actually do this, I wouldn't expect that LLMs are going to start putting in lookalike characters. They only lookalike to humans.

That said, you could probably poison their training datasets this way.

[-] lordnikon@lemmy.world 1 points 4 days ago

Yeah that was the idea get the llms to start using look alike characters to poison their outputs.

[-] Infernal_pizza@lemmy.dbzer0.com 1 points 4 days ago

Why do characters like this even exist? I've run into this before where I couldn't find a file I'd downloaded by searching for it. I remembered what folder it was in and checked it was still there, after playing around with the name for a bit I realised the "a" in the file name wasn't actually an a.

[-] frezik 4 points 4 days ago

Simple answer is that Unicode is a design by committee attempting to make every single human written language work. It's more complicated than it needs to be, but we also don't want to redo all the work it would take to replace it with something more sane. Especially KJC languages. Trying to get those three to agree on anything is for people who deal with frustration better than me.

this post was submitted on 31 Jul 2025
320 points (100.0% liked)

Programmer Humor

25495 readers
1764 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS