Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Did anyone else notice the weird GNU/Linux screed on line 803? In particular, the "Additional Comments" content is actually written with a large amount of look-alike Unicode. Playing around with this data in Excel somehow byte-shifted it into garbage. I noticed that other folks thought it would be funny to put things like =SUM(G1:G1000) in other cells, but this seems a little bit more sinister for some reason. Anyone savvy enough in UTF-8 to deduce what's going on there?


I don't see anything weird (except that it's a useless entry). The Additional Comments column has a long quote by Stallman starting "I'd like to interject for a moment..."

https://en.wikiquote.org/wiki/Talk:Richard_Stallman


Depending on the system you use to view the text, many of the characters are actually printed using non-Roman lookalike characters that still render as the "usual" ones.

In OSX / Firefox I see the usual text in the web view, but in source view there's a variation in shading between characters, in what should be an unadorned monospaced font. Viewing the same source in Chrome shows the trick for what it is...

https://www.dropbox.com/s/oj9cqlh3kh90zep/Screenshot%202016-...

Subtle and not always visible due to differences in display normalization in various libraries? Way out of my depth for system fonts / encoding issues. But hopefully the above shows what I'm talking about. Could there be some data hidden in the lookalike string values?

http://www.lookout.net/2012/04/generating-confusable-lookali...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: