[HN Gopher] Breaking our Latin-1 assumptions ___________________________________________________________________ Breaking our Latin-1 assumptions Author : sundarurfriend Score : 38 points Date : 2022-06-18 19:48 UTC (3 hours ago) (HTM) web link (manishearth.github.io) (TXT) w3m dump (manishearth.github.io) | Dylan16807 wrote: | > While this doesn't affect rendering, Unicode, as a system for | describing text, also has a concept of interlinear annotation | characters. These are used to represent furigana / ruby. Fonts | don't render this, but it's useful if you want to represent text | that uses ruby. | | Useful as long as "represent" means internal in-process use only. | You're supposed to never save them into documents or send them | between systems. | RicoElectrico wrote: | If I had a dollar for every B2B software in my corp that screws | up such basic thing... | | Yet the box-ticker drones from IT procurement are content with | vendors who assume we live in a world without diacritic marks, I | guess. Using 8-bit character sets in 2022 is a "brown M&M's" [1] | indicator for me. If they can't be bothered to use Unicode, what | else they don't care about? | | [1] https://www.entrepreneur.com/article/232420 | notriddle wrote: | (2017) | robin_reala wrote: | This is usually the point to link to the Big List of Naughty | Strings: https://github.com/minimaxir/big-list-of-naughty-strings | | If your system can handle these it can probably handle most | global text. ___________________________________________________________________ (page generated 2022-06-18 23:00 UTC)