[HN Gopher] Breaking our Latin-1 assumptions
       ___________________________________________________________________
        
       Breaking our Latin-1 assumptions
        
       Author : sundarurfriend
       Score  : 38 points
       Date   : 2022-06-18 19:48 UTC (3 hours ago)
        
 (HTM) web link (manishearth.github.io)
 (TXT) w3m dump (manishearth.github.io)
        
       | Dylan16807 wrote:
       | > While this doesn't affect rendering, Unicode, as a system for
       | describing text, also has a concept of interlinear annotation
       | characters. These are used to represent furigana / ruby. Fonts
       | don't render this, but it's useful if you want to represent text
       | that uses ruby.
       | 
       | Useful as long as "represent" means internal in-process use only.
       | You're supposed to never save them into documents or send them
       | between systems.
        
       | RicoElectrico wrote:
       | If I had a dollar for every B2B software in my corp that screws
       | up such basic thing...
       | 
       | Yet the box-ticker drones from IT procurement are content with
       | vendors who assume we live in a world without diacritic marks, I
       | guess. Using 8-bit character sets in 2022 is a "brown M&M's" [1]
       | indicator for me. If they can't be bothered to use Unicode, what
       | else they don't care about?
       | 
       | [1] https://www.entrepreneur.com/article/232420
        
       | notriddle wrote:
       | (2017)
        
       | robin_reala wrote:
       | This is usually the point to link to the Big List of Naughty
       | Strings: https://github.com/minimaxir/big-list-of-naughty-strings
       | 
       | If your system can handle these it can probably handle most
       | global text.
        
       ___________________________________________________________________
       (page generated 2022-06-18 23:00 UTC)