[HN Gopher] A prisoner who worked out how to input Chinese chara...
       ___________________________________________________________________
        
       A prisoner who worked out how to input Chinese characters into a
       machine
        
       Author : dangerman
       Score  : 113 points
       Date   : 2022-01-26 15:13 UTC (7 hours ago)
        
 (HTM) web link (www.wired.com)
 (TXT) w3m dump (www.wired.com)
        
       | karmakaze wrote:
       | It seems like the Hangul[0]/Korean writing system was retrofitted
       | into Chinese with the challenge being to make the imprecise
       | precise. Hangul was designed with the purpose of solving
       | illiteracy and so the character forms were built simply from
       | sound components. Doing it in reverse with a larger unstructured
       | set of symbols is quite the feat.
       | 
       | [0] https://en.wikipedia.org/wiki/Hangul
        
         | pseingatl wrote:
         | Korean is not a tonal language. But Hangul is great, I wish
         | Thailand would adopt it.
        
           | monocasa wrote:
           | The same idea works tonemes distinguished by common graphical
           | features as well.
        
           | karmakaze wrote:
           | Thai spoken language is the opposite extreme of German,
           | vowels are the salient parts and consonants are often barely
           | present and highly interchangeable. I don't know how the Thai
           | written language works or how complex it is in comparison but
           | I imagine it also wouldn't be a straightforward mapping.
           | Quick lookup says it's tonal and analytic [idk what this is]
           | like Chinese and Vietnamese.
        
           | VygmraMGVl wrote:
           | When Hangul was invented, Korean was tonal and Hangul
           | accounted for this with diacritics.
           | 
           | Apparently some dialects of Korean are still tonal.
           | 
           | https://autolingual.com/korean-tones/
        
       | jjcc wrote:
       | Here's the bio of the prisoner:
       | https://en.wikipedia.org/wiki/Zhi_Bingyi
        
         | rabbits77 wrote:
         | I was wondering how he ended up and the biography on Wikipedia
         | is unclear on how his life may have improved after publishing
         | his results. He published in 1978 and passed away fifteen years
         | later. Other than a note that he joined The Party in 1991 there
         | is no information on his last years.
         | 
         | I am not sure what to make of the information that is given.
         | Does it imply he was treated poorly, even tortured, until 1991
         | when he relented?
        
       | neonate wrote:
       | https://archive.is/Cl2eK
        
       | kabell wrote:
       | > Because the code took only the first letter, rather than the
       | complete sound of the character, most regional speech variations
       | did not matter.
       | 
       | This bit is really interesting. This is one of the motivations
       | for an open source Chinese IME framework called RIME:
       | https://rime.im There are configurations available for a number
       | of dialects like Suzhou or Shanghai dialects so that users who
       | prefer those languages don't have to use Mandarin/putonghua
       | pinyin.
        
       | maupin wrote:
       | The article doesn't go into the implementation of his system,
       | which was likely a real bitch.
        
       | est wrote:
       | He is just one of the early pioneers of Chinese input methods,
       | his particular method is stroke-based and works on 0-9 number
       | pads, went obsolete decades ago, but surely had an influence over
       | later popular ones like Wubi.
       | 
       | Today, input methods were dominated by pronunciation based one
       | like Pinyin. The heuristics based "smart" Pinyin input methods
       | was actually invented by a non-tech folk and eventually hired by
       | a then-large Internet company called Sogou, it was quite novel
       | for the input method to suggest word pairs like a search engine
       | with most frequent used ones readily type-able. Some later
       | versions even include crazy macros like the current timestamp,
       | emojis, or even read your contact list (!!) for quicker name &
       | address suggestions.
        
         | gibolt wrote:
         | Pinyin may dominate in Mainland China, but places like Taiwan
         | and Hong Kong still prefer their own input methods.
         | 
         | Both generally use Traditional instead of Simplified characters
         | and HK's Cantonese has an underused pinyin equivalent, but that
         | isn't a cause. Other systems became prevalent first, and stuck.
        
           | xster wrote:
           | Taiwan's Zhuyin is 90+% the same as Pinyin except the input
           | is composed with Chinese script radicals (like hiragana)
           | rather than Latin script.
           | 
           | > Both generally use Traditional instead of Simplified
           | 
           | That doesn't impact the human side of the IME. It's just
           | different output for the same input. And the input (Mandarin
           | pronunciation) is the same for mainland and Taiwan.
        
             | yorwba wrote:
             | There are differences in the Mandarin pronunciations
             | standardized on the Mainland and Taiwan. Most of them
             | involve using the fifth tone or not, but there are a few
             | more noticeable exceptions, like He  _he_ HAN\ (which would
             | correspond to pinyin _han_ ) and Ya  _ya_ IAI/ (which would
             | correspond to pinyin _yai_ ).
             | 
             | Stuff like that occasionally trips me up when switching
             | between pinyin and zhuyin.
        
         | hangonhn wrote:
         | The problem with pinyin is that it's phonetic based and it's
         | based on Mandarin. For large sub-populations of Chinese
         | speakers who primarily use other dialects, it's not as
         | intuitive -- although this is changing overtime as Mandarin
         | becomes more dominant across China. A stroke/character based
         | system preserves a key characteristic of Chinese, which is that
         | speakers of different Chinese dialects can all use the same
         | writing system. However, I think Pinyin is easier for Mandarin
         | speakers (in PRC at least) and also non-native Chinese speakers
         | who likely come from a phonetic based writing system and have
         | learned Mandarin using Pinyin.
        
         | odiroot wrote:
         | Pinyin is really peculiar. It makes it so much easier to learn
         | to outsiders.
         | 
         | I know some overseas Chinese people who (not being raised in
         | China) never learned pinyin and prefer the stroke method. Some
         | of the even prefer drawing the strokes by hand (instead of
         | choosing from the keyboard).
         | 
         | Any Chinese word, I've ever learned, I can only input using
         | pinyin. The stroke method adds so much more mental overhead.
        
           | thaumasiotes wrote:
           | > I know some overseas Chinese people who (not being raised
           | in China) never learned pinyin and prefer the stroke method.
           | 
           | I knew someone who used wubi in preference to pinyin input.
           | She grew up in Guangzhou until age 16, so it wasn't an issue
           | of not learning pinyin. The problem, for her, was that she
           | had no way to know the correct pinyin for a character, since
           | she didn't distinguish between a bunch of sounds that are
           | distinct in pinyin.
           | 
           | > Any Chinese word, I've ever learned, I can only input using
           | pinyin. The stroke method adds so much more mental overhead.
           | 
           | If you were capable of writing by hand, this wouldn't be
           | true.
        
         | GauntletWizard wrote:
         | Contact list has become the standard - You've probably not even
         | noticed it, but Android's "Gboard" has contacts permissions,
         | and will use your contact's names and nicknames as well as
         | e-mail addresses as suggestions.
        
           | MiddleEndian wrote:
           | I noticed this (albeit with a different keyboard app) as soon
           | as I switched from Windows Phone to Android. Couldn't stand
           | it really, it would auto-suggest random email addresses and
           | even numbers when I was just trying to type words.
           | 
           | The least-bad Android keyboard I've ever used is the HTC
           | keyboard, since it had a setting to only learn words I
           | explicitly told it to remember.
        
         | mkehrt wrote:
         | I don't think reading your contact list is that weird. iOS
         | autocorrect in English clearly does it.
        
         | seanmcdirmid wrote:
         | Pinyin is popular in the PRC, but not in Taiwan or HK. Taiwan
         | input is dominated by Bopomofo, which is also pronunciation
         | based. Mainland China is the largest country that uses Chinese
         | input methods, however.
         | 
         | I still remember entering Chinese using a pinyin T9 method on
         | my Nokia handset in 2002. Pinyin on a 0-9 number pad, fun
         | times.
        
         | nojs wrote:
         | As a non-chinese pinyin input is so much easier. One of the
         | cool things about it is you can type a whole sentence just
         | using the first letter of each character. Eg "wxzdnwsmbq" on my
         | phone is sufficient to produce "Wo Xiang Zhi Dao Ni Wei Shi Yao
         | Bu Qu ". In English I'd have to type the whole sentence "I want
         | to know why you didn't go".
        
       | mc32 wrote:
       | This wasn't just some random prisoner.
       | 
       | This was a highly educated pioneer who studied in the West and
       | was recruited in the West but chose to go help his country and
       | was initially rewarded with a very prestigious and secure job and
       | also deserved --to then get caught up in the hong weibing purges.
       | This is the lesson of ideology -it goes astray fast. Lukily for
       | him, he didn't end up beaten to a pulp and dead in a semi
       | anonymous lynching[1] in a no-name square.
       | 
       | [1]https://www.theguardian.com/world/2016/may/07/mao-little-
       | gen...
        
         | dang wrote:
         | Please don't take HN threads on generic ideological tangents.
         | All that does is replace an interesting, specific, fresh topic
         | with a shallow, predictable and boring one.
         | 
         | I don't mean that it isn't exciting, but something can be both
         | exciting, in the sense that it excites strong feelings, and
         | boring, in the sense that it doesn't contain any new
         | information. Rhetoric like "beaten to a pulp and dead in a semi
         | anonymous lynching in a no-name square" is an excellent example
         | of what I mean: it's highly exciting and at the same time not
         | intellectually interesting.
         | 
         | The larger generic topics are like black holes that suck in
         | inquisitive spaceships (us!) that happen to veer nearby (https:
         | //hn.algolia.com/?dateRange=all&page=0&prefix=true&que...). It
         | takes a certain awareness to remember this, so that we can
         | stick to more interesting conversation. Certainly the discovery
         | of how to mechanize the processing of Chinese characters is
         | much more interesting than "the lesson of ideology"--at least
         | in HN's sense of the word "interesting".
         | 
         | That's why we have this rule in the site guidelines. If you'd
         | please review them, we'd appreciate it:
         | https://news.ycombinator.com/newsguidelines.html.
         | 
         | Past explanations: https://hn.algolia.com/?dateRange=all&page=0
         | &prefix=true&sor....
        
           | mc32 wrote:
           | I understand, but I will respectfully disagree. These events
           | occurred during the Cultural Revolution and the article
           | prefaces it with such. It's inextricable. It is true he was
           | an innovator, no doubt, but for the politics at the time,
           | many other minds would have been able to contribute to the
           | progress of the country but were not "as lucky".
        
             | dang wrote:
             | Ok, I can see the argument that I read it too generically
             | and am happy to retreat this time. Thanks for the kind
             | reply!
        
           | jrochkind1 wrote:
           | Agree with other dissenters here that this was not a "generic
           | ideological tangent", it was specific and relevant to the OP.
           | And i found it interesting.
           | 
           | I also understand it's difficult to keep a comment on
           | politics, even when it's not a "generic ideological tangent",
           | from prompting a mess, and respect your inteventions to avoid
           | the messes.
           | 
           | But this was not a generic tangent.
        
             | dang wrote:
             | Ok, you guys, I'll take the point and back off on this one.
             | Thanks for the feedback!
        
               | jrochkind1 wrote:
               | Now watch it turn into a political flamewar and prove you
               | right haha. I mean, you do generally know what you're
               | doing. It was just the template "generic ideological
               | tangent" that got me going, it wasn't at all generic or a
               | tangent... it still might result in a mess though! We'll
               | see!
        
           | 6543ger23 wrote:
           | mc32's comment was informational and worth reading and the
           | reference to the cultural revolution was IMHO very relevant
           | because the guy lactually ived through it when he might very
           | well not have. It comes across as a statement of fact to me,
           | not at all a generic ideological tangent.
           | 
           | The cultural revolution is a mystery to me just as much as
           | anything else on HN, perhaps more so because it was about
           | people, which are more complex and interesting than tech. The
           | link to the article was useful.
        
       | hbarka wrote:
       | This man's biography would make an epic movie. From China, how
       | did he manage to attend and graduate from Leipzig University in
       | Germany during the height of WW2?
        
       | irrational wrote:
       | What happened to Zhi afterwards? I felt like the article ended
       | just as things were getting good.
        
         | jhbadger wrote:
         | He was elected a member of the Chinese Academy of Sciences in
         | 1980 (two years after publishing his paper about the input
         | method) but he was already almost 70 years old, so he didn't
         | have much of a career afterwards. He died in 1993
        
         | morninglight wrote:
         | He left China and went to Japan where he founded "Just Systems"
         | in 1979. https://en.wikipedia.org/wiki/JustSystems
         | 
         | Shortly afterwards, Just began selling the Japanese word
         | processor, Ichitaro.
         | https://en.wikipedia.org/wiki/Ichitaro_(word_processor)
         | 
         | Of course, not everyone agrees with these details;-)
        
           | wizzwizz4 wrote:
           | So why do you believe them?
        
       | btbuildem wrote:
       | I guess I get why he persisted while imprisoned -- anything to do
       | to keep the mind busy. But why would he continue to work
       | afterwards, and benefit the regime that betrayed him? I'd want to
       | watch it burn and then sink, be left as far behind as possible,
       | not enable them to keep up with technological advancements.
        
         | bllguo wrote:
         | is Mandarin the govt's language or his and the people's
         | language? "benefiting the regime" was a mere side effect of his
         | work (in fact the Maoists that persecuted him crumbled in the
         | 70s, so ultimately they did not benefit at all! karmic)
         | 
         | or maybe he just had an intellectual interest in the problem
         | and wanted to solve it no matter what. who knows, but the world
         | is better off for it
        
         | spicybright wrote:
         | This is an interesting question that shouldn't be down voted
         | like it's being.
         | 
         | From how the article is written (although admittedly I only
         | skimmed it), it seems imprisonment didn't have a huge effect on
         | his motivation to solve the problem.
         | 
         | While I can't say I'd be as chipper after being jailed for a
         | year, I can certainly understand the allure of attacking a hard
         | problem with tenacity.
         | 
         | He may have also separated the party from the people he
         | interacted with everyday. In fact, I'd imagine being imprisoned
         | built a lot of comradory with his fellow inmates.
         | 
         | (Would love for someone else more knowledgeable to chime in, I
         | just wanted to start the conversation)
        
       | anonymousiam wrote:
       | This is a good story for the modern age. Not so much because of
       | the technical achievements of the protagonist, but more as a
       | warning for the perpetrators/victims of class warfare in the
       | world today.
       | 
       | My father-in-law (now deceased) was similarly branded and nearly
       | died while spending several years in a labor camp. He was
       | forbidden to work upon his release, and banished to the
       | countryside along with his family. Decades later, he received a
       | formal apology from the CCP, but no compensation.
        
       | jiehong wrote:
       | Looks like this happened 2 years after Cangjie was invented in
       | 1976 according to Wikipedia [0]. Although not in mainland China.
       | 
       | Cangjie is at least still used a little bit.
       | 
       | [0]: https://en.wikipedia.org/wiki/Cangjie_input_method
        
       ___________________________________________________________________
       (page generated 2022-01-26 23:00 UTC)