libera/#maemo/ Tuesday, 2020-10-13

07:33 <brolin_empey> Can text written in a language such as Chinese or Japanese, not using something like Pinyin that uses an alphabet, be sorted like text written in a language such as English that uses an alphabet?
10:50 <deepy> brolin_empey: it's a bit more complicated than that, but it can be done
10:56 <Maxdamantus> Is the question about whether it's feasible to come up with an ordering, or whether there is a sensible ordering that normal people will understand?
10:56 <Maxdamantus> Because the answer to the former question is a fairly simple "yes".
10:57 <Maxdamantus> You can just order by Unicode code points (or equivalently, UTF-8 bytes) if you want *some* order.
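
A minimal sketch of the code-point ordering mentioned above, assuming Python 3, where `str` comparison is by code point. The word list is a made-up sample, not something from the discussion. It also checks that sorting by the UTF-8 encoded bytes gives the same order.

```python
# Hypothetical sample; the fifth entry is a character outside the BMP (U+20000).
words = ["日本", "中文", "apple", "かな", "\U00020000", "Zebra", "カナ"]

# Python compares str values code point by code point.
by_code_point = sorted(words)

# UTF-8 preserves code point order, so comparing the encoded bytes
# should produce the same ordering.
by_utf8_bytes = sorted(words, key=lambda w: w.encode("utf-8"))

print(by_code_point == by_utf8_bytes)
```

This gives *some* stable order, but not one a reader of Chinese or Japanese would recognise as a dictionary order.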
10:58 <Maxdamantus> Based on some quick Googling, it seems like there are conventional orderings for Chinese logograms involving stroke count.
10:59 <Maxdamantus> and regarding Japanese, I think most Japanese text you'll normally find nowadays is in kana, which I imagine has a typical ordering just like there are orderings of western alphabets, since there are only something like 12*5 kana.
11:00 <Maxdamantus> I guess if it's 12*5 it's a bit more than a typical western alphabet .. I don't think there are that many actually.
11:01 <Maxdamantus> 10*5 - 3 + 1
11:01 <Maxdamantus> https://en.wikipedia.org/wiki/Goj%C5%ABon
11:07 <Maxdamantus> https://en.wikipedia.org/wiki/Chinese_characters#Indexing
13:54 <deepy> Last I heard of Japanese there was an intermediate step, you converted everything to something that made more sense and then used that for ordering
13:55 <deepy> And just sorting it like you'd do with our alphabet ended up being absolutely terrible
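
A rough sketch of the intermediate step described above, assuming each word comes with a kana reading to sort by. The `readings` table is hypothetical and hand-written for illustration. Sorting hiragana by code point only approximates gojūon order, so a real application would more likely rely on a proper collation library.

```python
# Hypothetical lookup table: written form -> hiragana reading.
readings = {
    "東京": "とうきょう",
    "大阪": "おおさか",
    "京都": "きょうと",
    "青森": "あおもり",
}

words = list(readings)

# Naive approach: order by the code points of the kanji themselves.
by_code_point = sorted(words)

# Intermediate step: convert each word to its reading, then sort on that key.
by_reading = sorted(words, key=lambda w: readings[w])

print(by_code_point)
print(by_reading)
```

The two lists come out in different orders, which is the mismatch being described: the kanji code points carry no information about how the words are read.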
15:40 <dreamer> g34
15:40 <dreamer> woeps
