brolin_empey | Can text written in a language such as Chinese or Japanese, not using something like Pinyin that uses an alphabet, be sorted like text written in a language such as English that uses an alphabet? | 07:33 |
---|---|---|
deepy | brolin_empey: it's a bit more complicated than that, but it can be done | 10:50 |
Maxdamantus | Is the question about whether it's feasible to come up with an ordering, or whether there is a sensible ordering that normal people will understand? | 10:56 |
Maxdamantus | Because the answer to the former question is a fairly simple "yes". | 10:56 |
Maxdamantus | You can just order by Unicode code points (or equivalently, UTF-8 bytes) if you want *some* order. | 10:57 |
Maxdamantus | Based on some quick Googling, it seems like there are conventional orderings for Chinese logograms involving stroke count. | 10:58 |
Maxdamantus | and regarding Japanese, I think most Japanese text you'll normally find nowadays is in kana, which I imagine has a typical ordering just like there are orderings of western alphabets, since there are only something like 12*5 kana. | 10:59 |
Maxdamantus | I guess if it's 12*5 it's a bit more than a typical western alphabet .. I don't think there are that many actually. | 11:00 |
Maxdamantus | 10*5 - 3 + 1 | 11:01 |
Maxdamantus | https://en.wikipedia.org/wiki/Goj%C5%ABon | 11:01 |
Maxdamantus | https://en.wikipedia.org/wiki/Chinese_characters#Indexing | 11:07 |
deepy | Last I heard of Japanese there was an intermediate step, you converted everything to something that made more sense and then used that for ordering | 13:54 |
deepy | And just sorting it like you'd do with our alphabet ended up being absolutely terrible | 13:55 |
dreamer | g34 | 15:40 |
dreamer | woeps | 15:40 |
Generated by irclog2html.py 2.17.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!