Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: TECH: Cyrillics (was: Russian, anyone?)

From:Pavel Iosad <edricson@...>
Date:Friday, April 25, 2003, 18:20

> Now, I'm no expert on either Cyrillic or encoding systems, but isn't > there a cyrillic encoding (and I think it _was_ KOI-8R, but I'm not > sure) designed specifically so that if it gets the 8th bit removed, > the ASCII characters produced are a rough transliteration? I remember > reading an article on this a long time ago - seems it contributed to > the mass adoption of this encoding, as mails could be read even when > bit-stripped, which happened a lot in the early days of the Russian > Internet.
FIDO rather. But yes, in KOI8 (not only KOI8-R, there's, for instance, the KOI8-U) the Cyrillic characters are encoded, as far as this is possible, according to the corresponding Latin letters. So in Win-1251 the Cyrillic letters go _a, b, v, g, d, e_, and in KOI8 _a, b, ts, d, e, f, g_. Some correspondences however are rather unpredictable (AFAIR the jery (bl) is coded where the _q_ is) Pavel -- Pavel Iosad Is mall a mharcaicheas am fear a bheachdaicheas --Scottish proverb