Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: ML4 (was Re: TECH: Testing again etc.)

From:John Cowan <cowan@...>
Date:Thursday, November 20, 2003, 11:58
Isaac Penzev scripsit:

> > > And I would recommend to substitute t-comma-below with t-cedilla > > > _??_, because t-comma-below has a bit dubious status (nobody knows > > > for sure if they are the same or not). > > > > Very good point, thanks. Cedillas not being available on all the > > consonants I might like them on, I was using comma-below as a quick fix, > > but noticed that some applications mangle them with higher frequency > > than other bits of unicode. > > John Cowan may enlighten us Re this matter, but I'm afraid > t-with-comma-below is not treated well in the present state of Unicoded > applications...
Here's the story. In Latin-2 (ISO 8859-2), which was meant to cover both Turkish and Romanian (among other languages), the Romanian s with comma below was merged into the Turkish s with cedilla, and as a consequence the Romanian t with comma below became the t with cedilla, a character not otherwise used (as far as anyone knows). In the event, Turkish ended up using Latin-5 (ISO 8859-9) instead, a modified Latin-1 that replaced the Icelandic-specific characters with Turkish-specific ones, so the unification was pointless but official. In Unicode versions up to 2.1, this unification was maintained, but in Unicode 3.0, the Romanian characters were added. The bogus t with cedilla could not be removed, of course, due to the Unicode stability policies.
> Why? I use Arial Unicode MS (here at the office), and the word looks > perfect!
Note that AUMS is a Unicode 2.1 font and is not being maintained. You might want to look at Code2000; in the realm of Latin and Greek character, Gentium is also an excellent font (Cyrillic coming soon). -- "No, John. I want formats that are actually John Cowan useful, rather than over-featured megaliths that http://www.ccil.org/~cowan address all questions by piling on ridiculous http://www.reutershealth.com internal links in forms which are hideously jcowan@reutershealth.com over-complex." --Simon St. Laurent on xml-dev

Reply

Isaac Penzev <isaacp@...>TECH: Unicode ranges in various fonts (was Re: ML4)