Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: Um...help with unicode?

From:Muke Tever <mktvr@...>
Date:Monday, November 4, 2002, 21:42
From: "Roger Mills" <romilly@...>
> OK, now I have a question: In the Font Creator Program I recently > installed, there is a long list of (MS) Unicode stuff, in Hex format > (decimal is available too I think) such as, among others...(I quote): > > $01AC Latin Cap. I with tilde > $01AE Latin Cap. I with macron > ..... > $01B6 and $01B7 Latin Cap/Small k with cedilla > .... > $01F1 - $022B most of the Greek Alphabet > $011C on, some of the Cyrillic alphabet. > > (Very last) $028C Small Greek letter pi > Unquote > (I take it 028C = 268 in base ten??)
Nope, it's 652.
> It strikes me that many of these correspond to characters already available > on my Win98/WP8 machine in the various Special Characters options, > Multinational fonts etc. So I'm wondering, if I used these code nos. in a > web document, would they show up as advertised? Presumably that "$" has to > be replaced with "&", but anything else? (Obviously I haven't tried it yet. > These things scare me)
Actualy, I don't think it would. First off, since it's hex, it'd have to be "&#x028C" (I think is the format--with a semicolon at the end), and not all browsers understand character references in hex, so you'd want to convert it to decimal, "&#652" with a semicolon. But second off, even though it looks like you're describing the WGL4, these numbers don't correspond to Unicode values. 01AC = Latin Capital T with Hook (Cap I with tilde = 0128) 01AE = Latin Capital T with Retroflex hook (Cap I with Macron = 012A) 01B6 = Latin small letter z with stroke (K-cedilla = 0137) 01B7 = latin capital Ezh (k-cedilla = 0138) 01F1-022B = Latin capital DZ to small letter o with dieresis and macron (Greek = 0374 to 03E0, roughly) 011C = Latin Capital G with circumflex (Cyrillic = 0400 to 04F9 or so) 028C = the IPA symbol for [V] (small pi = 03C0) so I don't know what your program is doing. *Muke! -- http://www.frath.net/