Re: CPA - An ASCII-based phonetic alphabet
From: Lars Henrik Mathiesen <thorinn@...>
Date: Saturday, November 17, 2001, 13:11
> Date: Sat, 17 Nov 2001 12:58:03 +0100
> From: Boudewijn Rempt <boud@...>
> On Saturday 17 November 2001 12:29, Lars Henrik Mathiesen wrote:
> > Well, there's one replacement character in there which was
> > supposed to be a lax u (X-SAMPA /U/). The same thing happened to
> > Muke's reply, so I think it's the listserv that's not quite 8-bit
> > clean.
> > (It seems to be replacing 0x8C bytes (PLU control) with 0x20 bytes
> > (spaces). And I even took care to use QP encoding, which the
> > listserv itself removed --- perhaps it's the QP decoder that's
> > broken).
> The QP conversion might be the problem - the says that it
> autoconverted from qp to 8-bit, so if you were to send 8-bit text,
> utf-8 encoded, it might work better. Let's see. KMail should now
> send mail using utf-8, 8-bit clean, not mime-encoded.
> 孟子 見 粱 潓 王. 王 曰.
> 叟 不 �� 千 里 而 來. 亦
> 將 有 以 利 吾 國 乎.
> ॥अथ नलोपारवयानम॥
> ृबहदशव उवाच। उपपननो
I saw this perfectly in the copy I got directly --- but the listserv
mangled an 0xA0 byte into an 0x20. (There weren't any 0x8C bytes this
time.)
(On the other hand, Emacs is being balky, insisting on putting in
weird ISO 2022 codes when I send without QP --- so I won't).
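
To make the failure concrete, here is a minimal Python sketch of the
substitution described above. The mangle() helper is hypothetical: it
only models "every 0xA0 byte becomes 0x20" and is not the listserv's
actual code. It compares raw 8-bit UTF-8 with a quoted-printable wire
form, using the standard quopri module.

```python
import quopri

def mangle(data: bytes) -> bytes:
    """Hypothetical model of the gateway: every 0xA0 byte becomes 0x20."""
    return data.replace(b"\xa0", b"\x20")

text = "\u00e0"              # a-grave; its UTF-8 bytes are C3 A0
raw = text.encode("utf-8")

# Sent as raw 8-bit UTF-8, the trail byte is hit and the sequence breaks,
# so a decoder has to substitute U+FFFD for the orphaned lead byte.
damaged = mangle(raw).decode("utf-8", errors="replace")

# Sent as quoted-printable, the wire form is pure ASCII, so this
# particular substitution finds nothing to replace.
wire = quopri.encodestring(raw)
restored = quopri.decodestring(mangle(wire)).decode("utf-8")

print(repr(damaged), wire, repr(restored))
```

This only models the byte substitution. If the listserv decodes QP
before the damage happens, as suspected earlier in the thread, the
protection is lost.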
> > Anyway --- this list is not quite ready for Unicode IPA, it seems.
> If I send this message directly to another email account, it arrives
> in perfect form. I wonder what happens when I get it back through
> the listserver. I think it should be possible to get something
> working... I mean, there's no OS left nowadays that doesn't claim
> full Unicode support.
I'll just quote the upper quarter of Latin-1 here. When sent as UTF-8,
it will use all the byte values from 0x80 to 0xBF...
À Á Â Ã Ä Å Æ Ç È É � Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
� á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
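
The claim about byte values can be checked with a short Python sketch.
It encodes U+00C0 through U+00FF as UTF-8 and collects the lead and
trail bytes; char_for_trail_byte() is a name made up here for
illustration.

```python
# Upper quarter of Latin-1: code points U+00C0 .. U+00FF.
encoded = [chr(cp).encode("utf-8") for cp in range(0xC0, 0x100)]

lead_bytes = {b[0] for b in encoded}    # first byte of each 2-byte sequence
trail_bytes = {b[1] for b in encoded}   # second (continuation) byte

def char_for_trail_byte(trail: int) -> str:
    """Return the character in this range whose UTF-8 trail byte is `trail`."""
    return bytes([0xC3, trail]).decode("utf-8")

print(sorted(hex(b) for b in lead_bytes))
print(hex(min(trail_bytes)), hex(max(trail_bytes)), len(trail_bytes))
print(char_for_trail_byte(0x8C), char_for_trail_byte(0xA0))
```

Every character in the table shares the lead byte 0xC3, so any byte
from 0x80 to 0xBF that gets damaged in transit shows up as exactly one
broken character.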
Lars Mathiesen (U of Copenhagen CS Dep) <thorinn@...> (Humour NOT marked)