Re: OT: TECH: Unicode email clients (was Re: OT: Corpses, etc. (was: Re: Gender in conlangs (was: Re: Umlauts (was Re: Elves and Il
From: | Paul Bennett <paul-bennett@...> |
Date: | Tuesday, November 4, 2003, 21:57 |
On 4 Nov 2003 at 14:39, John Cowan wrote:
> Paul Bennett scripsit:
>
> > > That looks more like something trying to be the Windows-specific encoding
> > > rather than either UTF-8 or Latin-1
> >
> > That's what I said. It can do WGL4, but not Unicode.
>
> You mean Win1252, the 8-bit Western European/American character set, not
> WGL4, which is a subset of Unicode (652 characters).
I don't know how to describe it. My understanding is it can input
messages in any 8-bit encoding, and has translation tables to convert
(depending on the character set specified in the headers) into the
appropriate WGL4 characters for display. It can certainly display
Greek, Cyrillic and various Eastern-european languages (if you tamper
with the configuration files), even though the characters needed
don't themselves appear in Win1252.
UTF-8, though byte-based, is not a strictly 8-bit format by that
description, since multiple bytes are needed to represent most of the
character set.
Paul