Re: OT: Question: Unicode
From: Herman Miller <hmiller@...>
Date: Sunday, May 18, 2003, 22:08
On Sun, 18 May 2003 01:48:50 -0500, Carlos Thompson <chlewey@...>
wrote:
>Roger Mills wrote:
>
>
>> I've created a web page using MS Word, and Lucida Sans Unicode. In the
>> header, MS says "charset-MS 1252" or somesuch. Should this be changed to
>> UTF8?
>
>Well, you should say UTF-8 if the text file is in UTF format, that is, if
>you will give entities above ASCII with variable length codes (those that
>look like ë for an á). You should use MS 1252, or better: ISO-8859-1, if
>you plan to use Latin-1 codes (as in this e-mail) and html numeric entities
>(those codes that look like "&#8221;") for Unicode values over 255.
In theory this is correct, but some browsers won't correctly display
Unicode characters unless you specify the character set as UTF-8 in the
header.
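
To make the two options concrete, here is a rough Python sketch (not taken from either post). It shows what the bytes look like when a page is saved as UTF-8, what a reader sees if those bytes are read as Latin-1, and how a Latin-1 page can carry a character above 255 as a numeric entity. The sample characters and the meta_tag helper are made up for illustration; the meta line is only one common way of declaring the charset in an HTML header.

```python
# Illustrative only: sample characters chosen for this sketch.
A_ACUTE = "\u00e1"       # a-acute, inside Latin-1 (code 225)
RIGHT_QUOTE = "\u201d"   # right double quotation mark, above 255

# Option 1: save the page as UTF-8. The a-acute becomes two bytes.
utf8_bytes = A_ACUTE.encode("utf-8")          # b'\xc3\xa1'

# If a browser reads those two bytes as Latin-1, it shows two
# characters (A-tilde, inverted exclamation mark) instead of one.
mojibake = utf8_bytes.decode("latin-1")

# Option 2: save the page as Latin-1 and write anything the charset
# cannot hold as an HTML numeric entity.
latin1_page = (A_ACUTE + " " + RIGHT_QUOTE).encode(
    "latin-1", "xmlcharrefreplace"
)                                             # b'\xe1 &#8221;'


def meta_tag(charset: str) -> str:
    """Hypothetical helper: build the charset declaration for the header."""
    return (
        '<meta http-equiv="Content-Type" '
        f'content="text/html; charset={charset}">'
    )


if __name__ == "__main__":
    print(utf8_bytes)
    print(latin1_page)
    print(meta_tag("utf-8"))
```

Whichever option you pick, the charset named in the header has to match the bytes actually saved in the file.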