Re: OT: Question: Unicode
From: | Roger Mills <romilly@...> |
Date: | Sunday, May 18, 2003, 15:28 |
Carlos Thompson wrote:
> Roger Mills wrote:
>
>
> > I've created a web page using MS Word, and Lucida Sans Unicode. In the
> > header, MS says "charset-MS 1252" or somesuch. Should this be changed to
> > UTF8?
>
> Well, you should say UTF-8 if the text file is in UTF format, that is, if
> you will give entities above ASCII with variable length codes (those that
> look like ë for an á).
(snip)
Gracias, Carlos. Yes, I do include characters from the range above 0255,
such as glottal stop, and several vowels with macrons or breves. So it
appears I should change the setting to UTF 8.
Oddly, although I can type in the characters (hex number e.g. 012B, plus
Alt-x) I see that they're automatically converted to decimal. How clever of
MS Word.
> UTF-8: makes shorter files if you are using lots of codes not available
in
> Latin-1 or any other ISO-8859 code page. The UTF-8 files are difficult to
> edit in common text editors (vi, pico, notepad, wordpad, etc)
In what way is it difficult?
but if you
> will never touch the HTML file with a text editor, you should not worried.
>
It is actually a rather temporary page, which will probably be deleted after
people answer my questions. (You'll see what I mean after it's available)
Replies