Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: OT: Question: Unicode

From:Roger Mills <romilly@...>
Date:Sunday, May 18, 2003, 15:28
Carlos Thompson wrote:

> Roger Mills wrote: > > > > I've created a web page using MS Word, and Lucida Sans Unicode. In the > > header, MS says "charset-MS 1252" or somesuch. Should this be changed to > > UTF8? > > Well, you should say UTF-8 if the text file is in UTF format, that is, if > you will give entities above ASCII with variable length codes (those that > look like ë for an á).
(snip) Gracias, Carlos. Yes, I do include characters from the range above 0255, such as glottal stop, and several vowels with macrons or breves. So it appears I should change the setting to UTF 8. Oddly, although I can type in the characters (hex number e.g. 012B, plus Alt-x) I see that they're automatically converted to decimal. How clever of MS Word.
> UTF-8: makes shorter files if you are using lots of codes not available
in
> Latin-1 or any other ISO-8859 code page. The UTF-8 files are difficult to > edit in common text editors (vi, pico, notepad, wordpad, etc)
In what way is it difficult? but if you
> will never touch the HTML file with a text editor, you should not worried. >
It is actually a rather temporary page, which will probably be deleted after people answer my questions. (You'll see what I mean after it's available)

Replies

Jan van Steenbergen <ijzeren_jan@...>
Carlos Thompson <chlewey@...>