Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: Tech: Unicode (was...)

From:Christophe Grandsire <christophe.grandsire@...>
Date:Friday, May 7, 2004, 13:49
En réponse à Philippe Caquant :


>I'm on Windows XP Home. > >I noticed that on the Web, you can find "ranges" of >Unicode characters to download, as single fonts.
Unicode is huge. Making a single font containing all the characters is just way too much work. There are fonts trying to do that, but they are not free (Arial MS Unicode, which is about 26MB large, used to be free, but not anymore...) and they never really fill all the ranges.
> I >first thought I just had to load one single Unicode >font and than all glyphs would be at hand when using >the function "Insert / Special Characters" in Word, be >it Tibetan, Amerindian or Braille. It seems like it >doesn't work this way.
You are extremely naive. Practically, we're already used to using various fonts for various parts of the same document (for instance Arial for titles and Times for text). What's wrong in using various fonts for various scripts? the only thing Unicode ensures is that the fonts will be compatible, i.e. that no font can have a Cyrillic letter where another font has a Japanese katakana.
> So I don't know exactly what I >should do.
Download fonts containing the ranges you need.
> At the moment, the Insert function just >proposes me some Latin, Cyrillic, Arabic, Hebraic, >Symbol codes (does this mean that other codes weren't >at hand in release 2.1, or just that the font >currently used is a subset of Unicode 2.1 ?) and >that's about all.
What font is it? The Insert works on a per font basis. Look at other fonts and see what ranges they propose. If you don't have the ranges you need, look on Internet and download the fonts you need.
> So I suppose I should load all >range-fonts one by one (how many are they ?),
That's a nonsense question. There's no body that created a set of fonts filling all the ranges in Unicode. Fonts are created by people who need them, and they put in the ranges they need. There's no such thing as a "range-font". Just look for fonts containing the ranges you need and download them. They will always overlap in some ranges.
> and >before doing an Insert / Special character, changing >to the font I would like to use. In that case, of >course the macros would be much more complicated to >write, because you had to add such tests as (for >decoding): > >- if the code is in the range[x,y], than first change >to font F, supposing font Z is at hand >- if it's in the range[x',y'], then tell the user, one >way or another, that he forgot to install font F' >- etc.
That's what Unicode capable programs already do, in reading. I look at my browser, Opera, for instance. If a character it has to show doesn't appear in the main font of the webpage, it will use the character from another font to show it. If I have no font containing it, it will show a "no character found" character (depending on the encoding, it can be an empty rectangle or a diamond with a question mark in). Of course, if you're using Internet Explorer, you don't have this chance. IE isn't able to pick characters from other fonts than the one the webpage is written in. That makes it very Unicode-unfriendly. Is it so difficult to understand that the macro you're talking about is unneeded? Just use modern Unicode-aware tools, and it will already be built in. But since not everyone on this list can use modern tools (for various reasons), don't use Unicode on the mailing list. People who could use the macro you're talking about won't need it, because they already have Unicode-aware tools, and those who cannot use Unicode-aware tools wouldn't be able to use your macro anyway. Why reiventing the wheel?
>Then probably the different fonts wouldn't belong to >the same release, etc.
As stated already, Unicode has been stable since 1996. Since then, it has only *added* characters. It hasn't *changed* any character that was already there. Do you have any font that is more than 8 years old? I doubt it. So it means that the only thing will be that your older fonts will contain less characters than the new ones. But those characters will be at the place where they are expected to be. In other words, all the fonts will be compatible. Christophe Grandsire. http://rainbow.conlang.free.fr You need a straight mind to invent a twisted conlang.

Reply

Philippe Caquant <herodote92@...>