Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: OT: Unicode 5.0

From:<veritosproject@...>
Date:Tuesday, January 10, 2006, 0:54
Probably the easiest way to design a useful standard is to get rid of
the same entities.  Due to accent marks, tones, etc., we have about 50
"o" characters.  If the accent mark was a separate "modifier"
character, that could significantly reduce the number of characters
and make it more ordered.

On 1/9/06, John Vertical <johnvertical@...> wrote:
> >On 1/9/06, John Vertical <johnvertical@...> wrote: > > > ...At risk of threadjack accusations, I'll use the opening to also fire > >a > > > question that's been bothering me for a while - Why does Unicode include > > > several characters multiple times? There are 6561 different ways to > >write > > > "THAI POEM". If capital alpha is different from capital ay just because > >it's > > > used in a different alphabet to write a different language, isn't (eg) > > > Icelandic "A" also a different character then? Are they really purposely > > > randomly tagging unnecessary etymological/usage information to symbols, > >or > > > is it that they just fudged it up initially (for whatever political > >reasons) > > > and can't fix it at this stage any more? > > > >This is because Icelandic uses the same /script/ as English. Greek > >uses a different /script/, therefore capital alpha gets its own > >encoding, while Icelandic ay is encoded as the same as English ay. > > My argument is that Latin, Cyrillic and Greek capital letters are > essentially one and the same script. > ...Not that I see a point in differentiating by script anyway. I would just > stick with defining glyphs (shapes) and let the users sort out the meaning. > > > >Unicode certainly has fudged a bunch of stuff up initially, and > >unfortunately they can't fix it now. (One thing in particular, I think > >they should have encoded small caps a long time ago. One of the > >proposals that was linked to included a small-cap F and S, and > >mentioned that the only other small caps left unencoded were Q and X. > >Interesting, I thought, so I went on a hunt for all the small caps > >(other than F, Q, S, and X). I could only find a handful of them, and > >they're randomly dotted all over the place: Latin Extended A, IPA > >Extensions, Letterlike Symbols, etc. But anyway, enough of my rant.) > > > >-- > >Hasta la pasta, > >Jonathyn Bet'nct. > > But aren't there a lot of letters (OSVWXZ) which are exactly the same in > small caps and lower case? If they're randomly dotted all over the place > anyway, there isn't even the benefit of having the whole set in one place. > > John Vertical
>

Replies

taliesin the storyteller <taliesin-conlang@...>
Philip Newton <philip.newton@...>