Re: Multicode
From: John Cowan <cowan@...>
Date: Thursday, May 6, 2004, 12:36
Danny Wier scripsit:
> I'm looking for more information, but I'm having little luck.
This scheme depends on a hopelessly broken encoding trick which requires
one to read a text from the beginning (alternatively, to look very far
back) before being able to even *start* figuring out what's going on.
In UTF-16, you can do random access to any 16-bit code unit and always
know its meaning, and even in UTF-8 you have to look back a maximum of
three bytes before you are in registry.
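
To make the resynchronization point concrete, here is a minimal Python sketch, assuming well-formed input. The function names utf8_char_start and utf16_unit_kind are made up for illustration and come from no library. The first steps back from an arbitrary byte offset to the start of the character containing it; the second classifies a lone UTF-16 code unit by its value alone.

```python
def utf8_char_start(buf: bytes, i: int) -> int:
    """Return the index of the first byte of the character containing buf[i].

    Assumes buf is well-formed UTF-8. Continuation bytes look like
    10xxxxxx, and a character has at most three of them, so the loop
    steps back at most three times.
    """
    start = i
    while start > 0 and i - start < 3 and (buf[start] & 0xC0) == 0x80:
        start -= 1
    return start


def utf16_unit_kind(unit: int) -> str:
    """Classify a single 16-bit code unit without looking at its neighbors."""
    if 0xD800 <= unit <= 0xDBFF:
        return "high surrogate"
    if 0xDC00 <= unit <= 0xDFFF:
        return "low surrogate"
    return "BMP character"


# "a", then U+20AC (three bytes), then U+1D11E (four bytes).
data = "a\u20ac\U0001D11E".encode("utf-8")
print(utf8_char_start(data, 3))  # offset inside the three-byte character
print(utf8_char_start(data, 7))  # last byte of the four-byte character
print(utf16_unit_kind(0xD834))
```

Neither function needs any state carried over from earlier in the text, which is exactly what a scheme with long-range mode switches cannot offer.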
Furthermore, it founders on two obviously bogus notions: that the boundaries
between languages are hard and fast, and that there are only 255 of them.
--
John Cowan jcowan@reutershealth.com http://www.ccil.org/~cowan
Does anybody want any flotsam? / I've gotsam.
Does anybody want any jetsam? / I can getsam.
--Ogden Nash, No Doctors Today, Thank You