Re: TECH: Testing again, no new on-topic content (was Re: "Language Creation" in your conlang)
From: | Muke Tever <hotblack@...> |
Date: | Monday, November 17, 2003, 15:34 |
On Mon, 17 Nov 2003 08:59:21 -0500, Mark J. Reed <markjreed@...>
wrote:
> On Mon, Nov 17, 2003 at 02:46:07PM +0200, Isaac Penzev wrote:
>> Georgian -- ქა?? თული ენა ლამაზია.
>
> FYI, there's definitely an encoding error there, at least in the
> message as it reached me. The byte-by-byte hex codes are as follows:
>
> 47 65 6f 72 67 69 61 6e 20 2d 2d 20 e1/83/a5 e1/83/90 e1/83/20
> G e o r g i a n SP - - SP ქ ა ?
>
> 0xE1 0x83 0x20 is an illegal UTF-8 sequence; 0xE1 can only begin
> a three-byte character, but 0x20 can only be a single-byte character.
> Looks like a byte got dropped somewhere.
It's been noted before that the Listserv appears to eat certain characters
[whether this is true or not still...?]; possibly one of these is in the
UTF-8 representation of that Georgian letter. (and in some of the other
test chars that have been eaten)
*Muke!
--
http://frath.net/
http://kohath.livejournal.com/
E jer savne zarjé mas ne (You put music in my heart
Se imné koone'f metha And with the spirit of an artist
Brissve mé kolé adâ. I will make the dreamtime)