Mangling Unicode (was: Fwd: Gzarondan: Spelling Review)
|From:||Isaac A. Penzev <isaacp@...>|
|Date:||Saturday, October 16, 2004, 20:02|
Paul Bennett wrote:
> Ordinarily, you could also use something called UTF-8 to send Unicode
> messages, which can be read by more people. However, the software that
> runs the mailing list occasionally (and, for all practical purposes,
> randomly) mangles UTF-8 characters. I think we ought to perform some kind
> of research to find out exactly which characters are mangled in this way,
> but here and now is probably not the environment. It has to do with their
> underlying bytes containing values 128-160, IIRC, but there's no simple
> memorable way to tell which actual characters will be effected.
If I remember correctly some tests we made about a year ago, they showed
that the Listserv mangles Unicode Cyrillics (usual KOI8 and Win-1251 are
safe) and Georgian.