Re: TECH: Testing again, no new on-topic content (was Re: "Language Creation" in your conlang)
From: | Paul Bennett <paul-bennett@...> |
Date: | Monday, November 17, 2003, 21:38 |
On Sun, 16 Nov 2003 22:14:21 -0700, Muke Tever <hotblack@...> wrote:
> On Sun, 16 Nov 2003 22:49:13 -0500, Paul Bennett <paul-bennett@...>
> wrote:
>> OTOH, what's the most useful purely 8-bit encoding for me to post in? My
>> main requirements are to post acuted and trema'd vowels, some kind of
>> accented "s" (ideally s-acute, but any other diacritised "s" will do),
>> and some kind of accented "n" (ideally n-acute, but eng would do). Also
>> nice to have are "combining acute", "combining ogonek" and "combining
>> dot below",
>> but I suspect there's not an 8-bit encoding that handles any of those.
>
> ISO-Latin-2 has those, actually (sans Ï and ï for some reason) except for
> the combining characters [it has the spacing ones, though, for acute and
> ogonek].
Latin-2 seems (http://nl.ijs.si/gnusl/cee/charset.html) to be close to
ideal for my new forthcoming language (still unnamed; provisionally WC8 or
UF1, I suppose, if I'm going to reinstate my old language-provisionally-
naming scheme), which has a wide variety of those glyphs.
Here's the total glyph set used by that language in UTF-8:
p b m f v
t́ d́ ń ś ź
t d n s z
ṭ ḍ ṇ ṣ ẓ
ḱ ǵ ŋ́ x́ q́
k g ŋ x q
ḳ ġ ŋ̇ x̣ q̇
š ž č
ř ŕ r
kt ct
l ł lh dlh
w ẅ y ÿ
h ħ
a á â
e é ê
i í î
o ó ô
u ú û
I might add the capability of nasalisation (shown via ogoneks) to the
language, since it doubles the number of syllables.
Also, in my notes, I'm using real IPA instad of X-Sampa, which is yet
another crimp in the wossname.
Yet again, I wish there was some way of switching encodings partway through
a message (several times), as it would be simple to encode almost of these
with a mix of Latin-1 and Latin-2. I could even downgrade eng to enya if
pressed, and drop h-bar either completely or replace it with kh or ch.
Paul
Replies