Theiling Online    Sitemap    Conlang Mailing List HQ    Attic   

Re: Avoiding near-collisions in vocabulary coinage

From:Alex Fink <000024@...>
Date:Tuesday, August 5, 2008, 0:44
On Mon, 4 Aug 2008 14:50:25 -0700, David J. Peterson <dedalvs@...> wrote:

>Jim: ><< >I should clarify; I meant, not avoiding new words in one's conlang >that sound too similar to existing natlang words, but avoding words >that are too similar to words already in the same conlang. > >>
I haven't spent any real effort on a language where this is a goal. All of my few engelangish projects so far have been more or less taxonomic, with all the ridiculous simultaneous resemblences in sound and meaning that come with this. /spliw/ 'coconut', /splIw/ 'date', /spljiw/ 'eggplant', /spljIw/ 'avocado'...
>I always keep things in dictionary format in alphabetical order >(usually an alphabetical order derived for that language). So >if it strikes me that, "Oh! Nalo would be a great word for >pheasant!" and then I look at the dictionary and see that "nalu" >means "game hen", I can reevaluate my decision.
If I had that particular sort of situation happen to me, picking words to suit my lamatyave (as opposed to forward-deriving, say), I'd probably do one of two things: either lump them together to _nalu_ 'pheasant, game hen' and then distinguish them by modifiers ('little _nalu_', '_nalu_ with a wattle', ...) or else leave them and invoke some mysterious nature of diachronic relation. (In fact pjaukra has a pair of the second sort in really the same semantic field: _xaru_ 'raven' and _xarana_ 'eagle'.)
>Jim: ><< >I also use a couple of scripts that tell me the most and >least common onsets and rimes in the lexicon, to give >me ideas for potential new wordforms that are >unlikely to conflict with existing ones. > >> > >This would be very, very cool, if you could adapt it. For >example, I have an idea which syllables end and begin >words most frequently in Kamakawi, but I'd like to see >it for certain for when I happen to want to create a new >word that's different. Alas, stupid Perl; stupid programming...
It would be really great if someone wrote a tool that could do this for a general phonotactics: you tell it your word / stem / whatever structure, it spins through your lexicon and notices, hm, you've used quite a lot of coda /n/ compared to onsets, but a comparative paucity of /un/, etc. This'd be handy not only for gap-filling but for looking for patterns which could be traces of former sound-changes if you were trying to erect a diachrony on an extant lexicon. Thing is, this would seem to need something that could represent a general phonotactics. Back when I was writing my (C++) reversible sound change applier this seemed eminently in reach of my techniques, and perhaps it is. But I've not the time to write anything like this atm, and it might be better left to someone with more familiarity with a more limber proglang anyways. Alex