Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: Conlanging as a personal thing

From:Elyse Grasso <emgrasso@...>
Date:Wednesday, March 12, 2003, 19:56
On Wednesday 12 March 2003 10:52 am, Roger Mills wrote:
> Mike Ellis wrote: > > > Or to go even beyond that: that both writers may create new words,
BUT
> that > > > any person who does so must also make three cognate or related
words
> (i.e. > > > from the same root) or other words with concepts complementary /
related
> / > > > opposite to the new word. That way, every new word would seem to
have
> > > related words or concepts in the language PLUS the lexicon would
grow
> that > > > much faster. > > That accounts for the rapid growth of the Kash lexicon (and one of the > reasons I have trouble remembering things)... For ex. every
adjective/verb
> has a potential 6 derivatives-- base, inchoative, causative,
potential,
> accidental (verb forms); patient/abstract noun, agent noun; to say
nothing
> of possible full or partial reduplications, idioms, colloq. variants.
Thus,
> according to the statistics, there are 1649 head entries in the
dictionary,
> but considerably more words. > > (Is there some way I could tell the computer to count all words in > _boldface_? That would give a more accurate count. Oho-- could I tell
it to
> count "< bold > " in the html version???) > >
I'm building my lexicon in XML, with a perl program that sucks in the XML and 1) writes out a sorted xml file 2) writes out an xhtml version sorted by Term (the Jouevyaix or Imperial morpheme) 3) writes out an xhtml version sorted by Class 4) writes out an xhtml version sorted by Keyword (providing an English-to-Jouevyaix listing) 5) prints the numbers of terms on the command line (266 for Jouevyaix as of last night: about 240 more than there were a couple of weeks ago, though some of the older terms are several years old.) The lexicon on my website only has the sections sorted by Term and Class. The perl tool is currently hardcoded for my development environment, but if anyone is interested in the tool set, I can make a generalized version available. The XML dtd is currently very simple: A <lexicon> contains multiple instances of <entry> tags. An <entry> contains one <term>, one <gloss>, one <xsampa>, one or more <class> tags, zero or more <keyword> tags and zero or more <glyphs> tags. Most of my XSAMPA tags are currently empty... my next version of the perl tool will populate the empty ones by translating my Latin1 representation into XSAMPA. The resulting entries will need tweaking (eventually) but should be better than nothing. The <glyphs> tag is forward-looking to a time when I (1) develop the con-orthographies to a point where I'm happy with the way they look, (2) find a good way to display them on line and (3) research how to deal with the directional problem in a mixed rendering environment. [*] I'm planning to add <section> tags (paragraphs) within the glosses, and eventually <usage> tags for usage examples. (All the existing corpus has appeared in postings on this list in the past couple of weeks, so there isn't much usage available yet. Actually... making a rule that every word in the lexicon should be used in a sentence might help in learning the language.) Suggestions for additional tags that might be useful in a lexicon will be gratefully accepted. * Imperial is written right to left, top to bottom. Jouevyaix is formally written with beads, strings and crochet hooks, but has a diagrammatic 2-dimensional representation that can use almost any layout imaginable. If I ever do an example of spiral or labyrinthine page layout, it will have to be a big jpeg, but even something as conservative as 'vertical boustrophedon starting in the lower right corner' presents challenges to standard web rendering engines. -- Elyse Grasso