Re: Online Language Identifier

From: Jim Henry <jimhenry1973@...>
Date: Tuesday, August 30, 2005, 17:47

On 8/30/05, David J. Peterson <dedalvs@...> wrote:

> a resource was just posted about an online language identifier.
> It can be found here:
>
> http://www.xrce.xerox.com/competencies/content-analysis/tools/guesser-ISO-8859-1.en.html
>
> Basically it identifies the language that you put into the text
> field (a sentence of five words or more). It was reviewed on the
> blog Tenser Said the Tensor. The author put in Klingon, Quenya
> and Sindarin. Klingon apparently was fairly consistently identified
> as Maltese. I tried a couple of mine. The results:

I tried two different sentences of gjax-zym-byn (in Unicode); one was
identified as Irish_utf8 and the other as Swahili. The same sentence
in ASCII encoding was identified as Maltese.

A few extemporaneous words of Toki Pona were identified as
Esperanto_iso3.

-- 
Jim Henry
http://www.pobox.com/~jimhenry/conlang.htm
...Mind the gmail Reply-to: field