
Re: Online Language Identifier

From:James W. <emindahken@...>
Date:Tuesday, August 30, 2005, 12:09
On Tue, 30 Aug 2005 00:50:14 -0700, "David J. Peterson"
<dedalvs@...> said:
> Radiohead will be pleased to know that Xerox is at it again! For
> those of you who don't check Langmaker.com every two hours,
> a resource was just posted about an online language identifier.
> It can be found here:
>
> http://www.xrce.xerox.com/competencies/content-analysis/tools/guesser-ISO-8859-1.en.html
>
> Basically it identifies the language that you put into the text
> field (a sentence of five words or more). It was reviewed on the
> blog Tenser Said the Tensor. The author put in Klingon, Quenya
> and Sindarin. Klingon apparently was fairly consistently identified
> as Maltese. I tried a couple of mine. The results:
Fun! I first put in a sentence of 7 words of my Orēlynna. It came back as Esperanto. Yikes. I put in an additional sentence of 10 words and it guessed Slovakian...closer, I guess. The language was inspired by Icelandic and Finnish.

James W.