Re: Online Language Identifier

From: Michael Potter <mhpotter@...>
Date: Tuesday, August 30, 2005, 23:54
David J. Peterson wrote:
> Radiohead will be pleased to know that Xerox is at it again! For
> those of you who don't check Langmaker.com every two hours,
> a resource was just posted about an online language identifier.
> It can be found here:
>
> http://www.xrce.xerox.com/competencies/content-analysis/tools/guesser-ISO-8859-1.en.html
>
> Basically it identifies the language that you put into the text
> field (a sentence of five words or more). It was reviewed on the
> blog Tenser Said the Tensor. The author put in Klingon, Quenya
> and Sindarin. Klingon apparently was fairly consistently identified
> as Maltese. I tried a couple of mine. The results:
>
[snip results]

Using two verses from the Suvile Babel text (see my sig), I got Romanian and Esperanto (of all things). Strangely, the sentence with more þ's and ð's was the Esperanto one.

If I ever actually *write* anything in Tutèlya or Andan, I'll try to put them in.

--
Michael Potter
Idzon Conworld: http://www.potterpcs.net/idzon