Re: Unknown Language Identifier!
From: | Dennis Paul Himes <dennis@...> |
Date: | Tuesday, January 30, 2001, 1:34 |
Padraic Brown <pbrown@...> wrote:
I put in the Gladilatian on my Examples web page and got:
Czech: 0.0272
Swahili: 0.0252
Hungarian: 0.0242
Polish: 0.0228
Since Gladilatian was designed to look alien I was pleased to get such
low scores. (0.2500 is the suggested threshold for a match.)
dirk elzinga <dirk.elzinga@...> wrote:
:
: So this leads to the following idea. Take an English text and
: respell it using Wijk's Regularized English and see how it
: scores.
:
: Try the same for New Spelling, Cut Spelling, etc.
I put in "Dhe Oull and Dha Puusceecat" and "Dha Bable Text" in Vermont
Revised English and got:
AngloSaxon: 0.1609
English: 0.1323
SchwytzZurich: 0.0722
Somali: 0.0708
===========================================================================
Dennis Paul Himes <> dennis@himes.connix.com
homepage: http://www.connix.com/~dennis/dennis.htm
Gladilatian page: http://www.connix.com/~dennis/glad/lang.htm
Disclaimer: "True, I talk of dreams; which are the children of an idle
brain, begot of nothing but vain fantasy; which is as thin of substance as
the air." - Romeo & Juliet, Act I Scene iv Verse 96-99