Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: Unknown Language Identifier!

From:J Matthew Pearson <pearson@...>
Date:Monday, January 29, 2001, 18:40
John Cowan wrote:

> J Matthew Pearson wrote: > > > Wearing my linguist hat, I have to say that I'm extremely dubious that > > this would be useful tool in determining the relatedness of languages. > > In all fairness, it's not meant for that. It's meant when you have > a sample of an unknown (to you) language, and want to know where to > go with it. That being so, strong superficial resemblance is usually > what you want.
The author of the website says it's based on a computer program for calculating the relatedness of languages (echoes of lexicostatistics). I was commenting on its usefulness in that capacity. In particular, I was criticising the passage where the author explains the scoring system, in which he says that if the sample text you enter scores in the medium range against one of the comparison languages, it will likely be a member of the same family as that comparison language (the example he gives is that if you enter a text in Shona, a Bantu language, it will score highest against the Bantu comparison languages, namely Swahili and Sotho). I don't see how he can seriously claim that, given that orthographic similarity is a bad test for genetic relatedness (Malagasy being a case in point). Of course, I would need to know more about what exactly the program was doing when it compared texts. Maybe he compensates for this somehow... Matt.