Re: magic natlang corpus harvesting
From: | Roger Mills <rfmilly@...> |
Date: | Thursday, May 27, 2004, 5:31 |
Mark P. Line wrote:
>
> I googled for "ang mga" (including the quotes, so it looks for this as a
> phrase) and pulled up the Tagalog Subweb in all its glory.
>
Not sure I understand what this is all about, or for, but googling for the
common Bah.Indonesia verb "mendapat" 'get, receive' (sometimes 'be able')
pulled up some 240,000 cites.
Another common verb "berkata" 'say(s)' pulled up 160,000. AFAICT, all in
Indonesian except, oddly, #2, a recipe for "Meatloaf a la Berkata" with an
Israeli address!!