by Michael S. Kaplan, published on 2006/07/03 16:30 -04:00, original URI: http://blogs.msdn.com/b/michkap/archive/2006/07/03/655521.aspx
Back in the end of May, Dare posted some thoughts about the language of search in his post Our Multi-Lingual World and Search Indexes. It seems that he and the Windows Desktop Search folks have both come to the conclusion that they "...should consider simply choosing the indexer based on the locale/language of the Operating System."
We'll ignore for a minute how this is a very non-specific setting that will almost never be correct since the meaningful possble choices for a user are the user locale and the user UI language -- neither of which are based on an OS level setting.
Let's focus on the fact that this is hugely ignoring scenarios that are becoming more and more common -- such as those behind using the Windows Multilingual User Interface on a server to support many individual users, or the bigger and bigger number of multilingual users (evident even in the small sample of the nine comments to Dare's post!).
I am forced to wonder whether the solutions that Google and Live appear to be using involving the building of huge multilingual interfaces and perhaps throwing away the cases where no meaningful content is found are such a bad idea. I mean, it is obviously a brute force approach, but as long as computers seem to keep failing in MT then it is obvious that they can also fail in language guessing scenarios that are not brute force operations.
And I definitely would like to see fewer people relying on the locale/language of the OS....
This post brought to you by চ (U+099a, a.k.a. BENGALI LETTER CA)
Ruben on 3 Jul 2006 8:02 PM:
Michael S. Kaplan on 4 Jul 2006 1:56 AM:
Erik Fortune on 5 Jul 2006 4:44 AM: