Building A Language Identifier

Building A Language Identifier


I recently gave a talk on language identification at Big Nerd Ranch. The gist of it was extracting text from Wikipedia and training a naive Bayes classifier to predict the language of text. You can check out the resulting language identification service, Langue.