Zac Stewart

Have an interesting problem? Let's talk

Building A Language Identifier

I recently gave a talk on language identification at Big Nerd Ranch. The gist of it was extracting text from Wikipedia and training a naive Bayes classifier to predict the language of text. You can check out the resulting language identification service, Langue.