An automatic language detector, written in Python 2.7, that uses a character bigram model to predict which of two languages a document is written in. Currently set to distinguish between English and Spanish.
Computational Linguistics and Data Science
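As a rough illustration of how a character bigram model can separate two languages, here is a minimal sketch. It is not the project's actual code: the function names (`bigrams`, `train`, `log_prob`, `detect`), the add-one smoothing, and the tiny training strings are all assumptions made for the example. Each language gets its own table of bigram counts, and a document is assigned to the language whose model gives it the higher log-probability.

```python
from __future__ import division  # keeps the sketch usable on Python 2.7 as well

import math
from collections import Counter


def bigrams(text):
    """Return the list of adjacent character pairs in the lowercased text."""
    text = text.lower()
    return [text[i:i + 2] for i in range(len(text) - 1)]


def train(text):
    """Count each bigram and how often each character starts a bigram."""
    pairs = bigrams(text)
    return Counter(pairs), Counter(p[0] for p in pairs)


def log_prob(text, model, vocab_size):
    """Log-probability of text under a bigram model with add-one smoothing.

    The smoothing scheme is an assumption of this sketch, not something
    stated in the project description.
    """
    pair_counts, first_counts = model
    total = 0.0
    for pair in bigrams(text):
        numerator = pair_counts[pair] + 1
        denominator = first_counts[pair[0]] + vocab_size
        total += math.log(numerator / denominator)
    return total


def detect(text, models, vocab_size):
    """Return the label whose model gives text the highest log-probability."""
    return max(models, key=lambda label: log_prob(text, models[label], vocab_size))


# Toy training data (hypothetical; a real detector needs far larger corpora).
ENGLISH = "the quick brown fox jumps over the lazy dog and then the dog sleeps in the house"
SPANISH = "el rapido zorro marron salta sobre el perro perezoso y luego el perro duerme en la casa"

models = {"english": train(ENGLISH), "spanish": train(SPANISH)}
vocab_size = len(set((ENGLISH + SPANISH).lower()))

print(detect("the dog sleeps in the house", models, vocab_size))
print(detect("el perro duerme en la casa", models, vocab_size))
```

Working in log space avoids numeric underflow on long documents, and the smoothing keeps a single unseen bigram from driving a language's probability to zero.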