Language detection for non-French data
For instance, La Presse contains Dutch (NL) text, sometimes with French and Dutch on the same page. There are also some special editions in English. Use langid?
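Since a single page can mix languages, one option is to classify each text block separately rather than the whole page. The sketch below is only an illustration under assumptions: blocks are taken to be separated by blank lines, and `classify_stopwords` is a hypothetical stand-in classifier (tiny stopword lists for fr/nl/en) so that the example runs without extra dependencies. The names `page_languages` and `classify_stopwords` are not from any existing code.

```python
import re
from collections import Counter

# Tiny stopword lists: a hypothetical stand-in classifier, for illustration only.
STOPWORDS = {
    "fr": {"le", "la", "les", "et", "des", "est", "une", "dans", "pour", "que"},
    "nl": {"de", "het", "een", "en", "van", "is", "dat", "niet", "voor", "op"},
    "en": {"the", "and", "of", "is", "to", "in", "that", "for", "with", "on"},
}


def classify_stopwords(text):
    """Return (lang, score) for the language whose stopwords occur most often.

    Ties (including the all-zero case) fall to the first language listed.
    """
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    counts = {
        lang: sum(token in words for token in tokens)
        for lang, words in STOPWORDS.items()
    }
    lang = max(counts, key=counts.get)
    return lang, counts[lang]


def page_languages(page_text, classify=classify_stopwords):
    """Classify each blank-line-separated block; count blocks per language."""
    blocks = [b.strip() for b in re.split(r"\n\s*\n", page_text) if b.strip()]
    return Counter(classify(block)[0] for block in blocks)


page = (
    "Le président est dans la ville pour une visite.\n\n"
    "De man is niet op het werk van een vriend."
)
result = page_languages(page)  # one block counted as fr, one as nl
```

If langid turns out to be suitable, its `langid.classify(text)` function also returns a `(lang, score)` pair, so it could be passed as the `classify` argument, and `langid.set_languages(["fr", "nl", "en"])` could restrict the candidates to the three languages mentioned above. Whether it is accurate enough on this data is untested here.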