That's an interesting thought. Not seperating datasets by language could lead to more false positives for languages with few posts, but that's just an assumption of mine right now. Thanks for the input!
You are viewing a single comment's thread from: