Yes, diversity will be a problem. Maybe I could look into the truffles for different tags. I have to see whether it even makes sense to train different models on different tags (in that case the data might become quite sparse, especially for rare tags like pregnancy).
You're right that this isn't very useful for obscure categories with very few posts (#belegarth?), but it might prove effective for popular tags, where each post can be evaluated relative to other posts in the same tag. There's a list of popular tags here which might be a good starting point.
It would be a really nice metric to be able to say something like:
If you haven't already, I'd recommend checking out @thing-2 and @gentlebot. They're both comment evaluator bots that try to do something very similar to what you're doing with this bot.
Thanks for the tip, I'll look at them. I'll also think about post statistics for different tags.
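For what it's worth, the per-tag comparison discussed in this thread could be sketched roughly like this. This is only an illustration, not how the bot actually works: the function name `tag_percentiles`, the `(post_id, tag, score)` tuple layout, and the `min_posts` cutoff are all made up for the example, and "score" stands in for whatever quality or payout number you end up using. Tags with too few posts get no rank at all, which is one simple way to sidestep the sparse-data problem for rare tags.

```python
from bisect import bisect_left
from collections import defaultdict


def tag_percentiles(posts, min_posts=3):
    """Rank each post against the other posts that share its tag.

    posts: list of (post_id, tag, score) tuples (hypothetical layout).
    Returns {post_id: percentile}, where percentile is the share of
    posts in the same tag with a strictly lower score (0 to 100),
    or None when the tag has fewer than min_posts posts.
    """
    # Collect and sort the scores seen for each tag.
    scores_by_tag = defaultdict(list)
    for _, tag, score in posts:
        scores_by_tag[tag].append(score)
    for scores in scores_by_tag.values():
        scores.sort()

    result = {}
    for post_id, tag, score in posts:
        scores = scores_by_tag[tag]
        if len(scores) < min_posts:
            # Too few posts in this tag for a meaningful comparison.
            result[post_id] = None
            continue
        # Number of posts in the tag scoring strictly below this one.
        below = bisect_left(scores, score)
        result[post_id] = 100.0 * below / len(scores)
    return result


if __name__ == "__main__":
    sample = [
        ("a", "photography", 1.0),
        ("b", "photography", 2.0),
        ("c", "photography", 3.0),
        ("d", "photography", 4.0),
        ("e", "belegarth", 5.0),
    ]
    for post_id, pct in tag_percentiles(sample).items():
        print(post_id, pct)
```

With something like this, a post in a popular tag gets a number that only compares it with its own tag, while posts in tiny tags are simply left unranked.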