Thanks for your thoughtful comment @mattclarke. I can see how the quadratic rewards might have been problematic, but as you say, changing them as effectively just moved the problem to a different kind of behaviour.
You are viewing a single comment's thread from: