not necessarily. I've experimented quite a bit with this, and have achieved very good results for less than pennies, by limiting the reasoning capability but not the context window. Or vice versa, depending on the use-case
You are viewing a single comment's thread from:
Thank you for this information. I didn't think of such limits.
You got it 👍 It just keeps getting cheaper and more flexible