Despite the increased demands, Huang believes the trade-off is worthwhile, as the quality of the answer is significantly better. He emphasized that Nvidia aims to drive the cost down so that this new type of reasoning inference can be delivered with the same level of cost and responsiveness as previous models.
You are viewing a single comment's thread from: