Efficiency Considerations
While TTT does require additional compute during inference, it can potentially offer efficiency benefits:
Model Size Reduction: In some cases, TTT lets a smaller model match the performance of a larger one, which can reduce overall compute requirements if the added inference cost stays below the cost of running the larger model.
Compute-Optimal Strategies: Research has shown that compute-optimal strategies for TTT can achieve significant performance improvements while using less computation than naive approaches [1].
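The trade-off between model size and extra inference compute can be made concrete with a rough back-of-the-envelope sketch. It assumes the common approximation that a forward pass costs about 2 FLOPs per parameter per token, and it treats the TTT overhead as a single multiplier on that cost. The parameter counts, token count, and 8x overhead below are hypothetical values chosen for illustration, not measurements from the cited research.

```python
def forward_flops(n_params: float, n_tokens: int) -> float:
    """Approximate forward-pass cost, assuming ~2 FLOPs per parameter per token."""
    return 2.0 * n_params * n_tokens


def break_even_multiplier(small_params: float, large_params: float) -> float:
    """Largest factor by which the small model can scale its inference compute
    and still cost no more than a single pass of the large model."""
    return large_params / small_params


# Hypothetical values, for illustration only.
SMALL_PARAMS = 1e9    # assumed small model size
LARGE_PARAMS = 14e9   # assumed large model size
N_TOKENS = 512        # assumed tokens processed per query
TTT_OVERHEAD = 8      # assumed extra-compute multiplier from TTT

small_with_ttt = forward_flops(SMALL_PARAMS, N_TOKENS) * TTT_OVERHEAD
large_plain = forward_flops(LARGE_PARAMS, N_TOKENS)
budget = break_even_multiplier(SMALL_PARAMS, LARGE_PARAMS)

print(f"small model + TTT: {small_with_ttt:.3e} FLOPs")
print(f"large model alone: {large_plain:.3e} FLOPs")
print(f"break-even TTT overhead: {budget:.1f}x")
```

Under these assumptions the small model stays cheaper as long as its TTT overhead is below the ratio of the two parameter counts. The sketch ignores attention cost, memory, and latency, so it only bounds the compute side of the comparison and says nothing about whether the smaller model actually reaches comparable accuracy.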