Test-Time Compute¶

Entity Type: Glossary ID: test-time-compute

Definition: A scaling paradigm that improves AI model performance by allocating additional computational resources during inference (test time) rather than just during training. This approach includes techniques like iterative refinement, multiple sampling and selection, chain-of-thought reasoning, tree search methods, and verification processes. Test-time compute allows models to 'think harder' on difficult problems by spending more time on reasoning and validation, often leading to better results on complex tasks.

Related Terms: - inference-scaling - reasoning-time - iterative-refinement - beam-search - compute-scaling

Source Urls: - https://arxiv.org/abs/2408.03314 - https://openai.com/index/learning-to-reason-with-llms/ - https://arxiv.org/abs/2407.21787

Tags: - inference - scaling - reasoning - compute-optimization

Status: active

Version: 1.0.0

Created At: 2025-09-10

Last Updated: 2025-09-10