The computational expense and resource usage required to process or generate tokens, which increases when a model performs additional reasoning steps.