Additional refinement applied to a model after its initial training (pre-training) to improve performance on specific tasks or capabilities, such as reasoning or instruction following.