The process of carefully selecting, filtering, and organizing training data to improve a model's performance on specific tasks, rather than relying solely on larger datasets.
Code generation, debugging, explanation, and refactoring
Multi-step reasoning, logic puzzles, and mathematical problem-solving
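The selecting, filtering, and organizing steps in the definition above can be illustrated with a small sketch. This is only one possible reading, not a standard pipeline: the record fields (`text`, `task`, `quality`), the task labels (`coding`, `reasoning`, `chitchat`), the quality scores, and the `curate` helper are all hypothetical and chosen for illustration.

```python
from collections import defaultdict

# Hypothetical records: the field names and the quality scores are
# assumptions made for this sketch, not a standard schema.
RAW_EXAMPLES = [
    {"text": "Fix the off-by-one error in this loop.", "task": "coding", "quality": 0.9},
    {"text": "Fix the off-by-one error in this loop.", "task": "coding", "quality": 0.9},
    {"text": "If all A are B and all B are C, are all A C?", "task": "reasoning", "quality": 0.8},
    {"text": "asdf qwer", "task": "reasoning", "quality": 0.1},
    {"text": "Name a good pizza topping.", "task": "chitchat", "quality": 0.7},
]


def curate(examples, target_tasks, min_quality=0.5):
    """Select by task, filter by quality and exact duplicates, organize by task."""
    seen = set()
    by_task = defaultdict(list)
    for ex in examples:
        # Select: keep only examples for the tasks we want to improve.
        if ex["task"] not in target_tasks:
            continue
        # Filter: drop examples below the (assumed) quality threshold.
        if ex["quality"] < min_quality:
            continue
        # Filter: drop duplicates after normalizing case and whitespace.
        key = " ".join(ex["text"].lower().split())
        if key in seen:
            continue
        seen.add(key)
        # Organize: group the surviving examples by task.
        by_task[ex["task"]].append(ex)
    return dict(by_task)


curated = curate(RAW_EXAMPLES, target_tasks={"coding", "reasoning"})
for task, items in curated.items():
    print(task, len(items))
```

With the sample records above, five raw examples reduce to one coding example and one reasoning example: the off-task item, the low-quality item, and the duplicate are all dropped. A real pipeline would likely replace the exact-match check with fuzzy deduplication and the fixed threshold with a learned or heuristic quality score.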