A Transformer-based neural network model that is pretrained on large amounts of text data and then adapted to specific tasks.
Multi-step reasoning, logic puzzles, mathematical problem-solving