An AI model that processes and generates outputs from multiple input types (text, images, etc.) simultaneously.