A single model design that handles multiple different tasks without needing separate specialized models for each task.
Quality of vision, audio, and image understanding (distinct from modality support)