The task of identifying and separating individual objects or regions in an image or video by assigning each pixel to a specific object or category.
Quality of vision, audio, and image understanding (distinct from modality support)
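
As an illustration of the per-pixel assignment in the definition above, here is a minimal sketch in plain Python. It assumes a tiny grayscale image stored as a list of rows, and it uses thresholding plus 4-connected flood fill as a stand-in for a real segmentation model. The names `segment`, `threshold`, and `toy_image` are hypothetical and chosen only for this example.

```python
from collections import deque


def segment(image, threshold=128):
    """Return a label mask: 0 for background, 1..k for each connected bright region.

    Toy stand-in for a segmentation model (assumption): a pixel counts as
    "object" if its intensity is at least `threshold`.
    """
    height, width = len(image), len(image[0])
    labels = [[0] * width for _ in range(height)]
    next_label = 0
    for r in range(height):
        for c in range(width):
            # Skip background pixels and pixels already assigned to an object.
            if image[r][c] < threshold or labels[r][c] != 0:
                continue
            next_label += 1
            labels[r][c] = next_label
            queue = deque([(r, c)])
            # Flood-fill the 4-connected region that contains this pixel.
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and image[ny][nx] >= threshold
                            and labels[ny][nx] == 0):
                        labels[ny][nx] = next_label
                        queue.append((ny, nx))
    return labels


# Hypothetical 3x4 image with two separate bright regions.
toy_image = [
    [200, 200,   0,   0],
    [200,   0,   0, 180],
    [  0,   0, 180, 180],
]
mask = segment(toy_image)
```

Every entry of `mask` corresponds to one pixel of `toy_image`, so the output has the same shape as the input. Collapsing all nonzero labels into a single value would give a category-level mask instead of one that separates individual objects.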