A neural network component that processes images at multiple levels of detail simultaneously, capturing both fine details and broad patterns.
Quality of vision, audio, and image understanding (distinct from modality support)