The ability to search and find relevant items across different data types, such as finding images using text queries or vice versa.
Quality of vision, audio, and image understanding (distinct from modality support)