A segmentation approach in which you guide the model with prompts, such as clicked points or bounding boxes, that indicate which objects it should segment.
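To make the prompt-driven interface concrete, here is a minimal toy sketch in Python. It is not a real segmentation model: the function name `segment_from_prompt`, the tiny label grid, and the flood-fill rule are all assumptions chosen for illustration. A point prompt selects the connected region under the click, a box prompt restricts the result to a rectangle, and both can be combined.

```python
from collections import deque

# Hypothetical toy "image": each number is a pixel value, 0 is background.
image = [
    [1, 1, 0, 2],
    [1, 0, 0, 2],
    [0, 0, 2, 2],
]


def segment_from_prompt(image, point=None, box=None):
    """Toy promptable segmenter; returns a set of (row, col) pixels.

    point: (row, col) click; selects the 4-connected region that shares
           the clicked pixel's value.
    box:   (top, left, bottom, right), inclusive; limits the result to
           that rectangle. With no point, all non-background pixels
           inside the box are selected.
    """
    rows, cols = len(image), len(image[0])
    top, left, bottom, right = box if box else (0, 0, rows - 1, cols - 1)

    if point is None:
        # Box-only prompt: keep every non-background pixel inside the box.
        return {
            (r, c)
            for r in range(top, bottom + 1)
            for c in range(left, right + 1)
            if image[r][c] != 0
        }

    r0, c0 = point
    if not (top <= r0 <= bottom and left <= c0 <= right):
        return set()  # the click falls outside the box prompt

    # Point prompt: flood fill from the click over equal-valued neighbours.
    target = image[r0][c0]
    mask = {(r0, c0)}
    queue = deque([(r0, c0)])
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = top <= nr <= bottom and left <= nc <= right
            if inside and (nr, nc) not in mask and image[nr][nc] == target:
                mask.add((nr, nc))
                queue.append((nr, nc))
    return mask


# A click on the top-left object selects only that object.
print(sorted(segment_from_prompt(image, point=(0, 0))))
# A box around the right-hand columns selects the object inside it.
print(sorted(segment_from_prompt(image, box=(0, 2, 2, 3))))
```

A learned model would replace the flood-fill rule with a network that predicts the mask, but the calling pattern (image plus point or box prompts in, mask out) stays the same.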
Adhering to complex, structured, or constrained instructions
Quality of vision, audio, and image understanding (distinct from modality support)