Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Visual Grounding

Visual Grounding

behavior

The ability to connect specific words or concepts in text to the actual objects or regions they refer to in an image.

Related Capabilities

Quality of vision, audio, and image understanding (distinct from modality support)

Visual Grounding — Glossary — ThinkLLM