Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Multimodal

Multimodal

architecture

A model that can process and understand multiple types of input, such as both text and images.

Learn more on Wikipedia

Related Capabilities

Quality of vision, audio, and image understanding (distinct from modality support)

Multimodal — Glossary — ThinkLLM