Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Visual-Textual Attention
Visual-Textual Attention
techniques
How a multimodal model allocates focus between visual and text information when processing inputs.
Visual-Textual Attention — Glossary — ThinkLLM