Multimodal Action Prediction — Glossary — ThinkLLM