Multimodal Generative Reward Model — Glossary — ThinkLLM