Humanoid policy learning — Glossary — ThinkLLM