Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Execution-Grounded Metrics
Execution-Grounded Metrics
techniques
Evaluation measures based on actually running code and tests, rather than static analysis alone.
Execution-Grounded Metrics — Glossary — ThinkLLM