LLMs can generate syntactically well-formed LTL formulas but frequently produce semantically incorrect ones, suggesting they grasp formal syntax better than formal semantics. That gap matters for real security and policy tools.
This paper tests whether large language models can translate English sentences into Linear Temporal Logic (LTL), a formal language used to specify system requirements and security policies. The researchers find that LLMs handle LTL's grammar well but struggle with its meaning, and that framing the task as code completion significantly improves results.
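The syntax/semantics gap is easy to see concretely. Below is a minimal sketch (not from the paper) of an LTL evaluator over finite traces, using the common finite-trace reading of the operators; the formulas, trace, and proposition names are illustrative assumptions. Both formulas are syntactically valid, but only one actually captures "every request is eventually followed by a response":

```python
# Minimal finite-trace LTL evaluator (illustrative sketch; standard LTL is
# defined over infinite traces, this uses the finite-trace reading).
# Formulas are nested tuples: ("ap", name), ("not", f), ("and", f, g),
# ("implies", f, g), ("F", f), ("G", f).

def holds(formula, trace, i=0):
    """Check whether `formula` holds on `trace` (a list of sets of
    atomic propositions) starting at position i."""
    op = formula[0]
    if op == "ap":
        return formula[1] in trace[i]
    if op == "not":
        return not holds(formula[1], trace, i)
    if op == "and":
        return holds(formula[1], trace, i) and holds(formula[2], trace, i)
    if op == "implies":
        return (not holds(formula[1], trace, i)) or holds(formula[2], trace, i)
    if op == "F":  # eventually: subformula holds at some j >= i
        return any(holds(formula[1], trace, j) for j in range(i, len(trace)))
    if op == "G":  # always: subformula holds at every j >= i
        return all(holds(formula[1], trace, j) for j in range(i, len(trace)))
    raise ValueError(f"unknown operator: {op}")

# "Every request is eventually followed by a response": G(request -> F response)
correct = ("G", ("implies", ("ap", "request"), ("F", ("ap", "response"))))
# A syntactically valid but semantically wrong rendering a model might emit:
# F(request -> response) is satisfied vacuously at any request-free step.
wrong = ("F", ("implies", ("ap", "request"), ("ap", "response")))

# A trace where the second request never gets a response.
trace = [{"request"}, {"response"}, {"request"}, set()]
print(holds(correct, trace))  # False: the correct formula catches the violation
print(holds(wrong, trace))    # True: the wrong formula misses it
```

This is exactly the failure mode the paper describes: the wrong formula parses fine and uses real LTL operators, yet accepts a trace that violates the intended property.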