← Bibliography

SysMoBench: Evaluating AI on formally modeling complex real-world systems

View original ↗

Note

LLMs handle small formal modeling artifacts; significant performance gaps remain for complex real-world distributed systems in TLA+.

Citation Key

cheng2025sysmobench