Open source tools for software engineering teams using AI

SysMoBench: Evaluating AI on formally modeling complex real-world systems

Qian Cheng and Ruize Tang and Emilie Ma and Finn Hackett and Peiyang He and Yiming Su and Ivan Beschastnikh and Yu Huang and Xiaoxing Ma and Tianyin Xu misc 2025 Source: z-spec

View original ↗

Note

LLMs handle small formal modeling artifacts; significant performance gaps remain for complex real-world distributed systems in TLA+.

Citation Key

cheng2025sysmobench