What if rigour were fast and free?

AI is writing more of the code. How can teams verify it and be accountable for it? The practices that produce trustworthy software — formal specs, structured requirements, verified transformations — were always effective, just too time-intensive and inaccessible. We’re building open source tools to try to solve that.

❯ ▋

Explore projects Watch demos Sample PDF

What we are building

Decades of computer science and software engineering research — structured product discovery, test driven development, formal methods, and behavior-preserving transformations — have always worked. They were just challenging to apply consistently and time-cost effectively in most commercial settings. We believe that AI makes it feasible to learn and consistently apply these practices even under pressure and constraints. We build across four layers.

Grounding Structured product discovery and formal specifications — what to build and how to specify it.

5 projects

Building Blocks Each capability ships as a library, CLI, MCP server, and REST API from a single codebase. Every block works standalone; together they discover and enrich each other.

8 projects

Test Bed Real projects where we discover what works and what doesn’t.

2 projects

Punts Exploratory bets on where programming is going — live Smalltalk images, deterministic refactoring, and agentic self-modifying code.

2 projects

What we are releasing

All repos ↗

Ethos v3.1.0 Apr 12

chore: post-release v3.0.0 by @claude-puntlabs in https://github.com/punt-labs/ethos/pull/230

Vox v4.5.1 Apr 12

Playback timeout scales to audio duration (vox-ddf) — the fixed 30-second timeout was silently truncating any TTS speech over ~450 charac...

Vox v4.4.0 Apr 11

Background music generation (/music on|off, vox music on|off) — vibe-driven instrumental music that loops during coding sessions. When mu...

What we are reading

All readings →

Blog post

Prediction: AI Will Make Formal Verification Go Mainstream

Martin Kleppmann

Kleppmann argues AI removes the human bottleneck from formal verification — the same thesis driving our work, arrived at independently.

martinfowler.com

LLMs Bring a New Nature of Abstraction

Martin Fowler

Fowler argues LLMs create a new kind of abstraction — probabilistic rather than deterministic — and explores what that means for how we build software.

arXiv preprint (2507.13290)

Towards Formal Verification of LLM-Generated Code from Natural Language Prompts

Aaron Councilman, Samir Datta, Neel Jain, Milo Martin, Val Tannen

Proposes using formal verification to check LLM-generated code against natural language intent — closing the gap between what you asked for and what you got.

What we are learning

All posts →

Apr 10