Open source tools for software engineering teams using AI
Grounding & Formal Methods
Five tools covering the stages most teams skip under pressure:
discover what to build,
require what it must do,
specify its states and invariants, and
enforce structured delegation with audit trails.
AI removes the time penalty from each stage. These tools put that hypothesis
into practice — and are built to help engineers learn the underlying
disciplines, not just apply them.
PR/FAQ
v1.6.1
Amazon's Working Backwards process, inside the terminal.
Generate, review, and stress-test Amazon Working Backwards documents with specialized agents, simulated review meetings, and compiled PDF output. See an example — the PR/FAQ for this tool, written by the tool itself.
Simulated review meeting with four agentic personas who debate weak spots
Researcher agent integrates with Quarry for semantic search
Cascading feedback engine traces effects across all sections
Compiles to print-ready PDF via LaTeX
$ curl -fsSL https://raw.githubusercontent.com/punt-labs/prfaq/8503f06/install.sh | sh
Coming Soon
Use Cases
v0.0.0 alpha
Actors. Goals. Scenarios. Extensions. The spec that comes before the code.
Use Cases will seek to bring Jacobson and Cockburn's structured requirements methodology into Claude Code. The goal: identify the actors in your system, define what each is trying to accomplish, describe the step-by-step scenario that achieves their goal, and systematically work through what can go wrong at each step. The output is a specification document — not code.
Z is a specification language grounded in set theory and first-order predicate logic, developed at Oxford and standardized as ISO 13568. A Z spec describes a system as states, invariants that constrain them, and operations that transition between them — catching entire classes of bugs mathematically, not just the inputs you happened to test. Z Spec makes two powerful tools approachable inside Claude Code: Spivey's fuzz type-checker and the ProB animator/model-checker from HHU Düsseldorf. Extract specs from existing code, generate code from specs, type-check, and model-check — without learning the toolchain.
agent harness The agent harness — identity, missions, and structured delegation.
The harness that connects agents to their work. Persistent identity (personality, writing style, expertise) loads automatically every session. Typed mission contracts enforce write-set boundaries, bounded rounds, and frozen evaluators — with an append-only audit trail. Teams define roles with tool restrictions, anti-responsibilities, and safety constraints. Identity is the connection point; the harness is the whole system. Three integration surfaces (filesystem, CLI, MCP) so any tool reads identity state without taking a dependency.
Session audit logging — one JSONL line per tool invocation
$ curl -fsSL https://raw.githubusercontent.com/punt-labs/ethos/77fb88e/install.sh | sh
Coming Soon
Refactory
v0.0.0 alpha
refactoring Behavior-preserving program transformation, not just "clean up code."
Refactory will seek to bring Opdyke's original definition of refactoring — behavior-preserving program transformations with formally defined preconditions — into an AI-assisted workflow. The AI decides what needs restructuring; Refactory would execute on a parsed program model where each transformation is proven safe before it runs.
Built on Opdyke's formal definition — each refactoring has preconditions that must hold
Operates on a parsed program model, not text substitution
AI selects the transformation; Refactory guarantees safety
Composable — chain small, proven-safe transformations into large restructurings
Ask about Punt Labs
Hi — I can answer questions about Punt Labs' tools, approach, and what we're building. What would you like to know?