Grounding & Formal Methods

Five tools covering the stages most teams skip under pressure: discover what to build, require what it must do, specify its states and invariants, and enforce structured delegation with audit trails. AI removes the time penalty from each stage. These tools put that hypothesis into practice — and are built to help engineers learn the underlying disciplines, not just apply them.

PR/FAQ

v1.6.1

Amazon's Working Backwards process, inside the terminal.

Generate, review, and stress-test Amazon Working Backwards documents with specialized agents, simulated review meetings, and compiled PDF output. See an example — the PR/FAQ for this tool, written by the tool itself.

Uses Quarry
  • Complete PR/FAQ document generation — press release, FAQs, four-risks assessment, feature appendix
  • Simulated review meeting with four agentic personas who debate weak spots
  • Researcher agent integrates with Quarry for semantic search
  • Cascading feedback engine traces effects across all sections
  • Compiles to print-ready PDF via LaTeX
curl -fsSL https://raw.githubusercontent.com/punt-labs/prfaq/8503f06/install.sh | sh
3 demos → GitHub → Code LevelL4
Coming Soon

Use Cases

v0.0.0 alpha

Actors. Goals. Scenarios. Extensions. The spec that comes before the code.

Use Cases will seek to bring Jacobson and Cockburn's structured requirements methodology into Claude Code. The goal: identify the actors in your system, define what each is trying to accomplish, describe the step-by-step scenario that achieves their goal, and systematically work through what can go wrong at each step. The output is a specification document — not code.

  • Guided walkthrough: actors, goals, scenarios, extensions
  • Cockburn's goal-level hierarchy — summary, user, subfunction
  • Systematic extension handling at every scenario step
  • Specification document output, not generated code
Links available at launch

Z Spec

v0.14.1 beta

Formal Z specifications for stateful systems.

Z is a specification language grounded in set theory and first-order predicate logic, developed at Oxford and standardized as ISO 13568. A Z spec describes a system as states, invariants that constrain them, and operations that transition between them — catching entire classes of bugs mathematically, not just the inputs you happened to test. Z Spec makes two powerful tools approachable inside Claude Code: Spivey's fuzz type-checker and the ProB animator/model-checker from HHU Düsseldorf. Extract specs from existing code, generate code from specs, type-check, and model-check — without learning the toolchain.

Visualizes with Lux
  • Bidirectional: code2model extracts specs, model2code generates implementations
  • Type-checks with fuzz — Spivey's Z type-checker from Oxford
  • Animates and model-checks with ProB — state-space exploration from HHU Düsseldorf
  • Derives test cases from specs using TTF testing tactics
  • Lux visualizations — interactive dashboards for model-check results and counter-example traces
  • Lean 4 proof obligations, runtime contracts, and property-based oracle testing
  • B-Method support: create, type-check, animate, and refine B machines
curl -fsSL https://raw.githubusercontent.com/punt-labs/z-spec/fc89e39/install.sh | sh
View details → Code LevelL4

Ethos

v3.5.0 beta

agent harness The agent harness — identity, missions, and structured delegation.

The harness that connects agents to their work. Persistent identity (personality, writing style, expertise) loads automatically every session. Typed mission contracts enforce write-set boundaries, bounded rounds, and frozen evaluators — with an append-only audit trail. Teams define roles with tool restrictions, anti-responsibilities, and safety constraints. Identity is the connection point; the harness is the whole system. Three integration surfaces (filesystem, CLI, MCP) so any tool reads identity state without taking a dependency.

  • Persistent identity — personality, writing style, talents, roles, team graphs, channel bindings
  • Typed mission contracts — write-set boundaries, frozen evaluators, bounded rounds, audit trails
  • Teams and roles — collaboration graphs, anti-responsibilities, safety constraints
  • MCP server with 10 tools — identity, attributes, extensions, sessions, teams, roles, missions
  • Three integration surfaces — filesystem (zero dependency), CLI (hooks/scripts), MCP (structured)
  • Session audit logging — one JSONL line per tool invocation
curl -fsSL https://raw.githubusercontent.com/punt-labs/ethos/77fb88e/install.sh | sh
View details → Code LevelL1 L4
Coming Soon

Refactory

v0.0.0 alpha

refactoring Behavior-preserving program transformation, not just "clean up code."

Refactory will seek to bring Opdyke's original definition of refactoring — behavior-preserving program transformations with formally defined preconditions — into an AI-assisted workflow. The AI decides what needs restructuring; Refactory would execute on a parsed program model where each transformation is proven safe before it runs.

  • Built on Opdyke's formal definition — each refactoring has preconditions that must hold
  • Operates on a parsed program model, not text substitution
  • AI selects the transformation; Refactory guarantees safety
  • Composable — chain small, proven-safe transformations into large restructurings