Bibliography

Academic papers, books, and references cited across our PR/FAQ documents — collected from 195 citations across 10 repositories.

Journal Articles

article 2025 refactory

Refactoring with llms: Bridging human expertise and machine understanding

article 2025 refactory

SWE-refactor: a repository-level benchmark for real-world LLM-based code refactoring

1,099 behavior-preserving refactorings from 18 Java projects. Codex agent achieves 39.4% on compound refactoring.

article 2024 refactory

A survey of bugs in AI-generated code

72-study systematic review. Functional bugs are the most common AI code defect category.

article 2024 refactory

An empirical study on the code refactoring capability of large language models

article 2025 z-spec

Formal requirements engineering and large language models: A two-way roadmap

Alessio Ferrari and Paola Spoletini

Two symmetric paths: formal methods to guarantee LLM artifact correctness; LLMs to make formal methods more accessible.

article 2015 reason-trace

How Amazon Web Services uses formal methods

Chris Newcombe and Tim Rath and Fan Zhang and Bogdan Munteanu and Marc Brooker and Michael Deardeuff

Canonical reference for TLA+ in industry. AWS engineers used TLA+ to find subtle bugs in DynamoDB, S3, and other critical systems. Formal methods described as “surprisingly feasible” and “routinely applied” at Amazon scale.

article 2016 vox

Comparing the effect of audio and visual notifications on workspace awareness

D. Jung and A. Butz

Peer-reviewed. Audio notifications maintain workspace awareness without requiring visual attention.

article 1997 refactory

A refactoring tool for smalltalk

Don Roberts and John Brant and Ralph Johnson

The Refactoring Browser. Defines refactorings as provably correct parameterized transformations with preconditions guaranteeing behavior preservation.

article 2023 beadle

Use cases are essential

Ivar Jacobson and Alistair Cockburn

ACM peer-reviewed; argues for renewed use of use case methodology

article 2023 use-cases

Use cases are essential

Ivar Jacobson and Alistair Cockburn

Peer-reviewed ACM article calling for return to use case methodology. Jacobson invented use cases (1986 OOPSLA); Cockburn is co-author of Agile Manifesto. Published October/November 2023.

article 2009 z-spec

Formal methods: Practice and experience

Jim Woodcock and Peter Gorm Larsen and Juan Bicarregui and John Fitzgerald

Survey of 62 industrial FM projects: 92% reported quality increases, 0% reported decreases.

article 2026 prfaq

This a.I. Tool is going viral. Five ways people are using it.

Natallie Rocha

Profiles five non-programmers using Claude Code: school administrator, photographer, prosecutor, finance professor, welding business owner

article 2024 dungeon

Recovery from work by playing video games

Ömer Erdem Koçak

Investigates video games as recovery from work stress; finds harmonious gaming passion facilitates cognitive resource accumulation

article 2022 dungeon

“Give me a break!” A systematic review and meta-analysis on the efficacy of micro-breaks for increasing well-being and performance

Patricia Albulescu and Irina Macsinga and Dragos Iliescu and Coralia Sulea and Delia Vîrgă and Alexandra Dorina Stănculescu and Tudor Konstantin Rusu

Meta-analysis of 19 studies (N=2335) showing microbreaks boost vigor (d=.36), reduce fatigue (d=.35), with small positive effect on performance (d=.16)

article 1995 dungeon

The restorative benefits of nature: Toward an integrative framework

Stephen Kaplan

Foundational paper on Attention Restoration Theory; directed attention fatigue from sustained cognitive effort is restored by engagement with environments that provide fascination without requiring directed attention

article 1993 refactory

Creating abstract superclasses by refactoring

William F. Opdyke and Ralph E. Johnson

article 2015 refactory

The birth of refactoring: a retrospective on the nature of high-impact software engineering research

William G. Griswold and William F. Opdyke

Conference Papers

inproceedings 2008 vox

The cost of interrupted work: More speed and stress

Gloria Mark and Daniela Gudith and Ulrich Klocke

Peer-reviewed. 36 knowledge workers. 23 min 15 sec average recovery time after interruption. 2 intervening tasks before returning to original work.

inproceedings 2025 z-spec

Uncovering systematic failures of LLMs in verifying code against natural language specifications

Haolin Jin and Huaming Chen

LLMs exhibit systematic over-correction bias when verifying code. GPT-4o accuracy drops from 52.4% to 11.0% under complex prompts.

inproceedings 2025 z-spec

PropertyGPT: LLM-driven formal verification of smart contracts through retrieval-augmented property generation

Ye Liu and Yue Xue and Daoyuan Wu and Yuqiang Sun and Yi Li and Miaolei Shi and Yang Liu

80% recall against human-written ground-truth properties; finds 26 known and 12 previously unknown vulnerabilities.

Books

book 2021 prfaq

Working backwards: Insights, stories, and secrets from inside amazon

Colin Bryar and Bill Carr

book 1989 reason-trace

The Z notation: a reference manual

J. M. Spivey

The canonical reference for Z notation formal specification language. Z specifications describe state-machine behavior with mathematical precision. Used in safety-critical systems (railway signaling, nuclear, medical devices).

book 1992 z-spec

The Z notation: a reference manual

J. Michael Spivey

The definitive Z reference, by fuzz's creator.

book 1999 refactory

Refactoring: Improving the design of existing code

Martin Fowler

The industry reference that popularized refactoring. Defines refactoring as “a change made to the internal structure of software to make it easier to understand and cheaper to modify without changing its observable behavior.”

book 2018 prfaq

Inspired: How to create tech products customers love

Marty Cagan

Theses

phdthesis 1992 refactory

Refactoring object-oriented frameworks

William F. Opdyke

The originating PhD thesis establishing behavior-preserving program transformation as the formal definition of refactoring.

Reports

report 2025 dungeon

Opportunities in roguelike game market 2025–2033

Data Insights Market

Market research showing roguelike market at $1.33B in 2025, projected $2.57B by 2033 with 8.7% CAGR

report 2024 punt-kit

New platform engineering research report

Google Cloud

55% of global orgs adopted platform engineering; 90% plan to expand; 86% say essential to AI value

report 2025 quarry

State of AI code quality in 2025

Qodo

Developer survey: context pain affects 41–52% of developers by seniority; “improved contextual understanding” is the top-requested AI improvement (26% of votes); persistent context reduces frustration from 54% to 16%.

report 2024 punt-kit

Platform engineering services market size to hit USD 40.17 billion by 2032

SNS Insider

Market valued at $7.19B in 2024, projected $40.17B by 2032, CAGR 23.99%

report 2026 punt-kit

2026 state of code developer survey report

Sonar

1,149 developers; 88% cite negative AI code impacts; 96% do not fully trust AI-generated code; 42% of committed code is AI-generated

Standards

standard 2002 z-spec

Information technology — Z formal specification notation — Syntax, type system and semantics

The international standard for Z notation.

Miscellaneous

misc 2025 z-spec

Towards formal verification of LLM-generated code from natural language prompts

Aaron Councilman and David Jiahao Fu and Aryan Gupta and Chengxiao Wang and David Grove and Yu-Xiong Wang and Vikram Adve

Formal query language as contract layer for LLM-generated Ansible code; verifier achieves 83% confirmation of correct code and 92% identification of incorrect code.

misc 2025 prfaq

Replit CEO: We don't care about professional coders anymore

Amjad Masad

Replit's 35M+ users; CEO explicitly pivoting away from professional developers toward non-coders

misc 2025 prfaq

Vibe coding

Andrej Karpathy

Original post coining the term “vibe coding” — viewed over 4.5 million times

misc 2026 vox

Measuring AI agent autonomy in practice

Anthropic

Primary data: median Claude Code turn 45 seconds; 99.9th percentile grew from ¡25 min to ¿45 min (Oct 2025–Jan 2026). Auto-approve rate grows from 20% for new users to 40%+ for experienced users (750+ sessions). Claude stops to ask clarification 2x more often than users interrupt on complex tasks.

misc 2026 vox

Hooks reference — claude code documentation

Anthropic

Official documentation of hook events: Stop (Claude finishes), Notification (permission_prompt, idle_prompt subtypes). Primary source for hook-based notification architecture.

misc 2026 vox

Effective harnesses for long-running agents

Anthropic Engineering

Claude Opus 4: 7+ hours continuous autonomous operation. Security audits 30–60 min; performance profiling 45–90 min.

misc 2026 z-spec

Evaluating LLM-generated ACSL annotations for formal verification

Arshad Beg and Diarmuid O'Donoghue and Rosemary Monahan

LLM-generated annotations show lower, more variable proof success rates and increased SMT solver instability vs tool-generated.

misc 2025 z-spec

Leveraging LLMs for formal software requirements: Challenges and prospects

Arshad Beg and Diarmuid O'Donoghue and Rosemary Monahan

VERIFAI project: NL-to-formal-spec conversion identified as the critical bottleneck for safety-critical systems.

misc 2025 z-spec

Beyond postconditions: Can large language models infer formal contracts for automatic software verification?

Cedric Richter and Heike Wehrheim

NL2Contract: LLMs infer full functional contracts; verifiers using these detect genuine bugs in real-world code.

misc 2025 vox

agent-notify: Multi-channel notifications for AI coding agents

cfngc4594

Open-source competitor. Sound, macOS alerts, macOS say TTS, ntfy push. Supports Claude Code, Cursor, Codex. OS TTS only; no premium providers.

misc 2026 vox

Nvidia-backed AI voice startup ElevenLabs hits $11 billion valuation

CNBC

$500M Series D at $11B valuation. $330M+ ARR end of 2025. 41% Fortune 500 adoption.

misc 2026 prfaq

Anthropic's claude code revenue doubled since jan. 1

Constellation Research

Claude Code annualized run-rate revenue over $2.5B as of February 2026, doubling since January 1. Anthropic total ARR is $14B. Figure is Claude Code-specific

misc 2026 vox

Notification fatigue is about to get 10x worse

Courier.com

IDC: 80% of enterprise apps will have AI copilots by end of 2026. Context establishing OS notification fatigue.

misc 2025 punt-kit

Agent readmes: An empirical study of context files for agentic coding

Filipe Calegario and others

2,303 context files from 1,925 repos; security in only 14.5% of files; build commands dominate at 62.3%

misc 2025 prfaq

Smash through tech debt: Why AI is the jackhammer

HFS Research and Publicis Sapient

Survey of 608 IT and business leaders estimating Global 2000 accumulated technical debt at $1.5–2 trillion

misc 2026 z-spec

Automatic generation of formal specification and verification annotations using LLMs and test oracles

João Pascoal Faria and Emanuel Trigo and Vinicius Honorato and Rui Abreu

LLMs generate correct Dafny annotations for 98.2% of programs within 8 repair iterations using verifier feedback.

misc 2026 punt-kit

Codified context: Infrastructure for AI agents in a complex codebase

Jonah Katz and others

283 sessions on a 108,000-line codebase; single-file manifests do not scale; three-component infrastructure for persistent agent context

misc 2019 dungeon

AI dungeon

Latitude

Pioneering LLM text adventure launched 2019; reached 100,000 players in first week and 1.5 million by June 2020; retired from Steam March 2024

misc 2026 punt-kit

On the impact of agents.md files on the efficiency of AI coding agents

Lucas Larson and others

10 repos, 124 PRs; AGENTS.md presence reduces runtime 28.64% and token consumption 16.58%

misc 2025 punt-kit

On the use of agentic coding manifests: An empirical study of claude code

Matteo Ciniselli and others

253 CLAUDE.md files from 242 repos; manifests optimized for code execution not org conventions; security in 8.7%

misc 2025 vox

Text to speech market size, trends report, share and forecast 2030

Mordor Intelligence

Global TTS market $3.87B in 2025, projected $7.28B by 2030 at 12.89% CAGR.

misc 2025 vox

Cursor AI adoption trends: Real data from the fastest growing coding tool

Opsera

Cursor: 1M+ users, 360K paying customers, $500M ARR (May 2025), 50%+ Fortune 500 adoption.

misc 2025 vox

AgentVibes: Bring your claude code sessions to life with voice

paulpreibisch

Open-source competitor. macOS say, Piper TTS, Soprano, Windows SAPI. 914 voices. Background music, personality styles, verbosity levels. No ElevenLabs or OpenAI TTS.

misc 2026 use-cases

Autonomous agent system — use case development transcript

Punt Labs

Internal primary source. Transcript of a complete AI-guided use-case elicitation session using Jacobson-Cockburn v1.1. Produced 8 use cases (v1.0) in five iterative rounds with all open questions resolved. Existence proof that AI-guided methodology application is feasible.

misc 2025 z-spec

SysMoBench: Evaluating AI on formally modeling complex real-world systems

Qian Cheng and Ruize Tang and Emilie Ma and Finn Hackett and Peiyang He and Yiming Su and Ivan Beschastnikh and Yu Huang and Xiaoxing Ma and Tianyin Xu

LLMs handle small formal modeling artifacts; significant performance gaps remain for complex real-world distributed systems in TLA+.

misc 2025 prfaq

bolt.new revenue, funding & news

Sacra

Bolt.new hit $40M ARR in March 2025, progressing from $4M ARR within 4 weeks of launch

misc 2025 prfaq

Cursor revenue, valuation & funding

Sacra

Cursor reached $1B ARR and 1M daily active users

misc 2026 prfaq

Claude code is the inflection point

SemiAnalysis

4% of GitHub public commits authored by Claude Code; projection of 20%+ by end of 2026

misc 2025 z-spec

A benchmark for vericoding: formally verified program synthesis

Sergiu Bursuc and Theodore Ehrenborg and Shaowei Lin and Lacramioara Astefanoaei and Ionel Emilian Chiosa and Jure Kukovec and Alok Singh and Oliver Butterley and Adem Bizid and Quinn Dougherty and Miranda Zhao and Max Tan and Max Tegmark

LLM success rates: 82% Dafny, 44% Verus/Rust, 27% Lean. Dafny improved from 68% to 96% in one year.

misc 2026 z-spec

Specification-driven development: Rethinking how we build software in the age of AI

Solomon Lemma Abebe

AIWare 2026. Three-level taxonomy: spec-first, spec-anchored, spec-as-source. Positions formal specs as primary artifact in AI-assisted development.

misc 2025 prfaq

2025 developer survey

Stack Overflow

Annual survey of 65,000+ developers on tools, practices, and AI adoption

misc 2024 z-spec

Stack overflow developer survey 2024

Stack Overflow

76% of developers use or plan to use AI coding tools.

misc 2026 z-spec

Constitutional spec-driven development: Enforcing security by construction in AI-assisted code generation

Suhas Bharadwaj and others

73% reduction in security defects vs unconstrained AI generation when code is constrained by a formal spec layer.

misc 2025 prfaq

Lovable says it's nearing 8 million users

TechCrunch

Lovable, an AI coding startup founded in 2024, nearing 8 million users within one year

misc 2025 prfaq

A quarter of startups in yc's current cohort have codebases almost entirely AI-generated

TechCrunch

25% of Y Combinator W25 cohort startups have codebases that are almost entirely AI-generated

misc 2025 vox

TTS pricing comparison 2025

TextToLab

OpenAI TTS: $15/1M chars (tts-1), $30/1M chars (tts-1-hd). AWS Polly: $4.80/1M standard, $19.20/1M neural. ElevenLabs: subscription from $5/month / 30K chars.

misc 2025 vox

Claude code's “Tasks” update lets agents work longer and coordinate across sessions

VentureBeat

Background tasks recommended for work ¿30 seconds. Describes Claude Code shift from copilot to background subagent.

misc 2024 beadle

RFC 9580: OpenPGP

Werner Koch and others

Current OpenPGP standard; July 2024; v6 formats, post-quantum ML-KEM keys

misc 2024 z-spec

Hallucination is inevitable: An innate limitation of large language models

Ziwei Xu and Sanjay Jain and Mohan Kankanhalli

Proves via diagonalization that LLMs cannot eliminate hallucination for all computable functions; external symbolic reasoning required.

Online Resources

online 2025 quarry

Rewind Mac app shutting down following Meta acquisition

9to5Mac

Confirms Rewind Mac desktop app and Pendant hardware shutdown effective December 2025.

online 2025 reason-trace

AgentOps — Developer platform for AI agents

AgentOps

Product homepage. Confirms: Time Travel Debugging (session waterfall timeline), tool call tracking. Requires SDK instrumentation (2 lines of code). No terminal recording.

online 2026 quarry

Claude code plugins review 2026: 9,000+ extensions

AI Tool Analysis

Reports 9,000+ Claude Code plugins as of early 2026.

online 2023 dungeon

How Latitude scaled production of their gaming worlds while reducing costs

AI21 Labs

Documents Latitude's transition from OpenAI to AI21's Jurassic-1 model due to policy conflicts

online 2025 beadle

The agentic AI security scoping matrix

Amazon Web Services

Recommends per-tool least-privilege permissions and immutable audit trails for autonomous agents

online 2025 use-cases

Introducing Kiro: Agentic AI development from prototype to production

Amazon Web Services

Launched July 15, 2025. Spec-driven approach: requirements.md (EARS syntax), design.md, and tasks.md before code. Key competitor validating AI-assisted specification.

online 2026 reason-trace

The middle loop

Annie Vella

Study of 158 software engineers' AI usage patterns. Introduces “supervisory engineering work” as a new intermediate loop between inner (coding) and outer (planning) development cycles. Engineers spend increasing time reviewing, validating, and directing AI output rather than writing code directly.

online 2026 punt-kit

2026 agentic coding trends report

Anthropic

Developers use AI in 60% of work; AI agents market projected at $52.62B by 2030 at 46.3% CAGR

online 2026 quarry

Anthropic raises $30 billion in Series G funding at $380 billion post-money valuation

Anthropic

Primary Anthropic source: Claude Code WAU doubled since January 1 2026; annualized revenue exceeded $2.5B; total Anthropic ARR $14B. Announced February 12 2026.

online 2026 quarry

Claude code hooks reference

Anthropic

online 2025 punt-kit

Effective context engineering for AI agents

Anthropic

“Each new session begins with no memory of what came before”

online 2025 use-cases

anthropics/knowledge-work-plugins

Anthropic

Anthropic-maintained plugins for knowledge workers. Confirms methodology plugins are a recognized category.

online 2025 reason-trace

Automate workflows with hooks — Claude Code Docs

Anthropic

Official Claude Code hooks documentation. 14 lifecycle events including PostToolUse, Stop, PreCompact. Hook stdin payload includes session_id, transcript_path, tool_name, tool_input, tool_response.

online 2025 punt-kit

Manage claude's memory — claude code docs

Anthropic

CLAUDE.md loaded every session but is static with no feedback loop to detect drift

online 2025 reason-trace

asciicast v3 — asciinema docs

asciinema

Format spec. Five event types: o, i, m, r, x. Marker events carry time + optional string label. Explicitly extensible.

online 2025 dungeon

Bash screensavers revive terminal art with 90s nostalgia and modern whimsy

BigGo News

Documents 2025 revival of ASCII art in developer terminals

online 2025 reason-trace

Cursor, an AI coding assistant, draws a million users without even trying

Bloomberg

Cursor: 1 million daily active users as of April 2025. 360K+ paying subscribers, 50K+ businesses. Growth almost entirely organic. $9.9B valuation; $500M ARR by late 2025.

online 2025 dungeon

Beyond the GUI: The ultimate guide to modern terminal user interface applications and development libraries

BrightCoding

Documents current terminal gaming ecosystem including BrogueCE roguelike and pokete RPG

online 2026 dungeon

Claude revenue and usage statistics (2026)

Business of Apps

Comprehensive Claude platform statistics including 30M MAU and API call volumes

online 2024 refactory

Tree-sitter vs LSP: Why hybrid IDE architecture wins

byteiota

online 2025 beadle

Cisco's 2025 data privacy benchmark study

Cisco

2,600 professionals across 12 countries; 90% see local storage as safer; 64% worry about cloud GenAI data exposure

online 2025 reason-trace

AI agents in production 2025

Cleanlab

Survey of 95 professionals with AI agents in production. Fewer than 1 in 3 teams satisfied with observability. 42% of regulated enterprises plan review controls. Observability called “the weakest layer.”

online 2025 quarry

AI code editors showdown 2025: Cursor vs windsurf vs copilot explained

CodeAnt AI

Documents Windsurf Cascade Memories: persistent cross-conversation context in a major AI code editor.

online 2025 use-cases

State of AI vs human code generation report

CodeRabbit

Analysis of 470 open-source pull requests. AI-generated code produces 1.7x more issues, 2.74x more XSS vulnerabilities. Published December 2025 via BusinessWire.

online 2025 punt-kit

Beyond code generation: How context engineering can transform developer experience

Continue.dev

Describes how repeated code review comments represent tribal knowledge that can be systematized

online 2024 punt-kit

Comparisons — copier documentation

Copier contributors

Lifecycle management: updating existing projects when templates evolve; confirms propagation problem is real

online 2025 punt-kit

Engineering manager's guide to static analysis

DeepSource

Quantifies static analysis at 4 hours/week/engineer saved on code reviews

online 2025 reason-trace

Lean4: How the theorem prover works and why it's the new competitive edge in AI

Dhyey Mavani

VentureBeat guest post. Documents Lean4 adoption by OpenAI, Meta, Google DeepMind (AlphaProof), and Harmonic AI ($100M raised). Key framing: formal verification as a “safety net” for LLMs — each reasoning step translated to Lean4 and proof-checked. VeriBench benchmark: LLMs verify only 12% of code challenges in Lean4, but iterative agent approach reaches 60%.

online 2025 refactory

State of AI-assisted software development 2025

DORA / Google Cloud

90% AI adoption but only 3% express high trust in AI-generated output.

online 2024 refactory

Accelerate state of DevOps report 2024

DORA / Google Cloud

Survey of 39,000+ professionals. 39% reported little to no trust in AI-generated code.

online 2026 quarry

Former GitHub CEO thomas dohmke raises $60 million seed round

Entire

Primary announcement from Entire.io confirming $60M seed round and Checkpoints CLI product launch.

online 2026 reason-trace

entireio/cli: Entire hooks into your git workflow to capture AI agent sessions on every push

Entire

Official open-source CLI repository. Confirms: captures prompts, responses, files modified, timestamps as JSON on entire/checkpoints/v1 git branch. Hooks into .claude/, .gemini/, .cursor/ directories. Optional AI-generated session summaries.

online 2026 quarry

entireio/cli: Entire is a new developer platform that hooks into your git workflow

Entire

Open-source Checkpoints CLI. Captures prompts, transcripts, tool calls, token usage. Stores on entire/checkpoints/v1 git shadow branch. Supports Claude Code, Gemini CLI, OpenCode, Cursor.

online 2025 dungeon

The science and wellness benefits of microbreaks

Focused Solutions, LLC

Reviews research showing five-minute breaks improve concentration

online 2026 dungeon

In the workforce, AI is having the opposite effect it was supposed to, UC Berkeley researchers warn

Fortune Magazine

UC Berkeley research on AI causing workload creep, cognitive fatigue, and burnout

online 2025 dungeon

10 best roguelikes with ASCII art

Game Rant

Profiles modern ASCII roguelikes including Cogmind, praised for “most advanced terminal interface ever”

online 2022 punt-kit

Platform engineering empowers developers to be better, faster, happier

Gartner

80% of large software engineering orgs will have platform engineering teams by 2026, up from 45% in 2022

online 2026 reason-trace

Startup Radar: Meet Seattle founders building software for coding agents

GeekWire

GeekWire Startup Radar feature on SageOx. VC assessment: “The vision is timely…\ The risk is abstraction.”

online 2025 refactory

AI copilot code quality: 2025 data suggests 4x growth in code clones

GitClear

211 million changed lines analyzed. Refactoring fell from 25% to under 10% of changed lines.

online 2025 refactory

The state of the octoverse 2025

GitHub

Over 150 million developers on GitHub as of 2025.

online 2020 punt-kit

Promote consistency across your organization with workflow templates

GitHub

Checking many repos for consistency exposes teams to human error and reduced visibility

online 2025 reason-trace

The state of observability in 2025

Grafana Labs

n=1,255 responses, collected Sep 2024–Jan 2025, analysis by Censuswide. Alert fatigue is the No. 1 obstacle to faster incident response, outpacing next response by almost 2:1. Average user manages 16 data sources; 5% manage over 100. Strongest independent source for the collection-without-action pattern in observability.

online 2025 beadle

AI agents market size and share

Grand View Research

Market at USD 7.63B in 2025, projected USD 182.97B by 2033 at 49.6% CAGR

online 2024 punt-kit

Enforcing coding standards in a team with code review

Graphite.dev

Code review is the primary mechanism for standards enforcement; wikis as knowledge storage

online 2026 dungeon

AI doesn't reduce work—it intensifies it

Harvard Business Review

Documents burnout from constant review and verification of AI-generated code

online 2021 dungeon

83% of developers suffer from burnout, haystack analytics study finds

Haystack Analytics

Survey finding that 83% of software developers suffer from burnout; top causes: high workload (47%), inefficient processes (31%), unclear goals (29%)

online 2024 beadle

GPG security review: Strengths, weaknesses, and best practices

hoop.dev

Key expiry and rotation as required best practice; complexity as primary operational risk

online 2024 beadle

ACP: Agent control plane

HumanLayer

Open-source agent scheduler with email as a human-approval channel for outer-loop agents

online 2024 use-cases

Use-case foundation v1.1

Ivar Jacobson and Alistair Cockburn

Primary methodology document. Defines system of interest, primary actor, goal, basic course, and extensions. Freely available.

online 2024 use-cases

Use-case foundation

Ivar Jacobson International

Active commercial and training ecosystem. Certifications and training materials available.

online 2024 use-cases

Use-case 3.0: The definitive guide

Ivar Jacobson International

2024 evolution of the methodology. Documents the incremental, story-driven approach. Confirms the methodology is actively maintained and evolving.

online 2025 reason-trace

I built my own observability for Claude Code — here's why and how

J. Doneyli

Developer account of gaps in existing Claude Code observability: unbounded logs, missing responses, no queryability, lost session context.

online 2026 reason-trace

Six problems of agentic engineering

Jeff Freeman

Punt Labs blog post. Maps the agentic engineering coordination landscape as six layers: (1) Intent & Trust, (2) Context & Memory, (3) Team Communication, (4) Project Tracking, (5) Agent Teams, (6) Agent-to-Agent protocols. Published on punt-labs.com.

online 2025 reason-trace

coding_agent_session_search: Unified TUI/CLI for coding agent session history

Jeffrey Emanuel

Indexes chat transcript JSON from 11+ agent providers. Sub-60ms search. No terminal recording, no visual replay, no reasoning review.

online 2024 refactory

Program structure interface (PSI) — IntelliJ platform plugin SDK

JetBrains

online 2024 refactory

PSI elements — IntelliJ platform plugin SDK

JetBrains

online 2026 beadle

OpenClaw/ClawdBot vulnerabilities

Kaspersky

Authentication disabled by default; three CVEs including CVSS 8.8 gateway compromise

online 2025 reason-trace

10 things developers want from their agentic IDEs in 2025

Kate Holterhoff

RedMonk analysis. Developers demand persistent memory, audit trails, rollbacks. “Developers are frustrated by agents that forget everything between sessions.”

online 2025 refactory

Augmented coding: Beyond the vibes

Kent Beck

online 2026 reason-trace

Governing AI generated code — a hands-on experiment with Entire and Kosli

Kosli

Key insight: “every proxy measures actions, not cognition. The gap between them is exactly where intent lives.”

online 2024 beadle

State of AI agents report: 2024 trends

LangChain

Survey of 1,300+ professionals; 51% using agents in production

online 2025 reason-trace

Trace Claude Code with Langfuse

Langfuse

Official integration guide. Captures via Stop hook: user inputs, assistant responses, tool invocations (inputs/outputs), session grouping, timing. Structured JSON traces and spans. Does not capture terminal ANSI output.

online 2025 dungeon

Burnout is on the rise as layoffs reshape the tech industry

LeadDev

Using validated Maslach Burnout Inventory: 22% of 617 surveyed engineering leaders and developers face critical burnout levels; 24% moderately burned out

online 2025 refactory

LLMs bring a new nature of abstraction

Martin Fowler

online 2025 refactory

Refactoring with codemods to automate API changes

Martin Fowler

online 2025 refactory

Some thoughts on llms and software development

Martin Fowler

online 2025 use-cases

Understanding spec-driven-development: Kiro, spec-kit, and Tessl

Martin Fowler

Independent analysis of the spec-driven development tool landscape. Identifies three main tools; none use Jacobson-Cockburn format.

online 2003 refactory

Etymology of refactoring

Martin Fowler

online 2025 z-spec

Prediction: AI will make formal verification go mainstream

Martin Kleppmann

LLM proof-script generation + formal checkers create a virtuous cycle. Cites seL4 (8,700 lines C, 20 person-years, 200,000 lines Isabelle) as evidence that cost barrier is collapsing.

online 2025 refactory

Prediction: AI will make formal verification go mainstream

Martin Kleppmann

online 2024 refactory

.NET compiler platform SDK (roslyn)

Microsoft

Compiler-as-library exposing full semantic model via API. Powers refactoring in Visual Studio and OmniSharp.

online 2025 refactory

AI agents and OpenRewrite: Automated code remediation

Moderne

Moderne positions OpenRewrite recipes as tools invocable by AI agents via MCP and function calling. 3,500+ recipes available.

online 2025 refactory

OpenRewrite — large-scale automated source code refactoring

Moderne

Lossless Semantic Tree (LST) for Java/Kotlin. Recipe-based refactoring for migrations and framework upgrades.

online 2025 reason-trace

AI agent observability — evolving standards and best practices

OpenTelemetry

OTel GenAI semantic conventions for agent spans. Targets API-level observability. Does not address terminal/CLI-level capture.

online 2025 beadle

AI agent security cheat sheet

OWASP

Technical guidance on sandboxing, permission scoping, and audit trails for autonomous agents

online 2025 beadle

OWASP top 10 risks and mitigations for agentic AI security

OWASP GenAI Security Project

100+ security researchers; covers prompt injection, identity abuse, tool misuse for agentic AI

online 2025 use-cases

OWASP top 10 risks and mitigations for agentic AI security

OWASP GenAI Security Project

Formal taxonomy of agentic AI security threats from 100+ researchers.

online 2026 beadle

OpenClaw agentic AI threat assessment

Palo Alto Networks Unit 42

Concluded OpenClaw maps to all 10 OWASP risks for agentic applications; lacks enforceable trust boundaries between untrusted inputs and high-privilege reasoning

online 2025 use-cases

Claude code official plugin marketplace: Complete guide to 36 plugins now available

Pete Gypps Consultancy

Official marketplace launched October 9, 2025. 36 plugins as of December 2025.

online 2025 dungeon

Claude Code reaches 115,000 developers, processes 195 million lines weekly

PPC Land

Official statistics on Claude Code user base and adoption metrics

online 2025 use-cases

Claude Code reaches 115,000 developers, processes 195 million lines weekly

ppc.land

Claude Code statistics from July 6, 2025. 115,000 developers; 195M lines/week. Launched March 2025.

online 2025 quarry

Claude Code reaches 115,000 developers, processes 195 million lines weekly

ppc.land

Reports figures disclosed by Deedy Das (Menlo Ventures) on July 6 2025 — NOT an official Anthropic release. 115,000 developers, 195M lines per week. Anthropic confirmed only “5.5x revenue increase” for July 2025. Use Anthropic Series G announcement for confirmed figures.

online 2024 punt-kit

projen: Rapidly build modern applications with advanced configuration management

projen contributors

Synthesized config files should never be manually edited; custom project types can update N repos; AWS-centric, TypeScript-first

online 2025 beadle

Proton mail Bridge

Proton

Local IMAP/SMTP server; open-source Go; paid subscription required

online 2024 beadle

Proton reaches 100 million accounts

Proton

100M+ accounts; demonstrates demand for privacy-focused communication

online 2024 refactory

Rope overview

python-rope contributors

online 2026 use-cases

Building effective AI agents with model context protocol (MCP)

Red Hat

MCP described as “fastest-adopted standard RedMonk has ever seen.” Donated to Linux Foundation. 1,000+ MCP servers in ecosystem.

online 2026 beadle

Building effective AI agents with model context protocol (MCP)

Red Hat

MCP as fastest-adopted developer standard; donated to Linux Foundation

online 2026 use-cases

The uncomfortable truth about vibe coding

Red Hat Developer

Published February 17, 2026. Documents the backlash and shift toward structured development practices.

online 2025 refactory

rust-analyzer — a rust compiler frontend for ides

rust-analyzer contributors

Compiler-as-library architecture. Exposes semantic analysis and refactoring via LSP.

online 2026 quarry

PR #4: Replace flock-based daemon liveness with socket-ping detection

SageOx

Demonstrates SageOx's product surface: conversation recording and session links in PR body.

online 2026 quarry

Context infrastructure for human-agent collaboration

SageOx

online 2026 reason-trace

Introducing SageOx

SageOx

Primary source from SageOx. Confirms four-component product: Team Context, Ledger of Work, Ox CLI (context priming), Web App. CEO: Ajit Banerjee (ex-Hugging Face).

online 2024 dungeon

The story of roguelikes and ASCII

SD Times

Historical overview of roguelike genre and ASCII art tradition from 1980s UNIX mainframes

online 2025 refactory

LLM-driven code refactoring: Opportunities and limitations

SEAL, Queen's University

StarCoder2 refactorings pass only 28.36% of unit tests at pass@1.

online 2025 refactory

Semgrep — lightweight static analysis

Semgrep, Inc.

Pattern-based multi-language analysis. Focuses on security and bug detection rather than behavior-preserving refactoring.

online 2024 beadle

AutoGPT

Significant Gravitas

182,000+ GitHub stars; autonomous AI agent framework with local deployment

online 2026 reason-trace

Entire launches with $60M to build an AI-focused code management platform

SiliconAngle

Detailed product breakdown: Checkpoints logs prompts, responses, files modified, token counts, attribution percentages. Planned components: semantic reasoning layer and developer UI.

online 2026 quarry

Entire launches with $60M to build AI-focused code management platform

SiliconANGLE

online 2025 z-spec

Global developer population trends 2025

SlashData

Estimates 36.5 million professional developers within 47.2 million total developers worldwide as of early 2025.

online 2026 beadle

ClawdBot security analysis: CVE-2026

Snyk

Demonstrated spoofed email triggering shell command that exfiltrated clawdbot.json containing API keys

online 2025 punt-kit

Celebrating five years of backstage

Spotify Engineering

3,000+ organizations; adoption stalls at under 10% in most non-Spotify orgs; requires dedicated full-time team

online 2025 quarry

Anthropic's Claude Code plugins open the floodgates

StartupHub AI

Claude Code plugins launched public beta October 9 2025. Ecosystem grew to 9,000+ plugins in under five months.

online 2026 dungeon

The AI vampire

Steve Yegge

Coined the term “AI vampire” to describe cognitive fatigue from AI-assisted coding; documents “Nap Attacks” and productivity limitations

online 2026 biff

Welcome to gas town

Steve Yegge

Maturity model with 8 levels of AI integration in development workflows. Argues teams progress through stages from basic completion to full agent orchestration. Complements Eledath's agentic engineering levels with a practitioner perspective.

online 2024 punt-kit

What is policy as code? Definition and benefits

Styra

OPA as de facto standard for policy-as-code; executable assertion pattern proven in infrastructure

online 2026 reason-trace

Former GitHub CEO raises record $60M dev tool seed round at $300M valuation

TechCrunch

Primary news source confirming Entire's $60M seed round, Checkpoints product description, three-component platform architecture, and provenance gap framing. Investors: Felicis (lead), Madrona, Basis Set, M12 (Microsoft).

online 2026 quarry

Former GitHub CEO raises record $60M dev tool seed round at $300M valuation

TechCrunch

Confirms Thomas Dohmke (GitHub CEO 2021–2025) founded Entire; $60M seed at $300M valuation led by Felicis; largest-ever developer tools seed round.

online 2025 reason-trace

GitHub Copilot crosses 20 million all-time users

TechCrunch

Satya Nadella announced 20M on Microsoft earnings call July 30, 2025. TechCrunch notes this is “all-time users,” not active. Paid subscribers: approx. 1.3M Q1 2025, growing 30% QoQ.

online 2025 quarry

Meta acquires AI device startup Limitless

TechCrunch

Meta acquired Limitless (formerly Rewind) December 5 2025. Founded by Dan Siroker and Brett Bejcek. Raised $33M+ from Sam Altman, First Round, a16z, NEA. Team joins Meta Reality Labs.

online 2025 use-cases

AWS Kiro coding agents highlight spec-driven development

TechTarget

Industry coverage confirming spec-driven development as the competitive direction AWS is betting on.

online 2025 use-cases

Vibe coding fails enterprise reality check

The New Stack

Documents enterprise teams abandoning vibe coding due to accumulating technical debt from absent specifications.

online 2025 punt-kit

23% of devs regularly use AI agents, per stack overflow survey

The New Stack

69% of agent users report productivity increase

online 2026 quarry

entire.io: Capture the reasoning behind AI-generated code

Thomas Dohmke

Founded by former GitHub CEO Thomas Dohmke. Launched February 2026 with $60M in funding.

online 2025 use-cases

Spec-driven development: Unpacking one of 2025's key new AI-assisted engineering practices

Thoughtworks

Practitioner post on spec-driven development practices for AI-assisted engineering.

online 2025 use-cases

Spec-driven development — technology radar

Thoughtworks

Thoughtworks Technology Radar entry. Identifies SDD as “one of the most important practices to emerge in 2025.” Cites 60–70% rework rate in low-maturity teams without specifications.

online 2025 dungeon

The NEW surprising number of steam games that use GenAI

Totally Human Media

Documents that nearly 20% of games released in 2025 disclosed use of generative AI

online 2024 punt-kit

Streamlining development workflows: The secret to taming multi-repo chaos

Trunk.io

Multi-repo consistency requires deliberate effort; configuration drift without automation

online 2024 punt-kit

Trunk code quality: Automated code quality for teams

Trunk.io

Metalinter running 100+ tools; CI-focused; does not scaffold, manage releases, or integrate with AI agents

online 2025 dungeon

The hidden cost of AI-assisted development: cognitive fatigue

WarpedVisions.org

Analysis of cognitive fatigue patterns in AI-assisted development

online 2025 reason-trace

agentsview: Fast local coding agent session viewer

Wes McKinney

Local web app for browsing Claude Code, Codex, Gemini sessions. Activity heatmaps, tool usage metrics. No terminal recordings, no reasoning review.

online 2025 use-cases

Vibe coding

Wikipedia contributors

Encyclopedia entry confirming vibe coding as a recognized cultural phenomenon in software development.

online 2024 dungeon

AI dungeon

Wikipedia Contributors

Historical overview of AI Dungeon's development, LLM transitions, and timeline

online 2025 reason-trace

State of agentic AI adoption survey [2026]

Zapier

84% of enterprises plan to boost AI agent investments in 2026; 72% already use AI agents.