09. Bibliography

This course was informed by a focused set of public references on harness engineering, long-running agents, and reliable agentic software delivery.

Core papers

Anthropic — Building effective agents (December 2024) — Practical architecture patterns for deciding when to use workflows vs autonomous agents.
Anthropic — Effective harnesses for long-running agents (November 2025) — Concrete guidance on continuity, recovery, and reliability in extended agent sessions.
Anthropic — Harness design for long-running applications — Practical design patterns for building durable harnesses in long-horizon production workflows.
OpenAI — Zero-manual-code experiment report (2025–2026) — Reference link for OpenAI’s reported experiment on reducing manual coding steps through agentic workflows.
LangChain — Evaluating Deep Agents CLI on Terminal Bench 2.0 (December 2025, referenced in March 2026 reporting) — Benchmark-oriented case study on harness and evaluation setup impact.

OpenAI — Harness engineering — Introduces harness engineering as the reliability layer that makes agent behavior operationally dependable.
Martin Fowler — Harness Engineering for Coding Agent Users — Clear framing of why software teams need structure and control boundaries around coding agents.
LangChain — The anatomy of an agent harness — Breaks down the main harness components and how they work together in production settings.

WalkingLabs — Learn Harness Engineering — A structured learning path that connects conceptual foundations to implementation choices.

Claude Code from Source — Chapter 1: Architecture / The golden path from keystroke to output — Useful for understanding the end-to-end runtime path from prompt input to system output.
Claude Code from Source — Chapter 2: Bootstrap — Explains early-session initialization mechanics that strongly influence reliability and context quality.
Simon Willison — Red/Green TDD for agentic coding — A practical reliability pattern for keeping coding-agent changes test-driven and auditable.
Phil Schmid — Agent Harness 2026 — A concise implementation-focused perspective on harness design decisions and operational trade-offs.

Start practice: Project 01 →