Skip to content

Project 03 · Multi-Agent Pipeline

Objective

Build the same capability under controlled conditions and observe how harness design changes reliability.

Steps

  1. Define the task and acceptance criteria.
  2. Run the agent without a harness and record scope, tests, modified files and completion claims.
  3. Add AGENTS.md, feature_list.json and progress.md.
  4. Run the governed session with the same core prompt.
  5. Compare baseline vs harnessed execution.

Metrics

MetricBaselineHarnessed
Files modified
Out-of-scope edits
Tests run automatically
Attempts to pass tests
Progress documented

Released under the MIT License.