Daniel's personal website

This library entry is part of The Extended Frontier thesis. Entries are curated with AI assistance and human review; most initial entries were prepared with Claude (Anthropic), while individual entries may note other assisting systems. Metadata and annotations are editorial, not peer-reviewed. Entries flagged as unverified may contain placeholder dates, authors, or classifications.

Good and Bad Harness Engineering

Daniel Miessler·2025-08-31·essay·source

Metadata unverified. URL and author verified; content summarised via search snippets (site blocks automated fetch). Exact publish date is a best guess — confirm from the post header before citing.

In the early days of prompt engineering (2023-2024) it was helpful to tell AI exactly how to do things, but this inversion probably happened somewhere in 2025.

Argues that good harness engineering focuses on who the user is and what they're trying to accomplish — the 'what' — and lets the model handle the 'how'. Pairs with Miessler's 'Bitter Lesson Engineering' as a design discipline for scaffolding that extends capability rather than compensating for model weakness.

Classification

Role: framework-piece
Domain: cross-domain
Source type: essay
Harness types: input-shapinggrounding-context-loadingexecution-harnessvalidation-harnessrepair-harnessmonitoring-harness
Validation position: before-generationimmediately-after-generationpost-deployment
Validation mode: empiricalmechanical
Prescription stance: mixed
Relation to argument: capability-is-extendedrepairability-mattersobservability-mattersbreakdown-when-harness-absent
Tags: harness-engineeringbitter-lessondesign-disciplineagent-designwhat-not-how

Extended capability commentary

Input legibility: Treats input formation as part of the engineered system, not preprocessing.
Task structure
Reward richness
Feedback latency
Repairability
Observability
Reversibility
Offline evaluability
Institutional ratification

Why it matters

Supplies the vocabulary for distinguishing harnesses that *extend* capability from harnesses that merely *compensate* for it. A critical lens for reading practitioner writing.

Annotation

Stakes out the middle ground between "thin harness, fat skills" and fully prescriptive agent frameworks. The core move is a good/bad distinction inside harness engineering itself: some scaffolding genuinely extends what the system can do (input shaping, repair loops, observability), while other scaffolding is brittle compensation for current model weakness and will not survive the next model.

Miessler's design rule is compressed into one line: don't confuse the what with the how. Tell the model who you are and what outcome you want; let the model figure out the path.

Read together with:

Bitter Lesson Engineering — the underlying argument, leaning on Sutton's "The Bitter Lesson."
Tan's "Thin Harness, Fat Skills" — adjacent but less prescriptive-about-good-design.

Miessler is not endorsing the thin-harness conclusion that scaffolding is always waste. He is endorsing a discipline of harness design. The disagreement with Tan is legible: both agree some scaffolding is waste; they disagree about how much of the harness is waste in the limit of model improvement.

What the library should extract once the post is fully read

The explicit taxonomy (if any) of good vs. bad harness work.
Concrete examples cited as each type.
Whether repairability and observability are treated as constitutive of capability or merely as hygiene.

Related entries

Skill Issue: Harness Engineering for Coding Agents
HumanLayer · 2026-02-28
#harness-engineeringcapability-is-extendedrepairability-mattersobservability-mattersbreakdown-when-harness-absentexecution-harnessrepair-harnessmonitoring-harness
What Is an Agent Harness
Aparna Dhinakaran · 2026-04-21
capability-is-extendedrepairability-mattersobservability-mattersbreakdown-when-harness-absentinput-shapinggrounding-context-loadingexecution-harnessvalidation-harnessrepair-harnessmonitoring-harness
LLM Knowledge Bases
Andrej Karpathy · 2026-04-01
capability-is-extendedrepairability-mattersobservability-mattersgrounding-context-loadingexecution-harnessvalidation-harnessrepair-harnessmonitoring-harness
Hermes Agent README
Nous Research · 2026-04-28
capability-is-extendedrepairability-mattersobservability-mattersinput-shapinggrounding-context-loadingexecution-harnessrepair-harnessmonitoring-harness

Overlap is computed on tags, relation-to-argument, and harness types — not on role or domain, because contrasts are often the most useful neighbours.