LLM Knowledge Bases
You rarely ever write or edit the wiki manually; it's the domain of the LLM.
Describes a personal research workflow where raw source documents are compiled by an LLM into a markdown wiki, maintained through index files, health checks, generated outputs, and lightweight tools rather than a heavyweight RAG stack.
Classification
- Role
- practitioner-note
- Domain
- research
- Source type
- tweet
- Harness types
- grounding-context-loading, execution-harness, validation-harness, repair-harness, monitoring-harness, learning-harness, interface-harness
- Validation position
- before-generation, immediately-after-generation, continuous
- Validation mode
- empirical, interpretive
- Prescription stance
- mixed
- Relation to argument
- capability-is-extended, first-mile-input-formation, validation-is-constitutive, repairability-matters, observability-matters, domain-structure-matters
- Tags
- knowledge-base, markdown-wiki, obsidian, agentic-research, llm-maintained-artifacts, personal-knowledge-management
Extended capability commentary
- Input legibility
- The compilation of raw/ source documents into the wiki is explicitly about making heterogeneous documents legible to future LLM turns.
- Task structure
- Markdown files, indexes, backlinks, summaries, and Obsidian views give the work a manipulable structure.
- Reward richness
- The workflow has useful signals from links, consistency, and answer quality, but not an explicit reward signal.
- Feedback latency
- Feedback arrives through Q&A, rendered outputs, and health checks, but not usually as immediate pass/fail tests.
- Repairability
- Health checks, missing-data imputation, and filing outputs back into the wiki make the knowledge base incrementally repairable.
- Observability
- The wiki is human-readable markdown and images viewed in Obsidian, so the agent's knowledge substrate stays inspectable.
- Reversibility
- Markdown artifacts are versionable, though the post does not foreground git or rollback.
- Offline evaluability
- Some checks can be run offline over the wiki, but factual gaps still require web search or source refresh.
- Institutional ratification
- This is a personal research workflow rather than an organizational ratification system.
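The repairability and observability points above are easy to make concrete. As a minimal sketch (the post shows no code; the one-file-per-page layout, Obsidian-style wikilink syntax, and a `## Summary` section convention are all assumptions), a health check over such a wiki might flag broken backlinks and pages missing a summary:

```python
import re
from pathlib import Path

def health_check(wiki_dir: Path) -> list[str]:
    """Report broken [[wikilinks]] and pages missing a Summary section.

    Assumes one markdown file per wiki page and Obsidian-style links,
    i.e. [[Target]] or [[Target|alias]].
    """
    pages = {p.stem for p in wiki_dir.glob("*.md")}
    problems: list[str] = []
    for page in wiki_dir.glob("*.md"):
        text = page.read_text(encoding="utf-8")
        # Capture the link target, stopping at '|' (alias) or '#' (heading anchor).
        for target in re.findall(r"\[\[([^\]|#]+)", text):
            if target.strip() not in pages:
                problems.append(f"{page.name}: broken link to [[{target.strip()}]]")
        if "## Summary" not in text:
            problems.append(f"{page.name}: missing Summary section")
    return problems
```

The output is itself a legible artifact: an LLM turn can take the problem list and repair the wiki incrementally, which is the repair loop the entry describes.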
Why it matters
This is the Extended Frontier applied to knowledge work: the model's capability comes from a maintained corpus, indexes, summaries, visual outputs, and health checks that make research cumulative instead of ephemeral.
Annotation
Karpathy describes a knowledge-work harness, not just a note-taking habit. Raw sources go into one directory; an LLM incrementally compiles them into a markdown wiki with summaries, backlinks, concept pages, index files, and derived visualizations. Obsidian becomes the human-facing IDE, while the LLM owns most direct edits to the wiki.
The important move is that research outputs are not terminal chat answers. They become files: markdown notes, Marp slides, matplotlib images, search indexes, and follow-up articles that can be filed back into the corpus. Each query can make the next query easier because the knowledge base itself accumulates structure.
For the library, this is a clean example of capability as artifact maintenance. Karpathy expected to need "fancy RAG," but at roughly 100 articles and 400K words, LLM-maintained summaries and index files were enough. The boundary condition matters: the system works because the scale is still small enough for source-aware traversal and because the artifacts are legible.
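The "summaries and index files instead of fancy RAG" claim is easy to picture as a regeneration step the LLM (or a cron job) reruns after edits. A hedged sketch, assuming one markdown page per file and an `INDEX.md` convention the post does not actually specify:

```python
from pathlib import Path

def rebuild_index(wiki_dir: Path) -> str:
    """Regenerate INDEX.md: one line per page, using the page's first
    non-heading line as its summary. Layout conventions are assumptions."""
    lines = ["# Index", ""]
    for page in sorted(wiki_dir.glob("*.md")):
        if page.name == "INDEX.md":
            continue
        summary = next(
            (l.strip() for l in page.read_text(encoding="utf-8").splitlines()
             if l.strip() and not l.startswith("#")),
            "(no summary yet)",
        )
        lines.append(f"- [[{page.stem}]]: {summary}")
    return "\n".join(lines) + "\n"
```

At a few hundred articles, loading an index file like this into context is a cheap substitute for a retrieval pipeline, which is the boundary condition the entry points at.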
Extended Frontier Read
The raw model is not the unit of analysis. The useful system is model plus:
- a raw source archive,
- a compiled markdown wiki,
- index and summary files,
- Obsidian as inspection surface,
- generated outputs that feed back into the wiki,
- health checks over consistency and missing data,
- small custom tools such as a wiki search engine.
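The last item, a small wiki search engine, can be as simple as an inverted index over the markdown files. A minimal sketch under that assumption (the function names and AND-query semantics are illustrative, not Karpathy's actual tool):

```python
import re
from collections import defaultdict
from pathlib import Path

def build_index(wiki_dir: Path) -> dict[str, set[str]]:
    """Map each lowercase token to the set of page names containing it."""
    index: dict[str, set[str]] = defaultdict(set)
    for page in wiki_dir.glob("*.md"):
        for token in re.findall(r"[a-z0-9]+", page.read_text(encoding="utf-8").lower()):
            index[token].add(page.stem)
    return index

def search(index: dict[str, set[str]], query: str) -> set[str]:
    """Return pages containing every query token (simple AND search)."""
    tokens = re.findall(r"[a-z0-9]+", query.lower())
    if not tokens:
        return set()
    results = index.get(tokens[0], set()).copy()
    for t in tokens[1:]:
        results &= index.get(t, set())
    return results
```

A tool this small stays inspectable and rebuildable from the markdown alone, which is exactly the property the entry credits over a heavyweight RAG stack.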
This belongs beside harness entries, but it broadens the frame from coding agents to research agents. The same pattern appears: make the environment legible, let the model act on files, inspect the result, repair the substrate, and let work accumulate.
Open Questions
- At what corpus size does this stop working without stronger retrieval infrastructure?
- Which health checks are most predictive of useful future Q&A?
- Does finetuning on the wiki improve capability, or does it destroy the inspectability and repairability that make the workflow valuable?
Notes
Source text supplied by Daniel from X. This entry was prepared with Codex (OpenAI).
Related entries
- What Is an Agent Harness · Aparna Dhinakaran · 2026-04-21 · capability-is-extended, validation-is-constitutive, repairability-matters, observability-matters, grounding-context-loading, execution-harness, validation-harness, repair-harness, monitoring-harness, learning-harness, interface-harness
- An open-source spec for Codex orchestration: Symphony · Alex Kotliarskyi, Victor Zhu, and Zach Brock · 2026-04-26 · capability-is-extended, validation-is-constitutive, repairability-matters, observability-matters, execution-harness, validation-harness, repair-harness, monitoring-harness, learning-harness, interface-harness
- Hermes Agent README · Nous Research · 2026-04-28 · capability-is-extended, first-mile-input-formation, repairability-matters, observability-matters, grounding-context-loading, execution-harness, repair-harness, monitoring-harness, learning-harness, interface-harness
- Good and Bad Harness Engineering · Daniel Miessler · 2025-08-31 · capability-is-extended, repairability-matters, observability-matters, grounding-context-loading, execution-harness, validation-harness, repair-harness, monitoring-harness
Overlap is computed on tags, relation-to-argument, and harness types — not on role or domain, because contrasts are often the most useful neighbours.