Knowledge OS: Render, Promote, and the Research Command

Posted on Apr 8, 2026

Three days ago ICO was at v0.1.5. Today it’s at v0.4.0. Two epics landed, three releases shipped, and the test count nearly tripled.

Epic 8: Render and Promote

Epic 8 delivered two distinct capabilities: turning compiled knowledge into Marp slide decks, and a promotion engine that gates which artifacts make it into the wiki.

Slide Deck Renderer

The renderer takes compiled knowledge and produces Marp-compatible markdown via Claude API. Marp is a solid choice here — it’s just markdown with slide separators, so the output is diffable, version-controllable, and doesn’t require a proprietary format.

Thirty-four tests cover the renderer. That’s a lot for a rendering pipeline, but the edge cases matter. Malformed frontmatter, missing fields, Claude API failures, oversized inputs — each one gets explicit coverage.

Promotion Engine

This is the more interesting piece. The promotion engine copies artifacts from outputs/ into wiki/, but only if they pass seven eligibility rules:

Path must exist
File must be readable
Frontmatter must parse
Type must be promotable
Confirm flag must be set
No draft status
No raw evidence files

Three of those are anti-pattern rejections — drafts, evidence artifacts, and unconfirmed outputs get blocked before they can pollute the wiki. The confirm flag is deliberate friction. You have to explicitly mark something as ready for promotion. No accidental publishes.

Why not just copy files from outputs/ to wiki/? Because a wiki is a published surface. Raw compiler output includes intermediate artifacts, drafts, and evidence files that are useful for debugging but wrong for consumption. The promotion engine is the quality gate between “the compiler produced this” and “a human has reviewed this and declared it wiki-ready.” Without it, every compile operation pollutes the wiki with unreviewed content.

The alternative was a manual process — review the outputs directory, hand-copy the good ones, edit frontmatter yourself. That works for 10 files. It breaks at 100. The rules engine automates the judgment while keeping the human confirmation step.

When promotion succeeds, the engine:

Computes SHA-256 hash of the source file
Mutates frontmatter on the target copy (adds promotion metadata)
Writes a database record
Emits a trace event
Creates a per-promotion JSONL audit file

When it fails, everything rolls back. No partial promotions. The audit file still gets written so you can see what was attempted and why it was rejected.

// Each promotion gets its own audit trail
const auditEntry = {
  timestamp: new Date().toISOString(),
  source: sourcePath,
  target: targetPath,
  sourceHash: sha256(sourceContent),
  status: 'promoted',
  rules: ruleResults,
};
appendFileSync(auditPath, JSON.stringify(auditEntry) + '\n');

Unpromote: Rollback Without Data Loss

The inverse operation is just as important. ico unpromote removes an artifact from the wiki without deleting the source. It strips the promotion metadata from the wiki copy, records the demotion in the database, and emits a trace event. The source file in outputs/ remains untouched.

Why not just delete the wiki file? Because deletion loses the audit trail. Unpromote preserves the record that this artifact was once promoted, who promoted it, when, and why it was later demoted. In a knowledge system, the history of what was considered authoritative and later reconsidered is itself valuable knowledge.

The unpromote operation also triggers a wiki index rebuild. The wiki’s table of contents, cross-references, and search index all update to reflect the removal. No orphan links.

This shipped as PR #9 and tagged v0.3.0.

Docs Fix: The Test Count Was Wrong

A small but telling commit: CLAUDE.md listed 292 tests. The actual count was 864. The discrepancy? The docs were only counting the CLI package. ICO has four packages, and nobody had summed them up.

Same commit fixed the CLI command count (13 to 14 — eval was listed as a stub but had been implemented) and clarified that claude_agent_sdk is planned for Epic 9, not yet installed.

Tagged v0.3.1. Small release, but accurate docs matter more than people think.

Epic 9: The Research Command

Epic 9 introduced ico research <brief> — a command that creates a scoped research task workspace. You give it a research question or topic, and it scaffolds:

A brief.md with YAML frontmatter (title, scope, created timestamp)
An audit log entry tracking when the research task was created
Support for both human-readable and JSON output formats

# Human output
ico research "Compare vector databases for sub-100ms latency"

# Machine output
ico research "Compare vector databases for sub-100ms latency" --json

The key design decision: research tasks are scoped workspaces, not loose files. Each ico research invocation creates a directory with a brief.md containing YAML frontmatter (title, scope, created timestamp, status). This means research tasks are discoverable — you can list all active research, see when each started, and track which ones produced outputs.

The --json flag matters for automation. When another tool (or a Claude Code session) invokes ico research, it gets structured output it can parse. The human-readable format is for terminal use. The JSON format is for pipeline integration.

# Human: readable summary
ico research "Compare vector databases for sub-100ms latency"

# Machine: structured JSON for downstream processing
ico research "Compare vector databases for sub-100ms latency" --json

Fourteen new tests brought the total to 878 passing. The PR review caught two issues: the command was calling process.exit() directly (bad for testability — switched to setting process.exitCode), and there were non-null assertions that could mask runtime errors. Both fixed before merge.

Tagged v0.4.0 as PR #10.

The Architecture Pattern

Nine epics across three days share a pattern worth naming: contract-first acceleration. Epic 1 froze 14 standards documents. Every subsequent epic implements against those contracts, not against a moving target. The promotion engine’s 7 rules? Derived from the content policy standard. The compiler’s 6 passes? Defined in the compiler spec. The research command’s output format? Matches the workspace schema.

This is why 878 tests exist after three days. When the contract is defined before the code, the tests write themselves — you’re testing conformance to a spec, not inventing behavior. And when a PR review catches issues (like the process.exit() problem in PR #10), the fix is scoped to one file because the contract boundary is clear.

Marketplace Housekeeping

Over in claude-code-plugins, three small but necessary changes:

Hyperfocus plugin (PR #514 by nextor2k). First community-contributed plugin merged this week. An ADHD-friendly output formatting skill grounded in W3C COGA guidelines and ATG Publishing ADHD Standards — no tool access, just formatting. Three modes: clean (reduced clutter), flow (optimized for sustained attention), and zen (minimal output). It restructures Claude’s output to reduce cognitive load: shorter paragraphs, clearer section breaks, explicit action items separated from context.

The initial version shipped with an empty allowed-tools field, which triggered a validator error. Formatting-only skills don’t need tool declarations — they transform output, not invoke tools. Second commit removed the invalid field. This is a good signal: the community is building skills that the validator catches correctly.

Claudebase source path fix. The marketplace catalog (marketplace.extended.json) had the source path listed as "external" instead of the actual directory path. This broke marketplace loading entirely — the schema validator rejected the non-path value, and the error cascaded to prevent any plugin from loading. Filed as issue #516, fixed within 2 hours. Small data integrity fix with outsized blast radius.

Velocity Check

Three days of ICO development:

Day	Epics	Version	Tests
Apr 6	1–5	v0.1.0–v0.1.5	292
Apr 7	6–7	v0.2.0–v0.2.1	557
Apr 8	8–9	v0.3.0–v0.4.0	878

Nine epics. Eight releases. 878 tests. The project has a full CLI, a knowledge compilation pipeline, a policy engine, slide deck rendering, a promotion engine with audit trails, and now a research command.

The test count growth is the metric I care about most. Features without tests are demos. Features with tests are products.

Related Posts: