Search
Posts
Honest Perf Benchmarks for a Paid-API Compiler
Four PRs, three releases, and a benchmark suite that won't lie to you: seeded-RNG corpora, double-gated Claude scenarios, and skipped-but-recorded records.
ReadFive Silent Failures in One Day
Five tools said PASS without doing the work — pr-prescreen, .gitignore, prettier, SSH deploy, and a regex. Each silent failure and the guard that catches it.
ReadDeterministic First, LLM Second: An Advisory CI Pre-Screen
Wire an LLM into CI without veto power: the deterministic classifier is the product, the LLM is advisory polish, and the never-block contract retires the old system the same day.
ReadSelf-Improving Skills: Three Schema Versions in One Day
Backward-compatible schema upgrades for Claude skills: L0 metadata indexing, conditional visibility, and config prompts eliminate mid-task environment errors. 2783 skills.
ReadTransitive CVE Clearance: The Dual-Layer Pattern
How v0.9.1 cleared 6 high-severity transitive CVEs in axios + fast-uri — and why the dep bump needed top-level overrides to actually stick.
ReadThree Guards Against Shipping Slop
How three review guards — adversarial pre-flight, empirical verification, post-delivery sweep — catch slop shipping to partners. Seven PRs, one day, zero embarrassment.
ReadTwo False-Positive Fixes, Same Root Cause
Two separate false-positive alerts, same root cause: monitoring conjoined liveness with conditional behavior. Separating health signals quiets noise under load.
ReadAGENTS.md as a Cross-Tool Plugin Brief: A Case Study from kobiton/automate
A 5-device parity sweep against kobiton/automate showed iOS screenshot capture ~17% faster than Android — but the more interesting finding is what an AGENTS.md file would close. A worked example of cross-tool plugin briefs done right.
ReadSpec graduation: when a partner email rewrites architecture
A partner check-in forced a contract re-read, which clarified a content boundary, which unblocked a spec graduation that had been stalled for two weeks.
ReadCoherence as a Deliverable: How a Multi-Surface Engagement Stays Sane
Why scattered Plane issues, beads, docs, and partner portals silently diverge — and four structural patterns that catch drift before it compounds.
Read