Posts 264 entriesPage 2 of 27

Posts

Technical Deep-Dive

Honest Perf Benchmarks for a Paid-API Compiler

Four PRs, three releases, and a benchmark suite that won't lie to you: seeded-RNG corpora, double-gated Claude scenarios, and skipped-but-recorded records.

Read
Technical Deep-Dive

Five Silent Failures in One Day

Five tools said PASS without doing the work — pr-prescreen, .gitignore, prettier, SSH deploy, and a regex. Each silent failure and the guard that catches it.

Read
DevOps

Deterministic First, LLM Second: An Advisory CI Pre-Screen

Wire an LLM into CI without veto power: the deterministic classifier is the product, the LLM is advisory polish, and the never-block contract retires the old system the same day.

Read
Technical Deep-Dive

Self-Improving Skills: Three Schema Versions in One Day

Backward-compatible schema upgrades for Claude skills: L0 metadata indexing, conditional visibility, and config prompts eliminate mid-task environment errors. 2783 skills.

Read
Technical Deep-Dive

Transitive CVE Clearance: The Dual-Layer Pattern

How v0.9.1 cleared 6 high-severity transitive CVEs in axios + fast-uri — and why the dep bump needed top-level overrides to actually stick.

Read
Technical Deep-Dive

Three Guards Against Shipping Slop

How three review guards — adversarial pre-flight, empirical verification, post-delivery sweep — catch slop shipping to partners. Seven PRs, one day, zero embarrassment.

Read
Technical Deep-Dive

Two False-Positive Fixes, Same Root Cause

Two separate false-positive alerts, same root cause: monitoring conjoined liveness with conditional behavior. Separating health signals quiets noise under load.

Read
Technical Deep-Dive

AGENTS.md as a Cross-Tool Plugin Brief: A Case Study from kobiton/automate

A 5-device parity sweep against kobiton/automate showed iOS screenshot capture ~17% faster than Android — but the more interesting finding is what an AGENTS.md file would close. A worked example of cross-tool plugin briefs done right.

Read
Architecture

Spec graduation: when a partner email rewrites architecture

A partner check-in forced a contract re-read, which clarified a content boundary, which unblocked a spec graduation that had been stalled for two weeks.

Read
Technical Deep-Dive

Coherence as a Deliverable: How a Multi-Surface Engagement Stays Sane

Why scattered Plane issues, beads, docs, and partner portals silently diverge — and four structural patterns that catch drift before it compounds.

Read