Posts
Deep Dive Part 3: A 12-Package Nested Monorepo That Watches AI Agents for You
Inside the IRSB Watchtower: a 12-package pnpm monorepo with evidence verification, behaviour-signal derivation, risk scoring, and auto-dispute — ~500 tests of deterministic monitoring, grounded in the payment-channel watchtower literature.
Deep Dive Part 2: Cryptographic Receipts and the Evidence Pipeline That Proves What AI Agents Actually Did
How the IRSB Solver creates SHA-256 evidence bundles, signs them with Cloud KMS, and posts cryptographic receipts on-chain — creating an unforgeable audit trail for AI-agent work, grounded in the audit-log integrity literature.
Deep Dive Part 1: Five On-Chain Enforcers That Make AI Agent Wallets Structurally Safe
How EIP-7702 delegation, five caveat enforcers, and bond staking create defense-in-depth guardrails for AI agent wallets — with 11 contracts live on Sepolia. Read against the smart-contract security literature and the agent-tool-use research.
Deep Dive Part 4: Building 10 Production Gems with Claude Code as Tech Lead
What it means when the AI makes the architectural decisions. How Claude Code served as tech lead across 10 Ruby gems with approximately 2,924 tests and 60+ canonical docs — read against the LLM-agent and software-engineering-automation literature.
Deep Dive Part 3: The Observability Loop — Teaching AI Tools to Improve Themselves
How the wild ecosystem's three-repo pipeline — telemetry, transcript normalization, and gap mining — creates a feedback loop that teaches AI tools what they are struggling with. Read against Dapper, fire-and-forget supervision, and the privacy-engineering literature.
Deep Dive Part 2: CLAUDE.md — The Missing Manual for Human–AI Software Collaboration
How per-repo CLAUDE.md files act as binding contracts between human architects and AI implementers. Read against the literature on long-context attention degradation, sycophancy, and module boundary information distribution.
Deep Dive Part 1: The Safety Architecture of Letting AI Agents Touch Your Production Rails Database
How the wild ecosystem makes it structurally impossible for AI agents to cause damage when accessing production Rails databases. Defense in depth, adversarial testing, and hard safety ceilings — read against the security-engineering and access-control literature.
Write Once, Publish Everywhere: Building Content Distribution Across Three Sites
Three sites now share content through automated pipelines: Hugo to Astro backfill, cross-repo plugin sync with repository_dispatch, and a CodeRabbit-caught no-op filter.
Usable, Not Just Functional: Entity Selection, Binary Eval, and 6 UX Fixes Across 4 Repos
40+ commits across 4 repos. CAD entity selection gets real bounding boxes, a binary eval framework bootstraps to v0.2.8, and the X triage plugin ships 6 UX features.
X Bug Triage Plugin: Zero to v0.4.3 in One Day
A brand-new MCP plugin that triages X/Twitter bug reports shipped 10 epics, 13 releases, 89 tests, and 4 extracted sub-agent skills in a single day. Three more SaaS packs also received quality repairs.