Posts 264 entriesPage 7 of 27

Posts

IRSB Deep Dive

Deep Dive Part 3: A 12-Package Nested Monorepo That Watches AI Agents for You

Inside the IRSB Watchtower: a 12-package pnpm monorepo with evidence verification, behaviour-signal derivation, risk scoring, and auto-dispute — ~500 tests of deterministic monitoring, grounded in the payment-channel watchtower literature.

Read
IRSB Ecosystem Deep Dive

Deep Dive Part 2: Cryptographic Receipts and the Evidence Pipeline That Proves What AI Agents Actually Did

How the IRSB Solver creates SHA-256 evidence bundles, signs them with Cloud KMS, and posts cryptographic receipts on-chain — creating an unforgeable audit trail for AI-agent work, grounded in the audit-log integrity literature.

Read
IRSB Deep Dive

Deep Dive Part 1: Five On-Chain Enforcers That Make AI Agent Wallets Structurally Safe

How EIP-7702 delegation, five caveat enforcers, and bond staking create defense-in-depth guardrails for AI agent wallets — with 11 contracts live on Sepolia. Read against the smart-contract security literature and the agent-tool-use research.

Read
Wild Ecosystem Deep Dive

Deep Dive Part 4: Building 10 Production Gems with Claude Code as Tech Lead

What it means when the AI makes the architectural decisions. How Claude Code served as tech lead across 10 Ruby gems with approximately 2,924 tests and 60+ canonical docs — read against the LLM-agent and software-engineering-automation literature.

Read
Wild Ecosystem Deep Dive

Deep Dive Part 3: The Observability Loop — Teaching AI Tools to Improve Themselves

How the wild ecosystem's three-repo pipeline — telemetry, transcript normalization, and gap mining — creates a feedback loop that teaches AI tools what they are struggling with. Read against Dapper, fire-and-forget supervision, and the privacy-engineering literature.

Read
Wild Ecosystem Deep Dive

Deep Dive Part 2: CLAUDE.md — The Missing Manual for Human–AI Software Collaboration

How per-repo CLAUDE.md files act as binding contracts between human architects and AI implementers. Read against the literature on long-context attention degradation, sycophancy, and module boundary information distribution.

Read
Wild Ecosystem Deep Dive

Deep Dive Part 1: The Safety Architecture of Letting AI Agents Touch Your Production Rails Database

How the wild ecosystem makes it structurally impossible for AI agents to cause damage when accessing production Rails databases. Defense in depth, adversarial testing, and hard safety ceilings — read against the security-engineering and access-control literature.

Read
Technical Deep-Dive

Write Once, Publish Everywhere: Building Content Distribution Across Three Sites

Three sites now share content through automated pipelines: Hugo to Astro backfill, cross-repo plugin sync with repository_dispatch, and a CodeRabbit-caught no-op filter.

Read
Technical Deep-Dive

Usable, Not Just Functional: Entity Selection, Binary Eval, and 6 UX Fixes Across 4 Repos

40+ commits across 4 repos. CAD entity selection gets real bounding boxes, a binary eval framework bootstraps to v0.2.8, and the X triage plugin ships 6 UX features.

Read
Technical Deep-Dive

X Bug Triage Plugin: Zero to v0.4.3 in One Day

A brand-new MCP plugin that triages X/Twitter bug reports shipped 10 epics, 13 releases, 89 tests, and 4 extracted sub-agent skills in a single day. Plus three more SaaS packs got quality-repaired.

Read