Search
Field notes from building AI in production.
Daily case studies, deep-dives, and operator-grade write-ups on AI engineering, DevOps, and the messy reality of shipping software that touches LLMs. Written from the cab, the office, and the trenches β by Jeremy Longshore at Intent Solutions.
Curated long reads
Agent-Native Mobile Testing
A three-chapter series on plugin authoring, first-mover dynamics, and AI triage for agent-orchestrated mobile cloud testing β worked through kobiton/automate as the running case study.
in-progressBuilding MCP Servers in Production
A three-chapter case study on shipping production-grade Model Context Protocol servers β synthesized from the daily case-study posts that documented the work.
Recent posts
AGENTS.md as a Cross-Tool Plugin Brief: A Case Study from kobiton/automate
A 5-device parity sweep against kobiton/automate showed iOS screenshot capture ~17% faster than Android β but the more interesting finding is what an AGENTS.md file would close. A worked example of cross-tool plugin briefs done right.
ReadSpec graduation: when a partner email rewrites architecture
A partner check-in forced a contract re-read, which clarified a content boundary, which unblocked a spec graduation that had been stalled for two weeks.
ReadCoherence as a Deliverable: How a Multi-Surface Engagement Stays Sane
Why scattered Plane issues, beads, docs, and partner portals silently diverge β and four structural patterns that catch drift before it compounds.
ReadForge Dogfood Ships a Grade-A Plane Plugin, JRig Loop Closes
First end-to-end /skill-creator --forge dogfood produced a Grade A 97/100 Plane plugin while the JRig-Verified provenance loop closed across schema, build pipeline, page target, homepage surface, and a new validator tier.
ReadGuidewire MCP: v0.1.0 β v0.1.1 in 76 minutes
v0.1.0 shipped at 19:14 Mountain, surfaced an install-path defect at 20:30, patched in v0.1.1. The architectural insight: dual Postgres pools for tamper-resistant audit logs.
ReadThe Two Postgres Bugs the Tests Caught: A Real-DB Integration Test Case Study
A no-mocks testcontainers policy caught two production-fatal Postgres bugs in one test run β PG 15's schema USAGE removal and an asymmetric SELECT grant for a state-machine-driving sink.
ReadGuidewire MCP v0.1.0: Carrier-Native Server Blueprint
How v0.1.0 of guidewire-mcp-for-claude shipped six foundation packages, five carrier-vocabulary tools, and a 30k-word blueprint in one day.
ReadSeries & ecosystems
MCP for Beginners
End-to-end Model Context Protocol curriculum β solutions in Python, TypeScript, Java, Rust, C#, .NET β translated across six languages.
Agentic Design Patterns
Patterns and anti-patterns for production agent systems. Decision frameworks, prompt scaffolds, evaluation harnesses.
Tiny Recursive Models
Building small recursive systems where simple loops compose into emergent behavior. Series in progress.
IRSB Ecosystem
Intent Solutions release tooling β the open-source family of plugins, skills, and packages.
Wild Ecosystem
The shared-GCP, multi-MCP family of standalone integrations powered by a common platform spine.
Research & Curriculum
Long-form research articles, learning paths, and reading lists for the AI builder community.
Retrospectives
April 2026: Tier 3 Finds Its Footing β Eight Case Studies, the Audit Harness Ships, and the Schema Postmortem
April 2026 retrospective β 30 posts, 162 commits, eight Tier 3 case studies (the first real month of Tier 3 publishing), audit-harness v0.1.0 shipped, the rubric-on-spec postmortem set the architectural rule, and the daily classification pipeline ran clean for half the month.
March 2026March 2026: 35 Posts, the Eval Framework That Shipped 10 Epics in a Day, and a Meta-Milestone
March 2026 retrospective β 35 posts published, 340 commits, j-rig binary eval framework shipped 10 epics in one day, and the tier classification system itself was designed and built.