We Benchmarked MCP Against Code Generation. MCP Won (Mostly).

TL;DR: For a small, well-designed API (10 tools), structured MCP tool calls consistently outscore code generation on correctness — 0.99 vs 0.97 — and the gap concentrates in tasks where domain-specific logic matters. Adding a reference document to MCP tools costs 6–14% more tokens with zero accuracy gain. Cloudflare’s search+execute pattern matches MCP accuracy but uses more tokens. With MCP tools, Haiku is within 1% of Opus at 1/12 the cost. ...

March 18, 2026 · 12 min · npow

Finding the Human in the Machine

50+ open-source orgs are rebuilding how they evaluate contributors. Here’s what’s emerging.

March 6, 2026 · 8 min · npow

Hidden Technical Debt in Agentic Systems

Agentic systems are replaying hidden technical debt from early ML, and the missing control plane is where the biggest risk accumulates.

March 5, 2026 · 8 min · npow

Distillation Isn’t an Attack. It’s a Moat Test.

Distillation reveals which capabilities are economically reproducible.

March 2, 2026 · 2 min · npow

AI Didn't Replace Engineers. It Replaced the Signal.

Good code used to mean the engineer understood it. That’s no longer true.

February 24, 2026 · 7 min · npow