Engineering Explainer

Practice

How leading engineering teams build, ship, and operate real systems — from their own blogs.

Practice

When the actor in your systems is an AI

Anthropic, Grab and Meta each shipped infrastructure work this week that, read together, sketches the early blueprint for running autonomous AI agents safely inside a real company.

3 papers

Practice

When something is always breaking

Three teams — Cloudflare, Netflix and Airbnb — on three faces of staying reliable at scale: how to undo a half-finished job, schedule a flood of work fairly, and push changes to thousands of servers without breaking any.

3 papers