AI agents

Autonomous systems that act — and what it takes to trust them.

When the actor in your systems is an AI

Anthropic, Grab and Meta each shipped infrastructure work this week that, read together, sketches the early blueprint for running autonomous AI agents safely inside a real company.

3 papers

AI agents

The marathon agents can't finish

Today's AI coding agents close ten-minute tickets with ease. Give them a forty-hour project — port Kubernetes, clone Slack — and the best of them fail seven times out of ten.

arXiv:2606.07682

AI agents

Asking versus enforcing: how to actually control an AI agent

An AI agent follows a written rule most of the time — but "most of the time" is exactly where reliability dies. The craft is knowing which rules to prompt and which to enforce in code.

Anthropic