Why it matters
This is the most concrete public account of agentic token waste from the team that debugs it — live, on calls with users, from locally stored transcripts. Dario Amodei's framing explains the squeeze: Anthropic planned for 10x annual growth and got 80x.
Tokenmaxxing read
Wu's two waste patterns are checkable today: /usage flags broken-cache sessions and heavy subagent fan-out — audit both before blaming the model for quota burn. The team's stated north star is intelligence per token.
Source takeaway
Token efficiency is managed at the harness layer, not the model layer: Anthropic defaults to not shipping a tool unless it clearly improves token performance or accuracy.