The useful story this week is not a louder usage number; it is the price tag finally catching up to it. Two independent signals landed days apart and said the same thing: uncapped token spend stopped pencilling out, so the way AI gets billed is being rewritten.
Gartner now projects that by 2028 the tokens behind coding agents could cost more than the developers using them, and 6% of firms already pay over $2,000 per developer per month. On the vendor side, CFO Brew reports companies like Pegasystems and Intercom shifting from token-metered fees toward outcome-based pricing, because buyers stopped believing that more tokens equaled more value.
Put those together and tokenmaxxing flips from a bragging metric into a budget liability. When the meter charges per token, runaway volume is the buyer's problem; when it charges per resolved outcome, that volume becomes the vendor's problem. The change in who absorbs wasted loops is the most important thing happening in the space right now.
The reader rule for this issue: price your own work the way the market is starting to. Track cost per accepted task, not token volume, and treat every model default, retry cap, and cache setting as a line item on that bill.

