news

Silicon Valley's AI token craze is facing a reality check

Business Insider says the gamified token-leaderboard era is yielding to efficiency-maxxing: Amazon told staff not to use AI for its own sake, Copilot moved to usage-based billing, and labs now compete on intelligence per dollar.

Published 2026-06-01 · Source: Business Insider

Why it matters

Metered billing is spreading just as executives demand outcome evidence — the subsidized-usage era is ending and real bills are landing. Visa now rewards outcome-producing teams with internal points, not raw usage.

Tokenmaxxing read

Jeen CEO Oded Tahori's proposal inverts the leaderboard: give bigger token budgets to employees whose AI work demonstrably benefits the business, and cut budgets for waste. That's spend governance expressed in the same currency that caused the problem.

Source takeaway

ACF Investors' Tim Mills calls the clampdown a constructive utility check rather than a bubble signal — tokenmaxxing distorted usage statistics for non-productive reasons, so reining it in restores the data everyone prices the AI trade on.

Related projects

Tools that match this angle

#1 · Direct
Routing

LiteLLM

BerriAI/litellm

An OpenAI-compatible gateway and SDK for calling many model providers with budgets, logging, load balancing, guardrails, and cost tracking.

50.2K · 8.8K · Source-available
gateway · cost-tracking · routing
#2 · Direct
Observability

Langfuse

langfuse/langfuse

Open-source LLM engineering platform for observability, traces, metrics, evals, prompt management, datasets, and playground workflows.

29K · 3K · Source-available
traces · evals · costs
#5 · Direct
Evaluation

promptfoo

promptfoo/promptfoo

A CLI and CI workflow for testing prompts, agents, and RAG systems across models, with evals and red-team style checks.

22.1K · 2K · MIT
prompt-evals · ci · rag
Related feed

More source-linked context

news · medium review

“Tokenmaxxing is real, expensive & it’s spreading”: AI budgets are exploding - The New Stack

AI accountability startup Lanai debuted Token Tuner, a beta tool that scores each employee's efficiency by matching token usage and model choice to task complexity. In one beta, peers burned 10x the tokens for half the efficiency.

ai-spend · cost-governance · explainer
news

‘Tokenmaxxing’ Is the New Quiet Quitting—Here’s the Fix - SUCCESS Magazine

SUCCESS argues tokenmaxxing-style adoption targets create performative AI usage. Their fix is to measure outcomes and quality, not raw token volume.

tokenmaxxing · explainer · workplace-ai
news

‘Nobody has budgeted’ for tokenmaxxing, Box’s Levie says

Box CEO Aaron Levie told Semafor that AI coding costs 'just showed up overnight' once 10,000 of his engineers piled onto Claude Code, and warned that 'nobody has budgeted' for the bills now hitting enterprises.

tokenmaxxing · explainer · workplace-ai