news

“Tokenmaxxing is real, expensive & it’s spreading”: AI budgets are exploding - The New Stack

AI accountability startup Lanai debuted Token Tuner, a beta that scores each employee's efficiency by matching token usage and model choice to task complexity — peers burned 10x the tokens for half the efficiency in one beta.

Published 2026-05-27Source: The New Stack
Generated Tokenmaxxing editorial thumbnail for “Tokenmaxxing is real, expensive & it’s spreading”: AI budgets are exploding - The New Stack

Why it matters

This is the usage leaderboard inverted: ranking by outcomes per token instead of tokens consumed. Lanai claims it needs no custom instrumentation, attributing intent, value, and cost at the interaction level from prompt and tool activity alone.

Tokenmaxxing read

The scoring is grounded in observed outcomes, not synthetic benchmarks — like other teams finishing the same workflow on a cheaper model with equal success. Premium models for email replies tank your score: outcomemaxxing tooling, on schedule.

Source takeaway

Lanai CPO Mohit Mehta positions Token Tuner against the Kong/LiteLLM/Dynatrace budget-enforcement crowd by adding the missing context layer: which workflows the tokens actually bought.

Topic links

Related projects

Tools that match this angle

#1Direct
Routing

LiteLLM

BerriAI/litellm

An OpenAI-compatible gateway and SDK for calling many model providers with budgets, logging, load balancing, guardrails, and cost tracking.

50.4K8.9KSource-available
gatewaycost-trackingrouting
#2Direct
Observability

Langfuse

langfuse/langfuse

Open-source LLM engineering platform for observability, traces, metrics, evals, prompt management, datasets, and playground workflows.

29.1K3KSource-available
tracesevalscosts
#5Direct
Evaluation

promptfoo

promptfoo/promptfoo

A CLI and CI workflow for testing prompts, agents, and RAG systems across models, with evals and red-team style checks.

22.2K2KMIT
prompt-evalscirag
Related feed

More source-linked context

Business Insider source artwork
newsBI
newsmedium review

Silicon Valley's AI token craze is facing a reality check

Business Insider says the gamified token-leaderboard era is yielding to efficiency-maxxing: Amazon told staff not to use AI for its own sake, Copilot moved to usage-based billing, and labs now compete on intelligence per dollar.

cost-governanceexplainermetrics
Read note
Generated Tokenmaxxing editorial thumbnail for Amazon deletes devs’ tokenmaxxing leaderboard to minimize costs - InfoWorld
newsI
news

Amazon deletes devs’ tokenmaxxing leaderboard to minimize costs - InfoWorld

Amazon reportedly pulled an unofficial internal leaderboard that ranked employees by AI usage after it drove wasteful behavior and higher compute bills—workers started spinning up agents just to climb the rankings.

tokenmaxxingcost-governanceai-spend
Read note
Generated Tokenmaxxing editorial thumbnail for Tokenmaxxing is dead. It didn't produce the AI ROI companies wanted. - Fortune
newsF
newsmedium review

Tokenmaxxing is dead. It didn't produce the AI ROI companies wanted. - Fortune

Fortune's Jeremy Kahn argues the tokenmaxxing era ended nearly as fast as it began: Meta, Amazon, Microsoft, and Uber retired token-usage incentives once spend outran provable returns.

ai-spendexplainermetrics
Read note