news

China’s MiniMax, Moonshot top AI token use ranking, ending year of US dominance

SCMP reports that OpenRouter's token-usage rankings show a surge in demand for Chinese open-source models, with MiniMax (M2.5) and Moonshot (Kimi K2.5) leading by token usage after a wave of recent releases.

Published 2026-02-25 · Source: South China Morning Post

Why it matters

Usage rankings are becoming a market signal for where developer attention (and inference spend) is actually flowing. For teams optimizing cost and latency, the practical question is how to route across fast-moving open models, not which single "best" model to pick.

Tokenmaxxing read

Token usage isn't just a cost metric; it can also be a leading indicator of model adoption. It only becomes actionable, though, when paired with routing logic (quality gates, budget ceilings, and fallbacks) rather than chasing leaderboard churn.
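To make the three routing pieces concrete, here is a minimal Python sketch, not a reference implementation. Everything in it is hypothetical: the model names, prices, and eval scores are placeholders, `Candidate`, `plan_route`, and `call_with_fallback` are illustrative names, and `send` stands in for whatever client function actually calls a model.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Candidate:
    name: str                 # hypothetical model identifier
    usd_per_1k_tokens: float  # illustrative price, not a real quote
    eval_score: float         # score from your own eval suite, 0.0 to 1.0


def plan_route(candidates: List[Candidate], est_tokens: int,
               min_score: float, budget_usd: float) -> List[Candidate]:
    """Keep candidates that pass the quality gate and fit under the
    budget ceiling, ordered cheapest first."""
    eligible = [
        c for c in candidates
        if c.eval_score >= min_score
        and c.usd_per_1k_tokens * est_tokens / 1000 <= budget_usd
    ]
    return sorted(eligible, key=lambda c: c.usd_per_1k_tokens)


def call_with_fallback(route: List[Candidate],
                       send: Callable[[str], str]) -> Tuple[str, str]:
    """Try each candidate in order; on any error, fall back to the next."""
    errors = []
    for c in route:
        try:
            return c.name, send(c.name)
        except Exception as exc:  # broad on purpose: any failure triggers fallback
            errors.append((c.name, repr(exc)))
    raise RuntimeError(f"no eligible candidate succeeded: {errors}")


# Placeholder data (assumed values, for illustration only).
CANDIDATES = [
    Candidate("model-a", 0.30, 0.82),
    Candidate("model-b", 0.10, 0.74),  # cheapest, but fails a 0.8 quality gate
    Candidate("model-c", 1.20, 0.91),
]

if __name__ == "__main__":
    route = plan_route(CANDIDATES, est_tokens=4000, min_score=0.8, budget_usd=5.0)
    print([c.name for c in route])
```

The design choice worth noting is that the leaderboard never appears in the code: a model only enters the route by clearing your own eval threshold and budget, so a ranking flip changes the candidate list you test, not the routing decision itself.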

Source takeaway

SCMP highlights how quickly the leaderboard can flip when new models launch: OpenRouter's ranking shows Chinese models taking top spots by token usage, reinforcing that routing and continuous evaluation matter more than one-time model selection.

Related projects

Tools that match this angle

#1 · Direct
Routing

LiteLLM

BerriAI/litellm

An OpenAI-compatible gateway and SDK for calling many model providers with budgets, logging, load balancing, guardrails, and cost tracking.

48.9K · 8.5K · Source-available
gateway · cost-tracking · routing
#2 · Direct
Observability

Langfuse

langfuse/langfuse

Open-source LLM engineering platform for observability, traces, metrics, evals, prompt management, datasets, and playground workflows.

28.3K · 2.9K · Source-available
traces · evals · costs
#7 · Direct
Tokenization

tiktoken

openai/tiktoken

A fast BPE tokenizer for OpenAI models, useful for counting and estimating token usage before requests go out.

18.4K · 1.5K · MIT
token-counting · budgeting · openai
Related feed

More source-linked context

news · Source: eu.36kr.com

OpenRouter scale becomes a business-model story

36Kr's OpenRouter coverage is valuable as a business-model read on how high-volume model routing can turn token flow into platform leverage.

tokenmaxxing · model-router · pricing
agent · Source: NTB Kommunikasjon

OpenRouter funding puts router volume in the spotlight

The OpenRouter funding item is a clean router-market signal because it ties capital raised to reported weekly token volume and model access demand.

tokenmaxxing · model-router · pricing
news · Source: Menlo Ventures

OpenRouter Now Processes More Than a Quadrillion Tokens a Year | Menlo Ventures

Menlo Ventures argues OpenRouter is becoming a core multi-model routing layer, and highlights how routing, caching, and policy controls matter as token volumes surge.

tokenmaxxing · model-router · pricing