Token goes global, selling Chinese electricity to the world

A Smokeless Power War

Author: Black Lobster, Deep Tide TechFlow

In the summer of 1858, a copper-core cable crossed the Atlantic Ocean floor, connecting London and New York.

The significance of this event was never about transmission speed, but about power structures—who laid the submarine cable could siphon off the flow of information. The British Empire, through this global telegraph network, held the intelligence of colonies, cotton prices, and war news in its hands.

The empire’s strength was not only its fleet but also that cable.

Over 160 years later, this logic is being reenacted in an unexpected way.

By 2026, China’s large models are quietly consuming the global developer market. According to the latest data from OpenRouter, Chinese models account for 61% of token consumption among the top ten models on the platform, with the top three all from China. Developers in San Francisco, Berlin, and Singapore send API requests daily across the Pacific Ocean via undersea cables to Chinese data centers, where computing power is consumed, electricity flows, and results are returned.

Electricity has never left China’s power grid, but its value is delivered across borders through tokens.

AI Model Migration

On February 24, 2026, OpenRouter released weekly data: the top ten models on the platform consumed about 87 trillion tokens in total, with Chinese models dominating at 53 trillion tokens, accounting for 61%. MiniMax M2.5 led with 2.45 trillion tokens, followed by Kimi K2.5 and Zhipu GLM-5—all from China.

Latest data as of February 26

This is no coincidence; a single spark ignited everything.

Earlier this year, OpenClaw emerged—a truly functional open-source AI tool that can directly control computers, execute commands, and parallelize complex workflows. Within weeks, it surpassed 210,000 stars on GitHub.

Financial professional John was among the first to install OpenClaw, integrating it with the Anthropic API. He began automatically monitoring stock market information and promptly reporting trading signals. Hours later, he stared at his account balance in disbelief: just a few dollars—gone.

This is the new reality brought by OpenClaw. Previously, chatting with AI involved a few thousand tokens per conversation, costing almost nothing. After integrating OpenClaw, AI runs multiple sub-tasks in the background, repeatedly calling context and looping iterations, causing token consumption to grow exponentially rather than linearly. Bills accelerate like a car with its hood open, the fuel gauge dropping rapidly—there’s no stopping.

A “trick” quickly circulated among developer communities: using OAuth tokens to connect Anthropic or Google subscription accounts directly to OpenClaw, turning the monthly unlimited quota into free fuel for AI agents—this is a common approach among many developers.

Official countermeasures soon followed.

On February 19, Anthropic updated its terms, explicitly prohibiting the use of Claude subscription credentials with third-party tools like OpenClaw. To access Claude’s features, API billing must be used. Google also broadly banned subscription accounts accessing Antigravity and Gemini AI Ultra via OpenClaw.

“Long have the people suffered under Qin,” John then embraced domestic large models.

On OpenRouter, domestic models like MiniMax M2.5 scored 80.2% on software engineering tasks, while Claude Opus scored 80.8%. The difference is negligible. But the prices are worlds apart: the former costs $0.3 per million tokens at the input end, while the latter costs $5—about 17 times more.

John switched over, workflows continued, and bills shrank by an order of magnitude. This migration is happening globally in parallel.

OpenRouter’s COO Chris Clark stated plainly: The reason Chinese open-source models capture such a large market share is because they are disproportionately used in US developer workflows.

Power Goes Overseas

To understand the essence of token export, one must first grasp the cost structure of a token.

It appears lightweight—roughly 0.75 English words per token. A typical AI conversation consumes only a few thousand tokens. But when these tokens stack into trillions, the physical reality becomes heavy.

Breaking down token costs, there are only two core components: computing power and electricity.

Computing power is the depreciation of GPUs. Buying a Nvidia H100 costs about $30,000, and its lifespan amortized per inference is the depreciation cost. Electricity is the fuel for data center operation. When GPUs are fully loaded, each consumes about 700 watts, plus cooling costs. A large AI data center’s annual electricity bill can easily exceed hundreds of millions of dollars.

Now, map this physical process.

An American developer in San Francisco sends an API request. Data travels from California via undersea cables across the Pacific to a Chinese data center. GPU clusters start working, electricity flows from China’s grid to the chips, inference completes, and results are sent back. The entire process may take only one or two seconds.

Electricity never leaves China’s grid, but its value is delivered across borders through tokens.

Here’s a magical aspect that traditional trade cannot match: tokens have no physical form, no customs, no tariffs, and are not counted in any current trade statistics. China exports vast amounts of computing and electrical services, yet they are almost invisible in official trade data.

Tokens have become derivatives of electricity; token export is fundamentally electricity export.

This is also thanks to China’s relatively low electricity prices—about 40% lower than the US—an inherent physical cost advantage that competitors can easily replicate.

Moreover, Chinese AI large models also benefit from algorithms and “internal competition.”

DeepSeek V3’s MoE architecture activates only parts of the model during inference. Independent tests show its inference cost is about 36% lower than GPT-4o. MiniMax M2.5, with 229 billion total parameters, activates only 10 billion.

At the top level is internal competition: Alibaba, ByteDance, Baidu, Tencent, Moon’s Shadow, Zhipu, MiniMax… over a dozen companies racing on the same track, prices have long fallen below reasonable profit margins. Loss-leading has become industry norm.

In detail, this mirrors China’s manufacturing export strategy—leveraging supply chain advantages and industry internal competition to push token prices down sharply.

From Bitcoin to Tokens

Before tokens, there was another form of electricity export.

Around 2015, power plant operators in Sichuan, Yunnan, and Xinjiang began welcoming a strange new clientele.

These people rented abandoned factories, packed them with countless machines, and operated 24/7. The machines produced nothing but kept solving an endless math problem. Occasionally, this math problem would yield a Bitcoin.

This was the first form of electricity export: using cheap hydro and wind power, through mining rigs’ hashing computations, converting into globally circulating digital assets, then cashing out on exchanges for dollars.

Electricity never crossed borders, but its value, via Bitcoin, flowed into global markets.

During those years, China’s hash rate accounted for over 70% of global Bitcoin mining. China’s hydropower and coal power, through this circuitous route, participated in a global redistribution of capital.

In 2021, all this abruptly stopped. Regulatory crackdowns scattered miners, and hash rate migrated to Kazakhstan, Texas, and Canada.

But the logic itself never disappeared; it only waited for a new shell. When ChatGPT emerged, large models became the new battlefield. Former Bitcoin mining farms transformed into AI data centers, mining rigs into GPUs, and Bitcoin into tokens—only electricity remained unchanged.

Bitcoin’s overseas expansion and token export are structurally homologous, but today, tokens hold greater commercial value.

Mining is purely mathematical computation; Bitcoin produced is a financial asset. Its value derives from scarcity and market consensus, unrelated to “what it computes.” Computing power itself is non-productive, more like a trust mechanism byproduct.

Large model inference is different. GPUs consume electricity to produce real cognitive services—code, analysis, translation, creativity. The value of tokens directly stems from their utility to users. This is a deeper embedding: once a developer’s workflow depends on a model, switching costs grow exponentially over time.

Another key difference: Bitcoin mining was expelled from China, but token export is actively chosen by global developers.

Token Wars

The submarine cable laid in 1858 represented the British Empire’s sovereignty over the information highway—who owns the infrastructure can set the rules of the game.

Token export is similarly a war without declared combat, facing heavy resistance.

Data sovereignty is the first barrier. An API request from a US developer processed by a Chinese data center physically traverses China. For individual developers and small applications, this isn’t a problem. But for scenarios involving sensitive enterprise data, financial information, or government compliance, it’s a hard barrier. That’s why Chinese models have the highest penetration in developer tools and personal applications but are almost absent in core enterprise systems.

Chip bans are the second barrier. China’s AI development faces export controls on high-end Nvidia GPUs. MoE architectures and algorithm optimizations can only partially offset this disadvantage, and the ceiling remains.

But these obstacles are only the prologue. A larger battlefield is taking shape.

Tokens and AI models have become a new strategic arena between China and the US—comparable to the 20th-century semiconductor and internet wars, or even closer to an ancient metaphor: space race.

In 1957, the Soviet Union launched Sputnik, shocking the US. The US then launched the Apollo program, investing hundreds of billions of dollars to ensure it would not fall behind in space.

The logic of AI competition is eerily similar, but the intensity will far surpass the space race. Space is physical and imperceptible to most people; AI infiltrates the economic capillaries—every line of code, every contract, every government decision system may be running a large model from a certain country. Whose model becomes the default infrastructure for global developers? Who gains structural influence over the global digital economy?

This is precisely what makes China’s token export truly unsettling for Washington.

When a developer’s codebase, agent workflows, and product logic revolve around a Chinese model’s API, the cost of migration grows exponentially over time. Even if the US legislates restrictions, developers will resist actively—just as today’s programmers cannot abandon GitHub.

Today’s token export may only be the beginning of this long game. China’s large models have not claimed to overthrow anything; they simply deliver services at lower prices to every developer worldwide with an API key.

This time, the cables are laid by engineers coding in Hangzhou, Beijing, and Shanghai, and GPU clusters running day and night in some southern province.

There’s no countdown to this war; it’s ongoing 24/7, measured in tokens, fought on every developer’s terminal.

BTC-1,03%
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
0/400
No comments
  • Pin

Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)