img.36krcdn.com

AI Got 1,000x Cheaper. Enterprise Bills Tripled. The Jevons Paradox Andreessen Predicted.

AI price collapse, Jevons Paradox, Marc Andreessen, US China AI competition, intelligence cost, DeepSeekBusiness

Marc Andreessen's 2026 Outlook: AI Timelines, US vs. China, and The Price of AI

2026 Early Interview with Marc Andreessen: AI Revolution Just Started, Intelligence Price Collapsing

a16z's 2026 Outlook: Shortages Will Eventually Lead to Surpluses

The price of a single AI token has fallen 1,000-fold in three years. What cost $60 per million tokens in the GPT-3 era now runs at just $0.06 for equivalent performance.

Yet enterprise AI bills have tripled. Prices fell — spending went up. This paradox has a name. In 1865, British economist William Stanley Jevons discovered that improvements in coal efficiency didn't reduce coal consumption — they caused it to explode. That's exactly what we're watching happen with AI.

3-Second Summary

Intelligence costs collapse 1,000x → Usage spikes 50–500x → China accelerates price war → Jevons Paradox kicks in → New rules of competition

Here's what everyone assumed

"If AI costs come down, our AI bill will shrink." Sounds obvious, right? If token prices drop by half, you should be able to do the same work for half the cost.

1,000x

AI inference cost drop (2021→2024)

320%

Enterprise AI spend increase (2024→2025)

Average enterprise AI bill increase

The actual data went the opposite direction. Enterprise AI spending jumped from $11.5B in 2024 to $37B in 2025 — a 320% increase — while per-token costs dropped 1,000x.

On the a16z podcast, Marc Andreessen called AI "the biggest tech revolution I've ever experienced." He also said "the unit cost of intelligence is collapsing faster than Moore's Law." And then — in the same breath — "the market is still very early".

Those two statements have to be read together. When costs drop, demand explodes — that's the heart of Jevons Paradox. A single agentic workflow burns 50 to 500x more tokens than a simple chat. Cheaper tokens unlock more complex tasks, which require even more tokens. And 72% of real AI costs aren't even in your model invoice — they're in orchestration, retries, and monitoring.

China lit the price war on fire

There's one more variable in this equation: China.

Andreessen openly acknowledged on the podcast that "DeepSeek surprised Silicon Valley". And the benchmarks back it up — DeepSeek, Kimi (Moonshot AI), Qwen (Alibaba), and ByteDance models reached near-parity with Claude 3.5 Sonnet and GPT-4o within just 12 months of release.

	Top US Models	Top Chinese Models
Overall Benchmark	Claude Opus 4.6 — 88pts	DeepSeek V4 Pro — 87pts
Coding Score	Claude: 93.9% (SWE-Bench)	DeepSeek: 91.2% (SWE-Bench)
API Price Gap	Baseline (100%)	~10–13x cheaper
Open Source	Closed source	Open weights, self-hostable

The performance gap is 5–7% on benchmarks. The cost gap is more than 10x. Andreessen sees this as "reshaping global price competition through an open-source strategy".

The age of Chinese AI researchers is also worth noting. As Andreessen mentioned, the lead researchers are 22–24 years old. That's not inexperience — it means they carry no preconceptions about existing paradigms. These teams pushed GPU efficiency to the extreme, achieving comparable performance at a fraction of the cost.

Following a16z's projection, the current GPU shortage will flip to oversupply within five years, as big tech companies build their own chips and AMD plus Chinese manufacturers scale production. When that happens, AI unit costs fall even further — and Jevons Paradox goes into overdrive.

Andreessen's core argument

He called AI "the biggest tech revolution of my lifetime" — bigger than the internet, comparable to electricity and the microprocessor. Yet he simultaneously said the market is "still very early". How can hundreds of millions of ChatGPT users still be "early"? His point: the product forms are still immature. Jevons Paradox hasn't even really started — far more use cases still have to emerge before it reaches full force.

What to audit in your business right now

Break down your AI spend structure
Separate your model API costs from everything else. 72% of real AI costs live outside the model invoice — in orchestration, retries, and monitoring. If your bill looks wrong, start here.
Simulate token consumption before deploying agents
Agentic workflows burn 50–500x more tokens than simple chats. Run a small-scale pilot first, measure actual tokens per task, and use that as your budget baseline.
A/B test open-source and Chinese models
DeepSeek V4 Pro is 5–7% behind top US models on benchmarks and 10–13x cheaper on API costs. Start with specific workflows rather than a wholesale switch. Self-hosting is an option too.
Set monthly AI budget caps
Uber burned through its annual AI budget in four months and had to impose a $1,500/employee monthly cap. Knowing about Jevons Paradox doesn't help if you have no hard spending limits. Set caps by team and function.
Reinvest savings into new automation
Don't book token price savings as cost reduction — reinvest them into new workflow automation. That's how you use Jevons Paradox offensively. If you're not planning this now, a competitor will capture that edge first.

More to explore

Marc Andreessen's 2026 Outlook: AI Timelines, US vs. China, and The Price of AI Andreessen's 81-minute a16z AMA podcast — his full take on intelligence cost collapse and the China competition a16z.com

2026 Early Interview with Marc Andreessen: AI Revolution Just Started, Intelligence Price Collapsing 36kr's English summary of Andreessen's key statements on cost collapse and the rise of Chinese AI 36kr.com

a16z's 2026 Outlook: Shortages Will Eventually Lead to Surpluses 36kr's analysis of the GPU oversupply thesis and structural changes in the AI chip market 36kr.com

The Inference Cost Paradox: Why Generative AI Spending Surged 320% Despite Per-Token Costs Dropping 1,000x A deep data analysis of how Jevons Paradox is playing out in enterprise AI spending arturmarkus.com

AI Token Cost Over Time: Down 99.7%, Bills Up 3x NavyaAI's data report on the token price paradox navyaai.com

Best Chinese LLMs in 2026: DeepSeek V4, Kimi K2.6, Qwen and Every Model Ranked Benchmark comparisons between Chinese and US AI models, with cost analysis benchlm.ai

FAQ

What exactly is the Jevons Paradox?

It's an economic principle discovered by British economist William Stanley Jevons in 1865. He found that improvements in coal efficiency actually caused coal consumption to surge, not reduce. The same dynamic applies to AI — as token prices fall, more use cases become economically viable, total consumption explodes, and spending goes up even as unit costs go down.

Is it actually safe to use Chinese AI models in production?

It depends on your data sensitivity and compliance requirements. Chinese models like DeepSeek are open-weight, meaning you can self-host them on your own servers without sending data to Chinese companies. That said, if you handle medical, financial, or government data, consult your legal team first. A practical starting point: A/B test Chinese models on non-sensitive workflows before making any broader decisions.

When my AI spend is growing faster than expected, where should I look first?

Look outside the model invoice first. 72% of real AI costs are in infrastructure — orchestration, retries, monitoring — not the model API itself. If you're running agentic workflows, measure actual tokens consumed per task. The number is often 50–500x more than you'd estimate from a simple chat interaction.

Is the GPU shortage really going to flip to oversupply in five years?

That's Andreessen and a16z's projection. The logic is sound: big tech companies are building their own AI chips (Meta MTIA, Google TPU, Microsoft Maia), while AMD and Chinese manufacturers are scaling alternatives. Technology infrastructure has historically followed the pattern of early scarcity → massive investment → oversupply. The exact timeline is uncertain, but the directional bet has broad industry consensus.

Written by Rush

Tracking where business meets AI.

Did you find this reference helpful?

Get curated references delivered to your inbox weekly

Share this reference

Top 20% of Companies Capture 74% of AI's Economic Value — PwC's 1,217-Executive Study Reveals the Real Gap

PwC's 2026 AI Performance Study shows that 74% of AI's economic value is captured by just 20% of companies. Here's what AI leaders do differently and how to close the gap.

Explore more AI workflow guides on similar topics