DeepSeek has been the pricing disruptor of 2026. While Anthropic charges $10/M input for Claude Fable 5 and OpenAI charges $5/M for GPT-5.6 Sol, DeepSeek V4 Pro delivers competitive benchmark scores at $0.435/M input— that's 23× cheaper than Fable 5 and 11× cheaper than Sol. And the cache read pricing is even more extreme: $0.003625/M, making it 120× cheaper than uncached input.

DeepSeek V4 Model Lineup

DeepSeek offers two models in the V4 family, both from the same architecture but at different speed/cost trade-offs:

	DeepSeek V4 Pro	DeepSeek V4 Flash
Input $/M	$0.435	$0.14
Output $/M	$0.87	$0.28
Cache Read $/M	$0.003625	$0.0028
Context Window	1M	1M
Max Output	128K	64K
SWE-bench	80.6	79.0
Thinking/Reasoning	✅	❌

The key takeaway: V4 Flash scores only 1.6 points lower on SWE-bench than V4 Pro, while costing 3× less on input and 3× less on output. For most applications, V4 Flash is the better deal.

The Cache Read Magic

DeepSeek's cache read pricing — $0.003625/M for V4 Pro — is where the economics get absurd:

Uncached input: 5M tokens × $0.435/M = $2.18

Cached input (90% hit rate): 0.5M × $0.435 + 4.5M × $0.003625 = $0.23

That's a 90% reduction with realistic cache hit rates.

Compare this to the same scenario on Claude Fable 5 vs DeepSeek V4 Pro: with 90% cache hits, Fable 5 costs $7.25 for input vs DeepSeek's $0.23 — that's a 31× difference for models with near-identical SWE-bench scores.

Where DeepSeek Wins

High-volume chat/coding apps — The combination of low base pricing and extreme cache discounts makes DeepSeek the cheapest option at any scale.
Coding assistants with fixed system prompts — 90%+ cache hit rates are typical, and DeepSeek rewards this pattern more than any other provider.
Startups & indie devs on a budget — You can serve millions of tokens per month for single-digit dollar costs.

Where DeepSeek Falls Short

No image/audio/video support — Text-only. If you need multimodal, look at Gemini vs DeepSeek.
No batch processing— OpenAI and Anthropic offer 50% discounts for batch workloads. DeepSeek doesn't.
Cache writes cost the same as uncached input — Unlike Anthropic/OpenAI where cache writes cost 1.25×. This means your first request with new content is more expensive, but subsequent cached reads more than make up for it.

The bottom line: if you're building a text-only application and optimizing for cost, DeepSeek is almost impossible to beat. Check the full DeepSeek V4 Pro vs GPT-5.4 comparison for a detailed side-by-side.

DeepSeek API Pricing Explained: Why It Costs 50× Less Than GPT-5

DeepSeek V4 Model Lineup

The Cache Read Magic

Where DeepSeek Wins

Where DeepSeek Falls Short