DEEPSEEK V4 MODELS UNDERCUT COMPETITION ON PRICING

AI DESK | 2 MIN READ
FRI, APR 24, 2026

DeepSeek released two new AI models with aggressive pricing that undercuts comparable offerings. V4 Pro costs $1.74 per million input tokens and $3.48 per million output tokens, while V4 Flash runs at $0.14 and $0.28 per million input and output tokens, respectively.

DeepSeek's latest models establish new price floors across their performance tiers. The V4 Flash variant, positioned as the budget option, costs significantly less than competing budget models from other providers. At $0.14 per million input tokens, it delivers substantial savings for high-volume token processing.

The V4 Pro model targets users who need higher capability, priced at $1.74 per million input tokens and $3.48 per million output tokens. That makes it the cheapest option in its performance class, according to analysis by Simon Willison.

The pricing structure reflects a widening gap in AI model costs. DeepSeek's aggressive rates contrast with established providers like OpenAI and Anthropic, which charge notably higher per-token rates across their model lineups. The strategy extends DeepSeek's push into Western markets, where cost-conscious developers and enterprises weigh price when selecting models.

Output tokens cost double the input rate for both models. Charging more for output than for input is common in the AI API market and reflects the higher computational cost of generation.

The release signals continued competition around AI model pricing and accessibility. As more models enter the market, per-token costs have trended downward, though substantial gaps remain between premium and budget tiers. DeepSeek's positioning suggests the company is prioritizing market penetration through low prices over a premium position.

For developers evaluating models, the pricing advantage must be weighed alongside model capability, latency, and reliability. Both V4 variants give builders options for balancing cost against performance requirements across different use cases.
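
To make the per-token figures concrete, here is a rough Python sketch that estimates the bill for a given token volume at the rates quoted above. The model labels (`v4-pro`, `v4-flash`), the `estimate_cost` helper, and the sample workload are illustrative assumptions, not DeepSeek API identifiers or published usage data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


# Rates as quoted in this article. The dictionary keys are illustrative
# labels only (assumed), not official API model names.
PRICES = {
    "v4-pro": Pricing(input_per_million=1.74, output_per_million=3.48),
    "v4-flash": Pricing(input_per_million=0.14, output_per_million=0.28),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    p = PRICES[model]
    total = (input_tokens * p.input_per_million
             + output_tokens * p.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical monthly workload: 50M input tokens, 10M output tokens.
    workload = {"input_tokens": 50_000_000, "output_tokens": 10_000_000}
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, **workload):.2f}")
```

For that hypothetical workload, the estimate comes to about $121.80 on V4 Pro and about $9.80 on V4 Flash. The sketch ignores anything beyond the flat per-token rates, such as discounts or caching, which the article does not cover.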
