DeepSeek-V3
Open-weight Mixture-of-Experts language model with 671B total parameters (37B activated per token), developed by DeepSeek AI and released in December 2024.
Technical specification
Modalities
Capabilities (9), each with its category in parentheses
Reasoning★ (Reasoning)
Multi-step reasoning★ (Reasoning)
Long context★ (Reasoning)
Coding★ (Coding)
Function calling (Planning)
Structured output★ (Structured generation)
Multilingual★ (Language)
Planning★ (Planning)
Streaming output (Reasoning)
Architecture and technologies
Core Architecture
Training Techniques
Applications
As of December 2025, the DeepSeek API model deepseek-chat serves DeepSeek-V3.2, not the original DeepSeek-V3. The prices listed here are the original DeepSeek-V3 launch prices, kept for historical reference: input (cache hit) $0.07 per 1M tokens, input (cache miss) $0.27 per 1M tokens, output $1.10 per 1M tokens.
INPUT (cache miss): $0.27 per 1M tokens
OUTPUT: $1.10 per 1M tokens
CACHE (input cache hit): $0.07 per 1M tokens
Historical pricing for DeepSeek-V3 (launched December 2024), per 1M tokens: input cache hit $0.07, input cache miss $0.27, output $1.10.
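The per-million-token rates above translate into a request cost by simple proportion. The following is a minimal sketch of that arithmetic, not an official calculator: the function name `estimate_cost`, its parameters, and the token splits in the usage lines are hypothetical and chosen only for illustration. The rates are the historical DeepSeek-V3 prices quoted in this section.

```python
# Historical DeepSeek-V3 prices quoted above, in USD per 1M tokens.
PRICE_PER_MILLION = {
    "input_cache_miss": 0.27,
    "input_cache_hit": 0.07,
    "output": 1.10,
}


def estimate_cost(cache_miss_tokens=0, cache_hit_tokens=0, output_tokens=0):
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    usage = {
        "input_cache_miss": cache_miss_tokens,
        "input_cache_hit": cache_hit_tokens,
        "output": output_tokens,
    }
    # cost = sum(tokens * price_per_million) / 1,000,000
    total = sum(usage[kind] * PRICE_PER_MILLION[kind] for kind in usage)
    return total / 1_000_000


# 10K input tokens, all cache misses (an assumed split).
print(f"${estimate_cost(cache_miss_tokens=10_000):.4f}")  # $0.0027

# 8K cache-miss input, 2K cache-hit input, 1K output (an assumed split).
print(f"${estimate_cost(8_000, 2_000, 1_000):.4f}")  # $0.0034
```

Because cached input is billed at roughly a quarter of the cache-miss rate, the total for a given token count depends on how the tokens divide between cache hits, cache misses, and output, which is why the split has to be stated explicitly.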
