Private AI
DeepSeek's 685B-parameter mathematical reasoning model with self-verification capabilities. Achieves gold-level scores on IMO 2025 and CMO 2024, plus 118/120 on Putnam 2024. Built on DeepSeek-V3.2-Exp-Base with a generator-verifier architecture for rigorous theorem proving.
Added Dec 3, 2025
Context Window
128.0K tokens
Max Output
65.5K tokens
Input Price (Auto)
$0.60/1M
Output Price (Auto)
$2.20/1M
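As a rough illustration of how the listed prices translate into a per-request cost, the following Python sketch multiplies token counts by the Auto rates shown above. The function name `estimate_cost` and the example token counts are hypothetical, and the sketch assumes the "/1M" prices are US dollars per one million tokens with no additional fees or discounts.

```python
# Prices copied from the listing (assumed: USD per 1M tokens, Auto routing).
INPUT_PRICE_PER_M = 0.60
OUTPUT_PRICE_PER_M = 2.20


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count by its per-million price, then sum.
    input_cost = input_tokens * INPUT_PRICE_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a long proof request with 100,000 input and 20,000 output tokens.
    cost = estimate_cost(100_000, 20_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Because output tokens cost several times more than input tokens at these rates, long generated proofs are likely to dominate the bill for reasoning-heavy requests.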
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
24.7
Auto routing is available for this model. Explicit provider selection is not available.
GPQA Diamond
Graduate-level scientific reasoning
75.1%
Better than 65% of models compared
HLE
Humanity's Last Exam
10.5%
Better than 67% of models compared
IFBench
Instruction-following benchmark
49.0%
Better than 59% of models compared
T²-Bench Telecom
Conversational AI agents in dual-control scenarios
78.9%
Better than 70% of models compared
AA-LCR
Long context reasoning evaluation
39.0%
Better than 53% of models compared
SciCode
Python programming for scientific computing
38.7%
Better than 71% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
32.6%
AIME 2025
American Invitational Mathematics Examination 2025
59.0%
Better than 56% of models compared
MMLU-Pro
Professional and academic subject knowledge
83.7%
Better than 86% of models compared
Last updated Jun 24, 2026
Artificial Analysis
Better than 78% of models compared
LiveCodeBench
Contamination-free coding benchmark
59.3%
Better than 66% of models compared