Adds extended, step‑by‑step reasoning for tougher coding, planning, and multi‑tool tasks. Ideal for long‑horizon agent workflows, complex problem solving, and scenarios that benefit from explicit thinking traces.
Added Sep 29, 2025
Context Window
1.0M
Max Output
64.0K
Input Price (Auto)
$2.99/1M
Output Price (Auto)
$14.99/1M
Cache Read (Auto)
$0.30/1M
Capabilities
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
43.0
Auto routing is available for this model. Explicit provider selection is not available.
Loading provider options…
Coding Index
38.6
GPQA Diamond
Graduate-level scientific reasoning
83.4%
Better than 88% of models compared
HLE
Humanity's Last Exam
17.3%
Better than 86% of models compared
IFBench
Instruction-following benchmark
57.3%
Better than 76% of models compared
T²-Bench Telecom
Conversational AI agents in dual-control scenarios
78.1%
Better than 75% of models compared
AA-LCR
Long context reasoning evaluation
65.7%
Better than 89% of models compared
SciCode
Python programming for scientific computing
44.7%
Better than 93% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
35.6%
Better than 88% of models compared
AIME 2025
American Invitational Mathematics Examination 2025
88.0%
Better than 86% of models compared
MMLU-Pro
Professional and academic subject knowledge
87.5%
Better than 98% of models compared
Last updated May 15, 2026
Artificial AnalysisLiveCodeBench
Contamination-free coding benchmark
71.4%
Better than 82% of models compared