Private AI
OpenHands+Devstral is 100% local and 100% open, and is SOTA for its category on SWE-Bench Verified, with 46.8% accuracy.
Added Aug 2, 2025
Context Window
32.8K
Max Output
8.2K
Input Price (Auto)
$0.060/1M tokens
Output Price (Auto)
$0.060/1M tokens
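As a rough illustration of what the listed prices mean per request, the sketch below estimates cost from token counts. It assumes simple linear per-token billing at the rates shown on this page, with no minimum charge or caching discount; `estimate_cost_usd` is a hypothetical helper name, not part of any provider API.

```python
# Prices as listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.060
OUTPUT_PRICE_PER_M = 0.060


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: assumes linear per-token billing with no minimums."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 24,000-token prompt with an 8,000-token reply.
    print(f"${estimate_cost_usd(24_000, 8_000):.5f}")
```

Under these assumptions, a request of that size costs a fraction of a cent, so per-request cost is dominated by how many requests an agent loop issues rather than by any single call.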
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
11.8
Auto routing is available for this model. Explicit provider selection is not available.
GPQA Diamond
Graduate-level scientific reasoning
43.4%
Better than 22% of models compared
HLE
Humanity's Last Exam
4.0%
Better than 13% of models compared
IFBench
Instruction-following benchmark
31.6%
Better than 18% of models compared
τ²-Bench Telecom
Conversational AI agents in dual-control scenarios
38.0%
Better than 47% of models compared
AA-LCR
Long context reasoning evaluation
26.7%
Better than 41% of models compared
SciCode
Python programming for scientific computing
24.5%
Better than 31% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
6.1%
AIME
American Invitational Mathematics Examination
6.7%
Better than 24% of models compared
Math-500
Diverse mathematical problem solving benchmark
68.4%
Better than 25% of models compared
MMLU-Pro
Professional and academic subject knowledge
63.2%
Better than 24% of models compared
Last updated Jun 25, 2026
Artificial Analysis
Better than 35% of models compared
LiveCodeBench
Contamination-free coding benchmark
25.8%
Better than 27% of models compared