Private AI
Nvidia's latest Llama fine-tune, optimized for instruction following. Early results hint that it might outperform models such as GPT-4o and Claude 3.5 Sonnet.
Added Apr 15, 2025
Context Window
16.4K
Max Output
8.2K
Input Price (Auto)
$0.36/1M
Output Price (Auto)
$0.41/1M
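As a rough illustration of what the per-million-token prices above mean for a single request, here is a minimal sketch. It assumes billing is strictly linear in token count, with no minimum charge or rounding, which the listing does not confirm. The function name `estimate_cost` and the example token counts are hypothetical.

```python
# Prices copied from the listing above (USD per 1M tokens, auto routing).
INPUT_PRICE_PER_M = 0.36
OUTPUT_PRICE_PER_M = 0.41


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: cost scales linearly with tokens; real billing may differ.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
    print(f"${estimate_cost(10_000, 2_000):.6f}")
```

Under these assumptions, a request of that size costs well under one cent, and output tokens are only slightly more expensive than input tokens.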
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
8.5
Auto routing is available for this model. Explicit provider selection is not available.
GPQA Diamond
Graduate-level scientific reasoning
51.5%
Better than 30% of models compared
HLE
Humanity's Last Exam
4.2%
Better than 17% of models compared
IFBench
Instruction-following benchmark
39.0%
Better than 37% of models compared
τ²-Bench Telecom
Conversational AI agents in dual-control scenarios
19.0%
Better than 17% of models compared
AA-LCR
Long context reasoning evaluation
24.3%
Better than 39% of models compared
SciCode
Python programming for scientific computing
29.9%
Better than 46% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
6.8%
AIME 2025
American Invitational Mathematics Examination 2025
3.0%
Better than 4% of models compared
AIME
American Invitational Mathematics Examination
21.3%
Better than 48% of models compared
MMLU-Pro
Professional and academic subject knowledge
73.2%
Better than 43% of models compared
Last updated Jun 25, 2026
Artificial Analysis
Better than 38% of models compared
LiveCodeBench
Contamination-free coding benchmark
30.5%
Better than 36% of models compared
Math-500
Diverse mathematical problem solving benchmark
70.3%
Better than 30% of models compared