Private AI
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, better roleplaying, reasoning, multi-turn conversation, and long context coherence. This 70B model is a competitive finetune of Llama-3.1-70B focused on aligning LLMs to the user with powerful steering capabilities.
Added Jan 7, 2026
Model weights
Context Window
65.5K
Max Output
8.2K
Avg output tokens (7d)
419 tokens
Input Price (Auto)
$0.43/1M
Output Price (Auto)
$0.43/1M
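As a rough illustration of how the per-token prices above turn into a per-request cost, here is a minimal sketch. It assumes the listed Auto prices apply uniformly to every token and that billing is linear in token count. The function name `estimate_cost` and the 2,000-token prompt are hypothetical, chosen only for the example; the 419-token output is the 7-day average listed above.

```python
# Prices taken from the listing above (USD per 1M tokens, Auto routing).
INPUT_PRICE_PER_M = 0.43
OUTPUT_PRICE_PER_M = 0.43


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request, assuming linear per-token billing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: a 2,000-token prompt plus the listed 7-day
# average output of 419 tokens.
cost = estimate_cost(2_000, 419)
print(f"Estimated cost: ${cost:.6f}")
```

Actual charges may differ if an explicit provider is selected instead of Auto routing.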
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
5.1
Agentic Index
10.0
GPQA Diamond
Graduate-level scientific reasoning
40.1%
Better than 18% of models compared
HLE
Humanity's Last Exam
4.1%
Better than 14% of models compared
CritPt
Research-level physics reasoning
0.0%
Better than 27% of models compared
SciCode
Python programming for scientific computing
23.1%
Better than 27% of models compared
LiveCodeBench
Contamination-free coding benchmark
18.8%
AIME
American Invitational Mathematics Examination
2.3%
Better than 13% of models compared
Math-500
Diverse mathematical problem solving benchmark
53.8%
Better than 17% of models compared
MMLU-Pro
Professional and academic subject knowledge
57.1%
Better than 19% of models compared
AA-Omniscience Accuracy
Proportion of correctly answered questions
18.2%
Better than 41% of models compared
Last updated Jun 17, 2026
Artificial Analysis
Better than 20% of models compared
AA-Omniscience Hallucination Rate
Rate of incorrect answers among non-correct responses
80.1%
Better than 52% of models compared
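To make the two AA-Omniscience definitions above concrete, the sketch below computes both from raw answer counts. It assumes every question ends up as exactly one of correct, incorrect, or abstained, so that "non-correct" means incorrect plus abstained; that split, the function names, and the counts are hypothetical and not taken from Artificial Analysis.

```python
def accuracy(correct: int, incorrect: int, abstained: int) -> float:
    """Proportion of all questions answered correctly."""
    total = correct + incorrect + abstained
    return correct / total if total else 0.0


def hallucination_rate(correct: int, incorrect: int, abstained: int) -> float:
    """Share of non-correct responses that are wrong answers rather than abstentions."""
    non_correct = incorrect + abstained
    return incorrect / non_correct if non_correct else 0.0


# Hypothetical counts for a 1,000-question run (illustrative only, not real data).
counts = {"correct": 182, "incorrect": 655, "abstained": 163}
print(f"accuracy: {accuracy(**counts):.1%}")
print(f"hallucination rate: {hallucination_rate(**counts):.1%}")
```

Under this reading, a model lowers its hallucination rate by abstaining when unsure, even if its accuracy stays the same.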