Private AI
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with a 1M-token context window, built for fast inference, high-throughput workloads, reasoning, coding, and agent workflows.
Added Apr 24, 2026
Model weights
Context Window
1.0M
Max Output
384.0K
Avg output tokens (7d)
171 tokens
Input Price (Auto)
$0.10/1M
Output Price (Auto)
$0.21/1M
Cache Read (Auto)
$0.021/1M
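As a rough illustration of how the listed Auto prices combine, the sketch below estimates the cost of a single request. It assumes cached input tokens are billed at the Cache Read rate instead of the Input rate, and that billing is linear per token; the listing does not confirm either. The function name `estimate_cost` and its parameters are hypothetical, not part of any provider API.

```python
from decimal import Decimal

# Listed Auto-routing prices in USD per 1M tokens (taken from the listing).
INPUT_PRICE = Decimal("0.10")
OUTPUT_PRICE = Decimal("0.21")
CACHE_READ_PRICE = Decimal("0.021")
PER_MILLION = Decimal(1_000_000)


def estimate_cost(uncached_input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached input tokens are charged at the cache-read rate
    and are not also charged at the regular input rate.
    """
    counts = (uncached_input_tokens, output_tokens, cached_input_tokens)
    if any(n < 0 for n in counts):
        raise ValueError("token counts must be non-negative")
    total = (
        uncached_input_tokens * INPUT_PRICE
        + cached_input_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / PER_MILLION


# Example: 800K fresh input tokens, 200K cached input tokens, 10K output tokens.
cost = estimate_cost(800_000, 10_000, cached_input_tokens=200_000)
print(f"${cost:.4f}")
```

Under these assumptions the input side dominates the bill for long-context requests, so cache hits matter more than output length for typical short responses.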
Capabilities
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
40.3
Coding Index
56.2
GPQA Diamond
Graduate-level scientific reasoning
89.4%
Better than 94% of models compared
HLE
Humanity's Last Exam
32.1%
Better than 93% of models compared
IFBench
Instruction-following benchmark
79.2%
Better than 97% of models compared
T²-Bench Telecom
Conversational AI agents in dual-control scenarios
95.0%
Better than 93% of models compared
AA-LCR
Long context reasoning evaluation
63.0%
Better than 78% of models compared
SciCode
Python programming for scientific computing
44.9%
Better than 89% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
35.6%
Better than 83% of models compared
Last updated Jun 20, 2026
Artificial Analysis