DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with a 1M-token context window, built for fast inference, high-throughput workloads, reasoning, coding, and agent workflows.
Added Apr 24, 2026
Context Window
1.0M
Max Output
384.0K
Input Price (Auto)
$0.15/1M
Output Price (Auto)
$0.29/1M
Cache Read (Auto)
$0.029/1M
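The listed auto-routing rates are per million tokens, with cached input billed at the discounted cache-read rate. A minimal sketch of how a request cost works out under those rates (the helper name and example token counts are illustrative, not part of any official SDK):

```python
# Estimate request cost at the listed auto-routing rates (USD per 1M tokens).
PRICES = {
    "input": 0.15,        # $/1M uncached input tokens
    "output": 0.29,       # $/1M output tokens
    "cache_read": 0.029,  # $/1M cached input tokens read
}

def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Return estimated USD cost; cached_tokens is the cached subset of the input."""
    uncached = input_tokens - cached_tokens
    return (
        uncached * PRICES["input"]
        + output_tokens * PRICES["output"]
        + cached_tokens * PRICES["cache_read"]
    ) / 1_000_000

# Example: an 800K-token prompt with 600K tokens served from cache,
# producing a 20K-token completion.
print(f"${estimate_cost(800_000, 20_000, cached_tokens=600_000):.4f}")  # → $0.0532
```

With most of a long prompt cached, input cost is dominated by the uncached remainder: here 200K tokens at $0.15/1M plus 600K at $0.029/1M, rather than 800K at the full input rate.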
Capabilities
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
46.5
Coding Index
38.7
Agentic Index
61.3
GPQA Diamond
Graduate-level scientific reasoning
89.4%
Better than 96% of models compared
HLE
Humanity's Last Exam
32.1%
Better than 96% of models compared
IFBench
Instruction-following benchmark
79.2%
Better than 98% of models compared
τ²-Bench Telecom
Conversational AI agents in dual-control scenarios
95.0%
Better than 95% of models compared
AA-LCR
Long-context reasoning evaluation
63.0%
Better than 83% of models compared
GDPval-AA
Economically valuable tasks
44.4%
Better than 90% of models compared
CritPt
Research-level physics reasoning
7.1%
Better than 93% of models compared
SciCode
Python programming for scientific computing
44.9%
Better than 92% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
35.6%
Better than 87% of models compared
AA-Omniscience Accuracy
Proportion of correctly answered questions
37.2%
Better than 91% of models compared
AA-Omniscience Hallucination Rate
Share of non-correct responses answered incorrectly rather than declined (lower is better)
95.8%
Better than 4% of models compared
Last updated May 11, 2026