DeepSeek V4 Flash (Thinking)

deepseek/deepseek-v4-flash:thinking

BackTry Model

DeepSeek V4 Flash (Thinking)

deepseek/deepseek-v4-flash:thinking

BackTry Model

DeepSeek V4 Flash Thinking enables DeepSeek's reasoning mode on the efficiency-optimized Mixture-of-Experts model with a 1M-token context window, built for fast inference, high-throughput workloads, reasoning, coding, and agent workflows.

Added Apr 24, 2026

Model weights

Context Window

1.0M

Max Output

384.0K

Avg output tokens (7d)

1.3K tokens

47%

Input Price (Auto)

$0.094/1M

Output Price (Auto)

$0.19/1M

Cache Read (Auto)

$0.019/1M

Capabilities

Benchmarks

Performance metrics and benchmarks

Artificial Analysis

LMArena

Sourced from Artificial Analysis.

Intelligence Index

49.9

Providers

Choose explicit providers for this model. Auto routing remains available as the default option.

Loading provider options…

DeepSeek V4 Flash (Thinking)

DeepSeek V4 Flash (Thinking)

Benchmarks

Providers

Reasoning

Coding