Step-R1-V-Mini, which supports image and text input, text output, has good instruction following and general capabilities, can perceive images with high precision and complete complex reasoning tasks.
Context Window
128.0K
Max Output
65.5K
Input Price (Auto)
$2.50/1M
Output Price (Auto)
$11.00/1M
Performance metrics and benchmarks
Sourced from Artificial Analysis.
No benchmark data is available yet for this model.
Auto routing is available for this model. Explicit provider selection is not available.
Loading provider options…