Private AI
MiMo V2.5 Pro UltraSpeed is Xiaomi's speed-focused 1T-parameter MiMo V2.5 Pro mode, built for near-instant coding assistance, real-time chat, live edits, and low-latency agent loops. Xiaomi reports up to roughly 1,000 tokens per second using its TileRT serving stack, FP4 expert quantization, and DFlash speculative decoding.
Added Jun 10, 2026
Context Window
1.0M
Max Output
131.1K
Input Price (Auto)
$1.50/1M
Output Price (Auto)
$3.00/1M
Cache Read (Auto)
$0.12/1M
Capabilities
Performance metrics and benchmarks
No benchmark data is available yet for this model.
Auto routing is available for this model. Explicit provider selection is not available.
Loading provider options…