Fastest reasoning model in China, with up to 200 tokens per second. The stronger version of GLM Z1 Air.
Added Apr 15, 2025
Context Window
32.0K
Max Output
16.4K
Input Price (Auto)
$0.70/1M
Output Price (Auto)
$0.70/1M
Capabilities
Performance metrics and benchmarks
Sourced from Artificial Analysis.
No benchmark data is available yet for this model.
Auto routing is available for this model. Explicit provider selection is not available.
Loading provider options…