GLM 4.6V Flash

zai-org/glm-4.6v-flash-original

BackTry Model

GLM 4.6V Flash

zai-org/glm-4.6v-flash-original

BackTry Model

GLM-4.6V-Flash (9B), a lightweight model optimized for local deployment and low-latency applications. Scales context window to 128k tokens and achieves SoTA performance in visual understanding among similar-scale models.

Added Dec 8, 2025

Context Window

128.0K

Max Output

24.0K

Input Price (Auto)

$0.10/1M

Output Price (Auto)

$0.40/1M

Cache Read (Auto)

$0.050/1M

Capabilities

Benchmarks

Performance metrics and benchmarks

No benchmark data is available yet for this model.

Providers

Auto routing is available for this model. Explicit provider selection is not available.

Loading provider options…