GLM 4.7 Flash

zai-org/glm-4.7-flash


GLM-4.7-Flash is a lightweight 30B model optimized for coding and agentic tasks. It balances high performance with efficiency.

Added Jan 19, 2026

Context Window: 200.0K tokens
Max Output: 128.0K tokens
Avg output tokens (7d): 2.3K tokens

82%

Input Price (Auto): $0.073 per 1M tokens
Output Price (Auto): $0.42 per 1M tokens
Cache Read (Auto): $0.037 per 1M tokens
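
As a rough illustration of how the listed per-token prices translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the billing model are assumptions made for this example: it assumes that cached input tokens are billed at the cache-read rate instead of the input rate, which the listing does not state. The prices are the Auto-routing figures above and may differ for explicitly chosen providers.

```python
# Prices in USD per 1M tokens, copied from the listing (Auto routing).
INPUT_PRICE_PER_M = 0.073
OUTPUT_PRICE_PER_M = 0.42
CACHE_READ_PRICE_PER_M = 0.037


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: `cached_tokens` is the portion of `input_tokens` served
    from cache and billed at the cache-read rate instead of the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PRICE_PER_M
        + cached_tokens * CACHE_READ_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / 1_000_000


# A 10,000-token prompt with the listed 7-day average output of 2.3K tokens.
uncached = estimate_cost(10_000, 2_300)
# The same request with 8,000 of the prompt tokens read from cache.
cached = estimate_cost(10_000, 2_300, cached_tokens=8_000)
print(f"uncached: ${uncached:.6f}, cached: ${cached:.6f}")
```

Under these assumptions, output tokens account for most of the cost of a typical request, since the output price is several times the input price.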


Benchmarks

Performance metrics and benchmarks


Sourced from Artificial Analysis.

Intelligence Index: 15.5

Choose explicit providers for this model. Auto routing remains available as the default option.
