Z.ai: GLM 4.7

GLM 4.7 from Z.ai is a text-in, text-out model with a 202,752-token context window and a maximum completion length of 131,072 tokens. It supports tool calling and a reasoning mode, which makes it a candidate for multi-step agentic workflows. Structured output is listed as supported, but developers who depend on strict JSON schema adherence should still verify that behavior independently before committing. GLM 4.7 is priced at $0.40 per million input tokens and $1.75 per million output tokens, so output costs roughly 4.4 times as much as input per token: relatively affordable on the input side, mid-range on output. Its blended benchmark score of 67.1 is drawn from only two benchmarks, so that figure should be treated as preliminary rather than as a settled indicator of general ability. Teams running long-context or reasoning-heavy workloads on a moderate budget may find it worth testing, but buyers who need broad, well-validated performance data should wait for wider benchmark coverage.
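To make the per-token pricing concrete, the following is a minimal sketch of a cost estimate that uses only the two listed prices. The function name `estimate_cost` and the example token counts are hypothetical, and the sketch assumes flat per-token billing with no caching discounts, tiering, or separate charge for reasoning tokens, none of which the listing confirms.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from the listing above).
INPUT_PRICE_PER_M = Decimal("0.40")
OUTPUT_PRICE_PER_M = Decimal("1.75")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request under flat per-token pricing.

    Assumption: billing is strictly linear in token counts. Any discounts
    or surcharges a provider may apply are not modeled here.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # Hypothetical long-context request: 150,000 tokens in, 8,000 tokens out.
    print(f"${estimate_cost(150_000, 8_000)}")
```

Under these assumptions, a long prompt with a short answer is dominated by input cost, while a reasoning-heavy request that produces many output tokens shifts the balance toward the higher output rate.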

Quality Score: 98/100 (price + capability + benchmarks)
Input Price: $0.40 per 1M tokens
Output Price: $1.75 per 1M tokens
Context Window: 202,752 tokens
Model ID: z-ai/glm-4.7
Vendor: z-ai
Tokenizer: Other
Input Modalities: text
Output Modalities: text
Max Output: 131,072 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no
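The context window and maximum output figures interact when prompts are long. The sketch below shows one way to compute how many completion tokens remain for a given prompt size. It assumes that prompt and completion tokens share the 202,752-token window, which is a common convention but is not confirmed by this listing; the function name `max_completion_budget` is hypothetical.

```python
# Limits taken from the listing above.
CONTEXT_WINDOW = 202_752
MAX_OUTPUT = 131_072


def max_completion_budget(prompt_tokens: int) -> int:
    """Return the largest completion length that fits for a given prompt.

    Assumption: prompt and completion tokens count against the same
    context window. The result is also capped at the listed max output.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (0, 71_680, 100_000, 250_000):
        print(prompt, max_completion_budget(prompt))
```

Under this assumption, the full 131,072-token completion is only available while the prompt stays at or below 71,680 tokens; beyond that, the remaining window becomes the binding limit.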

Similar models