z-ai

Z.ai: GLM 4.6

GLM 4.6 from Z.ai is a text-in, text-out model with a 202,752-token context window and a maximum output of 131,072 tokens. It supports tool use and reasoning, which makes it usable for agentic workflows and multi-step tasks. Structured output support is unconfirmed, so teams that depend on guaranteed JSON schemas should verify that separately before committing. On the comparison side, GLM 4.6 carries a blended benchmark score of 37.2, though that figure comes from only one tracked benchmark, so treat it as a preliminary signal rather than a settled verdict. Pricing sits at $0.43 per million input tokens and $1.74 per million output tokens, which is competitive for a model with this context capacity. Buyers running high-volume, long-context jobs who can tolerate some benchmark uncertainty may find it worth testing, but those who need well-documented performance across diverse tasks should wait for broader evaluation coverage.

Quality Score
98/100
price + capability + benchmarks
Input Price
$0.43
per 1M tokens
Output Price
$1.74
per 1M tokens
Context Window
202,752
tokens
Model ID
z-ai/glm-4.6
Vendor
z-ai
Tokenizer
Other
Input Modalities
text
Output Modalities
text
Max Output
131,072 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
text only
Audio
no
Moderated
no

Similar models