z-ai

Z.ai: GLM 4.6

GLM 4.6 from Z.ai is a text-in, text-out model with a 202,752-token context window and a maximum output of 131,072 tokens. It supports tool use and reasoning, which makes it usable for agentic workflows and multi-step tasks. Structured output support is unconfirmed, so teams that depend on guaranteed JSON schemas should verify that separately before committing. On the comparison side, GLM 4.6 carries a blended benchmark score of 37.2, though that figure comes from only one tracked benchmark, so treat it as a preliminary signal rather than a settled verdict. Pricing sits at $0.43 per million input tokens and $1.74 per million output tokens, which is competitive for a model with this context capacity. Buyers running high-volume, long-context jobs who can tolerate some benchmark uncertainty may find it worth testing, but those who need well-documented performance across diverse tasks should wait for broader evaluation coverage.

Query via API → View on z-ai → Estimate cost

Quality Score

98/100

price + capability + benchmarks

Input Price

$0.43

per 1M tokens

Output Price

$1.74

per 1M tokens

Context Window

202,752

tokens

Model ID: z-ai/glm-4.6
Vendor: z-ai
Tokenizer: Other
Input Modalities: text
Output Modalities: text
Max Output: 131,072 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no

Similar models

z-ai

Z.ai: GLM 4.6

Similar models

Z.ai: GLM 4.7

Z.ai: GLM 5

Z.ai: GLM 4.7 Flash

Z.ai: GLM 5.2

Z.ai: GLM 5.1

Z.ai: GLM 5V Turbo