Z.ai: GLM 4.6
GLM 4.6 from Z.ai is a text-in, text-out model with a 202,752-token context window and a maximum output of 131,072 tokens. It supports tool calling and a reasoning mode, which makes it usable for agentic workflows and multi-step tasks. The specification list below marks structured output as supported, but teams that depend on guaranteed JSON schemas should still verify schema enforcement themselves before committing.

GLM 4.6 carries a blended benchmark score of 37.2. That figure comes from only one tracked benchmark, so treat it as a preliminary signal rather than a settled verdict. Pricing is $0.43 per million input tokens and $1.74 per million output tokens, which is competitive for a model with this context capacity. Buyers running high-volume, long-context jobs who can tolerate some benchmark uncertainty may find it worth testing; those who need well-documented performance across diverse tasks should wait for broader evaluation coverage.
- Model ID: z-ai/glm-4.6
- Vendor: z-ai
- Tokenizer: Other
- Input Modalities: text
- Output Modalities: text
- Max Output: 131,072 tokens
- Tool Calling: ✓ supported
- Structured Output: ✓ supported
- Reasoning Mode: ✓ supported
- Vision: text only
- Audio: no
- Moderated: no
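
For budgeting, the token limits and per-token prices quoted above can be turned into a quick estimate. The following is a minimal sketch, not an official calculator: the function names are hypothetical, it assumes input and output tokens share the 202,752-token context window, and it assumes billing is a simple linear function of the two quoted rates (any separate treatment of reasoning tokens, caching, or provider discounts is not modeled).

```python
# Figures taken from the description above.
CONTEXT_WINDOW = 202_752       # total tokens
MAX_OUTPUT = 131_072           # maximum output tokens
INPUT_PRICE_PER_M = 0.43       # USD per million input tokens
OUTPUT_PRICE_PER_M = 1.74      # USD per million output tokens


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the quoted limits.

    Assumption: input and output tokens count against the same
    context window. Confirm this with your provider.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output = output_tokens <= MAX_OUTPUT
    within_context = input_tokens + output_tokens <= CONTEXT_WINDOW
    return within_output and within_context


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (linear pricing assumed)."""
    input_cost = input_tokens * INPUT_PRICE_PER_M
    output_cost = output_tokens * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / 1_000_000


if __name__ == "__main__":
    # Hypothetical long-context job: 150,000 tokens in, 4,000 tokens out.
    print(fits_limits(150_000, 4_000))
    print(f"${estimate_cost(150_000, 4_000):.5f} per request")
```

Note that under the shared-window assumption, a request asking for the full 131,072 output tokens leaves at most 71,680 tokens for input.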