Qwen: Qwen3 Max Thinking
Qwen3 Max Thinking is a text-in, text-out model from Qwen that supports tool use and built-in reasoning, making it applicable to multi-step tasks, agentic workflows, and long-document work. Its 262,144-token context window is well above average, and it can return up to 32,768 completion tokens per response. Structured output support is unconfirmed based on available data. On cost, it sits at $0.78 per million input tokens and $3.90 per million output tokens, which is a moderate price for a reasoning-capable model. The notable gap here is benchmark coverage: there are no independent benchmark scores available yet, so performance claims rest entirely on vendor positioning rather than third-party validation. Teams with tolerance for that uncertainty and a need for long-context reasoning may find it worth trialing, but those requiring verified performance data before committing should wait for independent coverage to emerge.
- Model ID
- qwen/qwen3-max-thinking
- Vendor
- qwen
- Tokenizer
- Qwen
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- text only
- Audio
- no
- Moderated
- no