Mistral: Mistral Medium 3.1
Mistral Medium 3.1 is a multimodal model from Mistral AI that accepts text, images, and files as input, with a 131,072-token context window. It supports tool use, which makes it suitable for agentic workflows, but it does not include a built-in reasoning mode. Structured output support is unconfirmed based on available data. At $0.40 per million input tokens and $2.00 per million output tokens, it sits in the budget-to-mid range for multimodal models. Its blended benchmark score of 23.3 is drawn from only one independent benchmark, so performance claims should be treated as preliminary rather than well-established. Buyers who need multimodal input plus tool support at a relatively low input cost may find it worth testing, but those making high-stakes model selection decisions should wait for broader benchmark coverage before committing.
- Model ID
- mistralai/mistral-medium-3.1
- Vendor
- mistralai
- Tokenizer
- Mistral
- Input Modalities
- text, image, file
- Output Modalities
- text
- Max Output
- default
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no