Mistral: Mistral Medium 3
Mistral Medium 3 is a multimodal model from Mistral AI that accepts text, image, and file inputs alongside a 128K-token context window. It supports tool use, which makes it usable in agentic workflows, but it does not offer a native reasoning mode. Structured output support is unconfirmed based on available data. At $0.40 per million input tokens and $2.00 per million output tokens, this model sits in a budget-to-mid range tier, making it worth considering for teams that need multimodal input and tool calling without paying premium prices. Its blended benchmark score of 19.4 comes from a single benchmark, so performance comparisons should be treated as preliminary rather than well-established. Developers running high-volume pipelines with mixed file and image inputs may find the pricing attractive, but those prioritizing verified, broad benchmark coverage should weigh that limited data before committing.
- Model ID
- mistralai/mistral-medium-3
- Vendor
- mistralai
- Tokenizer
- Mistral
- Input Modalities
- text, image, file
- Output Modalities
- text
- Max Output
- default
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no