Mistral: Mistral Small 4
Mistral Small 4 is a text-and-image input model from Mistral AI with a 262,144-token context window. It supports tool use and reasoning, giving it coverage across agentic and multi-step workflows. Structured output support is unconfirmed, so developers who depend on guaranteed JSON schemas should verify that independently before committing. At $0.15 per million input tokens and $0.60 per million output tokens, the model sits in the budget-to-mid tier on price. Its blended benchmark score of 6.1 is drawn from a single independent benchmark, so the performance picture is thin and should be treated with caution. Teams running high-volume pipelines with image-plus-text inputs, or those needing long-context processing on a modest budget, have reasonable grounds to shortlist it, but anyone prioritizing well-validated capability should wait for broader benchmark coverage before relying on it for critical workloads.
- Model ID
- mistralai/mistral-small-2603
- Vendor
- mistralai
- Tokenizer
- Mistral
- Input Modalities
- text, image
- Output Modalities
- text
- Max Output
- default
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Category rankings
Where Mistral: Mistral Small 4 places across the 5 categories it ranks in. How we rank →
| # | Category | Score |
|---|---|---|
| #11 | Self-Hosted / LocalCost · of 25 ranked | 117 |
| #18 | Social Media PostsWriting · of 25 ranked | 119 |
| #18 | Voice Assistant BackendVoice · of 25 ranked | 123 |
| #18 | Cheap Bulk InferenceCost · of 25 ranked | 137 |
| #19 | Real-Time ChatLatency · of 25 ranked | 117 |