mistralai

Mistral: Mistral Small 3.2 24B

Mistral Small 3.2 24B is a multimodal model from Mistral AI that accepts both image and text inputs, supports tool use, and offers a 128,000-token context window with up to 16,384 tokens of output per completion. It does not include a dedicated reasoning mode, and structured output support is unconfirmed. The combination of vision capability and tool use makes it broadly applicable to agentic and document-processing workflows without requiring a larger, more expensive model. At $0.075 per million input tokens and $0.20 per million output tokens, it sits in the budget-to-midrange tier, making it worth considering for teams running high-volume inference who need image understanding alongside text. Benchmark coverage is thin, with a blended score of 59.6 drawn from a single benchmark, so performance claims should be treated as provisional rather than well-established. Teams that need wider benchmark validation before committing should treat it as a candidate to test rather than a settled choice.

Query via API → View on mistralai → Estimate cost

Quality Score

90/100

price + capability + benchmarks

Input Price

$0.07

per 1M tokens

Output Price

$0.20

per 1M tokens

Context Window

128,000

tokens

Model ID: mistralai/mistral-small-3.2-24b-instruct
Vendor: mistralai
Tokenizer: Mistral
Input Modalities: image, text
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: ✓ accepts images
Audio: no
Moderated: no

Similar models

mistralai

Mistral: Mistral Small 3.2 24B

Similar models

Mistral: Ministral 3 3B 2512

Mistral Large 2407

Mistral Large

Mistral: Mistral Medium 3.1

Mistral: Mistral Medium 3

Mistral: Mistral Nemo