OpenAI: GPT-5 Image Mini
GPT-5 Image Mini is a paid model from OpenAI that accepts text, images, and files as input. It offers a 400,000-token context window and up to 128,000 completion tokens, making it capable of processing long documents alongside visual content. The model supports reasoning but does not support tool use, and structured output support is unconfirmed. At $2.50 per million input tokens and $2.00 per million output tokens, it sits in a mid-range price tier. The absence of any independent benchmark coverage means its real-world performance relative to competitors is currently unproven, so buyers are relying on vendor claims rather than third-party evidence. It is worth shortlisting if your workflow centers on multimodal document analysis with long context requirements and you do not need tool-calling support, but teams that prioritize verified performance data should wait for benchmark coverage before committing.
- Model ID
- openai/gpt-5-image-mini
- Vendor
- openai
- Tokenizer
- GPT
- Input Modalities
- file, image, text
- Output Modalities
- image, text
- Max Output
- 128,000 tokens
- Tool Calling
- not supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- yes
Strong choice for
Category rankings
Where OpenAI: GPT-5 Image Mini places across the 1 category it ranks in. How we rank →
| # | Category | Score |
|---|---|---|
| #1 | Image GenerationVision · of 8 ranked | 112 |