openai

OpenAI: GPT-4o-mini

GPT-4o-mini is a multimodal model from OpenAI that accepts text, images, and files as input and returns text. It has a 128,000-token context window and can produce up to 16,384 tokens per response. Tool calling and structured output are both supported, which makes it suitable for agentic workflows, but it has no built-in reasoning mode. At $0.15 per million input tokens and $0.60 per million output tokens, it sits at the budget end of the multimodal market, and that price is its clearest selling point. Benchmark coverage is thin: the blended score of 4.4 comes from only two benchmarks, so capability claims should be treated as provisional. Teams running high-volume pipelines that need image and file handling at low cost are the most logical fit; teams whose primary concern is output quality should weigh those limited benchmark numbers carefully before committing.

Quality Score: 95/100 (price + capability + benchmarks)
Input Price: $0.15 per 1M tokens
Output Price: $0.60 per 1M tokens
Context Window: 128,000 tokens
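
The per-token prices listed above translate into a request cost with simple arithmetic. The following is a minimal sketch in Python; the function name `estimate_cost` and the constant names are hypothetical, and the calculation assumes flat per-token pricing with no caching, batch, or volume discounts.

```python
# Prices taken from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat pricing: no cached-input, batch, or volume discounts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 1,000-token reply.
    print(f"${estimate_cost(10_000, 1_000):.6f}")
```

Because output tokens cost four times as much as input tokens, pipelines that generate long responses will see output dominate the bill even when prompts are large.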

Model ID: openai/gpt-4o-mini
Vendor: openai
Tokenizer: GPT
Input Modalities: text, image, file
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: ✓ accepts images
Audio: no
Moderated: no
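
The context window and max output limits above interact when sizing a request. The sketch below shows one way to pick a safe output-token limit before calling the model. The helper name `plan_max_tokens` is hypothetical, and the code assumes that the 128,000-token context window is shared between the prompt and the response, which this listing does not state explicitly.

```python
# Limits taken from the listing above.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 16_384


def plan_max_tokens(prompt_tokens: int, requested_output: int) -> int:
    """Return an output-token limit that fits the listed limits (hypothetical helper).

    Assumption: prompt and response tokens share the same context window.
    """
    if prompt_tokens < 0 or requested_output <= 0:
        raise ValueError("prompt_tokens must be >= 0 and requested_output > 0")
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt leaves no room for a response")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The effective limit is the smallest of the three constraints.
    return min(requested_output, MAX_OUTPUT, remaining)


if __name__ == "__main__":
    print(plan_max_tokens(1_000, 20_000))     # capped by the max output limit
    print(plan_max_tokens(120_000, 16_384))   # capped by the remaining context
```

Prompt token counts would come from a tokenizer matching the model; counting them is outside the scope of this sketch.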

Similar models

OpenAI: GPT-4o-mini (2024-07-18)

OpenAI: GPT-5.2-Codex

OpenAI: GPT-5 Image Mini

OpenAI: GPT-5.5

OpenAI: GPT-5.1-Codex-Max

OpenAI: GPT-5.1-Codex