OpenAI: GPT-4.1
GPT-4.1 is a paid multimodal model from OpenAI that accepts text, images, and files as input. It supports tool use and offers a context window of roughly one million tokens, which is large enough to process book-length documents or long codebases in a single pass. It does not include a built-in reasoning mode, and structured output support is unconfirmed based on available data. At $2.00 per million input tokens and $8.00 per million output tokens, it sits in the mid-range of frontier model pricing. Its blended benchmark score of 53.9 is drawn from only three benchmarks, so treat that figure as a partial signal rather than a definitive ranking. Teams that need broad file and image handling alongside a very long context window will find it worth evaluating, but buyers who prioritize reasoning tasks or want extensive benchmark coverage before committing should compare it against better-documented alternatives first.
- Model ID
- openai/gpt-4.1
- Vendor
- openai
- Tokenizer
- GPT
- Input Modalities
- image, text, file
- Output Modalities
- text
- Max Output
- default
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Category rankings
Where OpenAI: GPT-4.1 places across the 4 categories it ranks in. How we rank →
| # | Category | Score |
|---|---|---|
| #10 | Transcript CleanupWriting · of 25 ranked | 143 |
| #23 | Meeting NotesBusiness · of 25 ranked | 144 |
| #25 | OCR / Document ParsingData · of 25 ranked | 138 |
| #25 | Table Extraction from PDFsData · of 25 ranked | 138 |