openai
OpenAI: GPT Audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...
Quality Score
84/100
composite of price, context, capability
Input Price
$2.50
per 1M tokens
Output Price
$10.00
per 1M tokens
Context Window
128,000
tokens
- Model ID
- openai/gpt-audio
- Vendor
- openai
- Tokenizer
- GPT
- Input Modalities
- text, audio
- Output Modalities
- text, audio
- Max Output
- 16,384 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- text only
- Audio
- ✓ accepts audio
- Moderated
- yes
Similar models
openai
OpenAI: GPT-4o Audio
$2.50 in / $10.00 out
128,000 ctx
84
openai
OpenAI: GPT-5.4 Pro
$30.00 in / $180.00 out
1,050,000 ctx
85
openai
OpenAI: GPT-5.2 Pro
$21.00 in / $168.00 out
400,000 ctx
85
openai
OpenAI: o3 Deep Research
$10.00 in / $40.00 out
200,000 ctx
85
openai
OpenAI: GPT-5 Pro
$15.00 in / $120.00 out
400,000 ctx
85
openai
OpenAI: o3 Pro
$20.00 in / $80.00 out
200,000 ctx
85