openai

OpenAI: GPT Audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

Quality Score
84/100
composite of price, context, capability
Input Price
$2.50
per 1M tokens
Output Price
$10.00
per 1M tokens
Context Window
128,000
tokens
Model ID
openai/gpt-audio
Vendor
openai
Tokenizer
GPT
Input Modalities
text, audio
Output Modalities
text, audio
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
text only
Audio
✓ accepts audio
Moderated
yes

Similar models