Back to Models
OpenAI: GPT Audio
openai/gpt-audioJan 19, 2026128K context16.4K max output$2.50/M in · $10.00/M out
Description
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.
Specifications
Provider
openai
Context Length
128K
Max Output
16.4K
Modality
Intextaudio
Outtextaudio
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $2.50 |
| Output | $10.00 |
| Audio Input | $32.00 |
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "openai/gpt-audio",
"messages": [{"role": "user", "content": "Hello!"}]
}'