OpenAI: GPT Audio

openai/gpt-audio
Jan 19, 2026 · 128K context · 16.4K max output · $2.50/M in · $10.00/M out

Description

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural-sounding voices and maintains better voice consistency. Audio tokens are priced at $32 per million for input and $64 per million for output.

Specifications

Provider: openai
Context Length: 128K
Max Output: 16.4K
Modality
  In: text, audio
  Out: text, audio

Pricing

Type            Price / 1M tokens
Input           $2.50
Output          $10.00
Audio Input     $32.00
Audio Output    $64.00
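
As a rough illustration of how these rates combine, the sketch below estimates the cost of a single request. It uses the text rates from the table and the audio rates from the description ($32 in, $64 out). It assumes that text and audio tokens are billed separately and simply added together, which this page does not state explicitly. The function name `estimate_cost` and the token counts are hypothetical. How audio duration maps to tokens is not covered here.

```shell
#!/bin/sh
# Rates in USD per 1M tokens, copied from the pricing listed on this page.
TEXT_IN_RATE=2.50
TEXT_OUT_RATE=10.00
AUDIO_IN_RATE=32.00
AUDIO_OUT_RATE=64.00

# estimate_cost TEXT_IN TEXT_OUT AUDIO_IN AUDIO_OUT   (all token counts)
# Prints the estimated cost in USD with four decimal places.
# Assumption: the four token classes are billed independently and summed.
estimate_cost() {
  awk -v t_in="$1" -v t_out="$2" -v a_in="$3" -v a_out="$4" \
      -v r_ti="$TEXT_IN_RATE" -v r_to="$TEXT_OUT_RATE" \
      -v r_ai="$AUDIO_IN_RATE" -v r_ao="$AUDIO_OUT_RATE" \
      'BEGIN {
         total = t_in * r_ti + t_out * r_to + a_in * r_ai + a_out * r_ao
         printf "%.4f\n", total / 1000000
       }'
}

# Example: 1,000 text input, 500 text output,
# 2,000 audio input, and 4,000 audio output tokens.
estimate_cost 1000 500 2000 4000   # prints 0.3275
```

In this example the audio output dominates the total (about $0.256 of $0.3275), so audio tokens are the figure to watch when budgeting.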

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-audio",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
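
The request above returns text only. Since the model also lists audio as an output modality, a request for spoken output might look like the sketch below. The `modalities` and `audio` fields follow OpenAI's Chat Completions audio parameters. Whether this endpoint forwards them unchanged is an assumption, as are the voice name `alloy`, the environment variable `OMG_API_KEY`, and the helper `build_payload`.

```shell
#!/bin/sh
# Build a request body that asks for both text and audio output.
# Assumption: the endpoint accepts OpenAI's "modalities" and "audio" fields.
# Note: the prompt is inserted verbatim, so it must not contain double quotes.
build_payload() {
  cat <<EOF
{
  "model": "openai/gpt-audio",
  "modalities": ["text", "audio"],
  "audio": {"voice": "alloy", "format": "wav"},
  "messages": [{"role": "user", "content": "$1"}]
}
EOF
}

# Send the request only when OMG_API_KEY (hypothetical variable name) is set.
if [ -n "${OMG_API_KEY:-}" ]; then
  build_payload "Say hello in a friendly voice." |
    curl -s https://api.ominigate.ai/v1/chat/completions \
      -H "Authorization: Bearer $OMG_API_KEY" \
      -H "Content-Type: application/json" \
      -d @- \
      -o response.json
fi
```

In OpenAI's own API the generated audio is returned base64-encoded inside the response message, so `response.json` would need a decoding step before playback. The exact response shape from this endpoint should be checked against its documentation.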