Back to Models

OpenAI: GPT-4o Audio

openai/gpt-4o-audio-preview
Aug 15, 2025128K context16.4K max output$2.50/M in · $10.00/M out

Description

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

Specifications

Provider
openai
Context Length
128K
Max Output
16.4K
Modality
Inaudiotext
Outtextaudio

Pricing

TypePrice / 1M tokens
Input$2.50
Output$10.00
Audio Input$40.00

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-4o-audio-preview",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
OpenAI: GPT-4o Audio — Pricing & API on OminiGate