gpt-realtime

gpt-realtime

from OpenAI’s latest real-time voice conversation API
2025-08-31
Audio-Video Processing
Input:
$32/1M charactersstarting from
Output:
$64/1M charactersstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

GPT-realtime is an advanced speech model released by OpenAI, which can be used in AI voice call applications. Its features include: the ability to understand complex instructions and precisely invoke tools; generating natural, fluent, and expressive speech, with two new voices—Marin and Cedar—alongside upgrades to the existing eight voices; capturing non-verbal cues like laughter; seamlessly switching languages mid-sentence; and flexibly adjusting tone based on the context.


Note:

To stream audio output in real time, please use a wss connection; online debugging is not supported

Source code: https://github.com/302ai/openai-realtime-console

API Console

Log in to explore more features! Click to Log In

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Realtime
POST
Stable
View Details

API Pricing

$
ModelDescriptionOfficial Price302.AI Price

gpt-realtime

Real-time Voice Conversation

Input$32 / 1M characters
Output$64 / 1M characters

Input$32/ 1M characters
Output$64/ 1M characters
Original Price

gpt-realtime-2025-08-28

Real-time Voice Conversation

Input$32 / 1M characters
Output$64 / 1M characters

Input$32/ 1M characters
Output$64/ 1M characters
Original Price