gpt-4o-transcribe-diarize

gpt-4o-transcribe-diarize

A speech-to-text model from OpenAI
2025-10-31
Audio-Video Processing
Model capability: audio
Input:
$3/1M tokens
Output:
$5/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

GPT-4o-Transcribe-Diarize is a speech-to-text model launched by OpenAI, featuring built-in speaker separation capabilities. The model can automatically identify and label distinct speakers, generating transcripts that include speaker tags and timestamps, making it ideal for transcribing multi-person conversations such as interviews, meetings, and discussion recordings.

API Console

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Transcriptions(Speech to Text)
POST
Stable
View Details

API Pricing

$
ModelDescriptionOfficial Price302.AI Price

gpt-4o-transcribe-diarize

-

Input$3 / 1M tokens
Output$5 / 1M tokens

Input$3/ 1M tokens
Output$5/ 1M tokens
Original Price