gpt-4o-transcribe

A speech-to-text model from OpenAI
2025-10-31
Audio-Video Processing
Model capability: audio
Input: $6/1M tokens
Output: $10/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

gpt-4o-transcribe is a speech-to-text model from OpenAI, built on the GPT-4o architecture and optimized for speech recognition. It performs well on multilingual, accented, and noisy audio, and it lowers the word error rate (WER), the share of words transcribed incorrectly, most noticeably in English and other major languages. Typical applications include meeting transcription, customer service, and media captioning.
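
Since the overview leans on word error rate, here is a minimal sketch of how WER is commonly computed: the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The function name and the lowercase/whitespace normalization are assumptions made for illustration; this page does not describe how the quoted WER improvements were measured.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    # Assumed normalization: lowercase and split on whitespace.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)
```

For example, a hypothesis that drops one word from a six-word reference scores 1/6, and an identical transcript scores 0.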

API Reference (1)

API Description: Transcriptions (Speech to Text)
Request Method: POST
Stability: Stable
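
The Transcriptions entry is a POST request, so a call can be sketched as a multipart/form-data upload. This page does not show the endpoint, so the sketch assumes an OpenAI-compatible path (`/audio/transcriptions`), form fields named `model` and `file`, a bearer-token header, and a JSON response with a `text` field. `BASE_URL` is a hypothetical placeholder; take the actual endpoint and parameters from the provider's API details before relying on any of this.

```python
import json
import os
import urllib.request
import uuid

# Hypothetical placeholder: replace with the endpoint from the provider's docs.
BASE_URL = "https://api.example.com/v1"
MODEL = "gpt-4o-transcribe"


def build_transcription_request(audio_bytes, filename, api_key, base_url=BASE_URL):
    """Build (but do not send) a multipart/form-data POST request."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{MODEL}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/audio/transcriptions",  # assumed OpenAI-compatible path
        data=head + audio_bytes + tail,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def transcribe(path, api_key, base_url=BASE_URL):
    """Send an audio file and return the transcript text, if present."""
    with open(path, "rb") as audio_file:
        request = build_transcription_request(
            audio_file.read(), os.path.basename(path), api_key, base_url
        )
    with urllib.request.urlopen(request) as response:
        # Assumes a JSON body shaped like {"text": "..."}.
        return json.load(response).get("text")
```

Calling `transcribe("meeting.wav", api_key)` would upload the file and return the transcript. Building the request separately from sending it makes the payload easy to inspect without network access.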

API Pricing

Model: gpt-4o-transcribe
Description: -
Official Price: Input $6 / 1M tokens, Output $10 / 1M tokens
302.AI Price: Input $6 / 1M tokens, Output $10 / 1M tokens (original price)
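
As a rough illustration of the per-token rates, the sketch below estimates the cost of a job from its token counts. The function name and the token counts in the usage note are hypothetical; this page does not say how audio duration maps to input tokens, so the counts should come from the usage figures returned or reported by the provider.

```python
# Rates listed on this page, in USD per one million tokens.
INPUT_USD_PER_MILLION = 6.0
OUTPUT_USD_PER_MILLION = 10.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the charge for a job from its token counts."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000
```

With a hypothetical 500,000 input tokens and 20,000 output tokens, the estimate is $3.00 + $0.20 = $3.20.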