
whisper-1
starting from
API Overview
Convert audio to the input language or translate audio into English.
API Console
Log in to explore more features! Click to Log In
API Analytics
API Reference (2)
| API Description | API Endpoint | Request Method | Stability | Parameter Description |
|---|---|---|---|---|
Transcriptions(Speech to Text whisper-1) | POST | Stable | View Details | |
Document Details OpenAI Guide Transcribe audio into the input language. The Transcription API accepts the audio file you want to transcribe as input, along with your desired output file format for the transcription. We currently support multiple input and output file formats. Price: 0.006 PTC / min Request Parameters Header ParametersAcceptstringRequired Example Value: application/jsonAuthorizationstringOptional Example Value: Bearer {{YOUR_API_KEY}}Request Body multipart/form-datafilestringRequired The audio file to be transcribed modelenum<string>Required The ID of the model to use. Enum Value: whisper-1gpt-4o-transcribegpt-4o-mini-transcribegpt-4o-transcribe-diarizeExample Value: whisper-1promptstringOptional Optional text to guide the model’s style or continue from a previous audio segment. Prompting guide should match the language of the audio. response_formatenum<string>Optional The format of the transcript output. Options: json. Enum Value: jsontextsrtverbose_jsonvttdiarized_jsonExample Value: jsontemperaturenumberOptional Sampling temperature between 0 and 1. Higher values (e.g., 0.8) make the output more random, while lower values (e.g., 0.2) make it more focused and deterministic. If set to 0, the model automatically increases temperature using log probability until a certain threshold is reached. languagestringOptional The language of the input audio. Providing the input language in ISO-639-1 format will improve accuracy and latency. | ||||
Translations(Speech to English Text whisper-1) | POST | Stable | View Details | |
Document Details OpenAI Guide Translate audio into English. The Translation API accepts audio files in any supported language as input and, if necessary, transcribes the audio into English. This differs from our /Transcriptions endpoint because the output is not in the original input language but is instead translated into English text. Price: 0.006 PTC / min Request Parameters Header ParametersContent-TypestringRequired Example Value: multipart/form-dataAcceptstringRequired Example Value: application/jsonAuthorizationstringOptional Example Value: Bearer {{YOUR_API_KEY}}Request Body multipart/form-datafilestringRequired The audio file to be translated. Use one of the following formats: mp3, mp4, mpeg, mpga, m4a, wav or webm. modelstringRequired The ID of the model to use. Only Example Value: whisper-1promptstringOptional Optional text; to guide the model’s style or continue a previous audio segment. Prompt should be in English. response_formatstringOptional The format of the transcript output. Choose from: json, text, srt, verbose_json, or vtt. Example Value: jsontemperaturenumberOptional The sampling temperature ranges from 0 to 1. Higher values (e.g., 0.8) increase randomness in the output, while lower values (e.g., 0.2) make it more focused and deterministic. If set to 0, the model uses log probability to automatically adjust the temperature until specific thresholds are reached. | ||||
API Pricing
| Model | Description | 302.AI Price |
|---|
Transcriptions | Speech to Text
whisper-1 |
|
Translations | Speech to English Text
whisper-1 |
|