Recognize (Rapid Audio File Recognition)

Speech-to-Text (speech recognition) API from Doubao.
2025-08-01
Audio-Video Processing
Pricing:
$0.015/min
Bulk order? Contact your manager for exclusive deals

API Overview

This interface is designed for fast recognition of audio files. It uses large-model capabilities to improve recognition accuracy and shorten response times. A single request returns the recognition result directly, so there is no need for a separate submit step followed by query polling.
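The single-request flow described above can be sketched as follows. This is only an illustration: this page confirms the POST method but does not list the endpoint URL, the authentication scheme, or the request and response fields. The endpoint, the Bearer header, and the payload are therefore placeholders supplied by the caller and should be checked against the parameter documentation.

```python
import json
import urllib.request


def build_recognize_request(endpoint: str, api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) one POST request carrying the whole job.

    `endpoint`, the Bearer auth header, and the shape of `payload` are
    assumptions; none of them is specified on this page.
    """
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        method="POST",  # the only detail confirmed by the API reference
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )


def recognize(endpoint: str, api_key: str, payload: dict, timeout: float = 600.0) -> dict:
    """Send the request and return the parsed JSON response.

    One blocking call returns the result; there is no task ID to poll.
    The response is assumed to be JSON.
    """
    request = build_recognize_request(endpoint, api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A generous timeout is used because long or multichannel audio may take a while to process within that single call.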

Usage Limits

Project Limitations

Audio Duration: Not exceeding 2 hours

Audio Size: Not exceeding 100 MB

Audio Encoding: Supports PCM / WAV / MP3 / OGG OPUS

Uploaded File Binary Stream: Maximum size limited to 20 MB, depending on the client's outbound bandwidth

Multichannel Audio: Compared to mono audio, processing time will increase accordingly

Price: 0.015 PTC/minute
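The limits above can be checked on the client before sending a request. The sketch below is a minimal pre-flight check under stated assumptions: 1 MB is taken as 1024 * 1024 bytes (the page does not say), the file extensions are a rough stand-in for the supported encodings, and the function name and parameters are hypothetical helpers, not part of the API.

```python
import os

MAX_DURATION_SECONDS = 2 * 60 * 60        # audio duration: not exceeding 2 hours
MAX_AUDIO_BYTES = 100 * 1024 * 1024       # audio size: not exceeding 100 MB (assumes MiB)
MAX_UPLOAD_BYTES = 20 * 1024 * 1024       # binary-stream upload: at most 20 MB (assumes MiB)
# Assumed mapping from file extension to the supported encodings.
SUPPORTED_EXTENSIONS = {".pcm", ".wav", ".mp3", ".ogg", ".opus"}


def check_audio_limits(filename: str, size_bytes: int,
                       duration_seconds: float, as_binary_upload: bool) -> list:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {extension or '(none)'}")
    if duration_seconds > MAX_DURATION_SECONDS:
        problems.append("audio is longer than 2 hours")
    if size_bytes > MAX_AUDIO_BYTES:
        problems.append("audio is larger than 100 MB")
    if as_binary_upload and size_bytes > MAX_UPLOAD_BYTES:
        problems.append("binary-stream upload is larger than 20 MB")
    return problems
```

For example, `check_audio_limits("call.wav", 5_000_000, 300.0, True)` returns an empty list, while a 30 MB file sent as a binary stream is flagged for the 20 MB upload limit only.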

API Reference (1)

API Description: Recognize (Rapid Audio File Recognition)

Request Method: POST

Stability: Stable

API Pricing

Model: Recognize

Description: Rapid Audio File Recognition

302.AI Price: $0.015/min
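As a rough guide, the per-minute price can be turned into a cost estimate for a given audio length. The sketch below assumes billing is prorated by the second with no rounding or minimum charge. The page does not state how partial minutes are billed, so treat the result as an approximation. The function name is a hypothetical helper.

```python
from decimal import Decimal

PRICE_PER_MINUTE = Decimal("0.015")  # listed price: $0.015/min


def estimate_cost(duration_seconds) -> Decimal:
    """Estimate the recognition cost in USD for audio of the given length.

    Assumes linear, unrounded billing (an assumption, not a documented rule).
    """
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return minutes * PRICE_PER_MINUTE
```

Under this assumption, a 10-minute file costs about $0.15 and a file at the 2-hour limit about $1.80.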