Category

API

App

Audio-Video Processing
Date
Price

music-2.5+

2.5 upgraded version, supports pure music generation
API
Audio-Video Processing
Pricing:
$0.15/call

speech-2.8-hd

High-performance text-to-speech model launched by MiniMax
API
Audio-Video Processing
Pricing:
$52.5/1M characters

starting from

speech-2.8-turbo

High-performance text-to-speech model launched by MiniMax
API
Audio-Video Processing
Pricing:
$30/1M characters

starting from

music-2.5

MiniMax’s flagship AI music generation model delivers both high-fidelity sound quality and studio-level control.
API
Audio-Video Processing
Pricing:
$0.15/call

GLM-ASR-2512

Zhipu's next-generation speech recognition model supports real-time conversion of speech into high-quality text.
API
Audio-Video Processing
Pricing:
$0.025/M tokens

GLM-TTS

GLM Speech Synthesis Model Combining Large Language Models and Diffusion Model Technologies
API
Audio-Video Processing
Pricing:
$0.03/1000 characters

Sound-Generation

Convert text descriptions into high-quality audio effects, with precise control over timing, style, and complexity.
API
Audio-Video Processing
Pricing:
$0.06/call

Audio-Isolation

Isolate speech from background noise, music, and ambient sounds from any audio
API
Audio-Video Processing
Pricing:
$0.3/minute

speech-2.6-hd

The latest voice HD model from Minimax
API
Audio-Video Processing
Pricing:
$52.5/1M characters

starting from

mureka-7.5

Mureka is an audio processing service launched by Kunlun Tech
API
Audio-Video Processing
Pricing:
$0.05/call

starting from

gpt-realtime-mini

from OpenAI’s latest real-time voice conversation API
API
Audio-Video Processing
Input:
$10/1M characters
Output:
$20/1M characters