music-2.5

music-2.5

MiniMax’s flagship AI music generation model delivers both high-fidelity sound quality and studio-level control.
2026-01-30
Audio-Video Processing
Pricing:
$0.15/call
Bulk order? Contact your manager for exclusive deals
稳定性
Stable

API Overview

Music 2.5 is MiniMax’s flagship AI music generation model, positioned as a “full-dimensional intelligent composition engine that combines high-fidelity sound quality with studio-level control,” empowering creators to truly take absolute command over their sound.

  • Full-Dimensional Breakthrough: Upgrades have been made across four key dimensions—vocal performance, arrangement and mixing, structural precision, and sound design—ensuring every nuance of the listening experience is exactly as you envision it.
  • Strong Creative Control: Use prompts to define style, mood, or scene, and input lyrics to quickly generate unique background music and theme songs tailored for videos, games, or apps.
  • Precise Structural Arrangement: Over 14 musical structure tags—including Intro, Bridge, Hook, and more—are available, enabling precise control over turning points from intro to chorus.
  • Realistic Vocal Texture: Deep optimization minimizes synthetic traces, delicately recreating breaths, vibrato, and professional vocal placement, significantly enhancing flow and delivering a realism that feels “almost like breathing.”
  • Style-Specific Physical Restoration: Automatically identifies and restores acoustic characteristics specific to certain genres, such as the Minneapolis sound of the 1980s, modern electronic wide-band transients, and the warm low-pass warmth of classic jazz.

───────────────────────────────────────────────────────────────────

Core Capabilities

🎛️ Studio-Level Arrangement Control:

  • Expanded high-sampling-rate sound library (including orchestral and traditional Chinese instruments), with each instrument having a clear “sense of position” in the frequency spectrum.
  • Optimized sound-field algorithms ensure extreme clarity even in complex arrangements, with vocals and accompaniment frequencies remaining completely separate.

🎤 True-to-Life Vocal Performance:

  • Introduces human-like voice simulation, delivering stronger emotional dynamics and saying goodbye to mechanical singing.
  • Supports fine-grained adjustments to emotional intensity and singing techniques segment by segment, enabling nuanced narrative expression.

🎨 Smart Style Filters:

  • The system automatically matches style features such as rock distortion, Funk grooves, and Lo-fi suction effects, instantly enhancing genre recognition.
  • Sound design is deeply integrated with musical genres, ensuring authentic representation from EDM to Contemporary R&B.

🎬 Professional-Grade Scene Delivery:

  • Film scoring: Precisely matches the emotional ups and downs of the visual scenes, injecting narrative tension.
  • Game audio: Builds immersive, 3A-level dynamic soundscapes, supporting high-sampling-rate instrumental performances.
  • Commercial release: Delivers studio-quality finished products ready for direct market launch, meeting the stringent standards of the pop industry.

───────────────────────────────────────────────────────────────────

Demonstration of Results

  • An expanded high-sampling-rate sound library ensures that each instrument has a clear “sense of position” in the frequency spectrum.


API Console

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Music Generation
POST
Stable
View Details

API Pricing

$
ModelDescription302.AI Price

Music-Generation

Music Generation

$0.15/call