PixVerse v5.5

PixVerse v5.5

Native audio and Multi-shot video generation
2025-12-01
Video Generation
Pricing:
$0.1/call

starting from

Bulk order? Contact your manager for exclusive deals

API Overview

Basic Information

PixVerse V5.5 is an AI video generation model launched by Aishitech in China on December 1, 2025. The Chinese version is named “Pai Wo AI V5.5,” and its official website is https://pai.video. It supports generating videos of 5-second, 8-second, and 10-second durations. In V5Fast mode, a 1080p video can be produced in about 30 seconds. This is the first AI video model in China that enables one-click direct output of videos featuring “shot-by-shot planning + audio,” marking the transition of AI video from “shot generation” to the practical stage of “complete storytelling.”

Core Features

Native Audio Generation: Describe the sound you want in your prompt, and the model will generate matching audio:

  • Voice-overs & Narration
  • Ambient Sounds
  • Diverse Sound Effects
  • Atmospheric Background Music

Multi-Shot Storytelling: Generate a coherent set of shots based on your prompts, ensuring character consistency. Suitable for:

  • Over-the-shoulder dialogue shots
  • Quick close-ups for emotional shifts
  • Seamless scene transitions
  • Dynamic action scene changes
  • Quick shots for plot twists

Technical Highlights

It adopts a self-developed hybrid architecture combining Diffusion and Transformer models, enhancing both generation speed and quality. It features multi-modal understanding capabilities, enabling it to interpret vague prompts and construct narrative logic. It achieves integrated end-to-end production, allowing users to complete the entire workflow—from image generation to video editing and publishing—in one stop.

Market Impact

It significantly lowers the barrier to video creation, enabling beginners without professional skills to produce finished videos. It boosts creative efficiency, shortening workflow time by 80%, and perfectly matches the demand for the “golden three-second opening” in short videos. It expands application scenarios and has already been used in advertising, social entertainment, and film & TV auxiliary creation, driving AI video to become a scalable content production tool.

720p does not support selecting both fast mode and 8s/10s duration simultaneously; 1080p does not support selecting fast mode.
The price multiplier is dynamically calculated based on the request parameters, and the multipliers for multiple parameters are multiplied together. For example: The price for a 720p 8s video is 0.1 × 2 × 2 = 0.4.

API Console

Log in to explore more features! Click to Log In

API Reference (4)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Generate(Text-to-Video Generation)
POST
Stable
View Details
Generate(Image-to-Video Generation)
POST
Stable
View Details
Generate(Multi-Subject Reference)
POST
Stable
View Details
Fetch
GET
Stable
View Details

API Pricing

$
ModelDescription302.AI Price

Generate(quality-360p-5s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.1/call

Generate(quality-540p-5s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.15/call

Generate(quality-720p-5s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.2/call

Generate(quality-1080p-5s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.4/call

Generate(quality-360p-8s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.2/call

Generate(quality-540p-8s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.3/call

Generate(quality-720p-8s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.4/call

Generate(quality-1080p-8s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.8/call

Generate(quality-360p-10s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.2/call

Generate(quality-540p-10s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.3/call

Generate(quality-720p-10s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.4/call

Generate(quality-1080p-10s-mode-normal)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.8/call

Generate(quality-360p-5s-mode-fast)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.2/call

Generate(quality-540p-5s-mode-fast)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.3/call

Generate(quality-720p-5s-mode-fast)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.4/call

Generate(quality-360p-8s-mode-fast)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.4/call

Generate(quality-540p-8s-mode-fast)

Dynamically calculate the price based on request parameters, multiplying the multipliers of multiple parameters.

$0.6/call

Fetch

Result

Free