wan2.6-r2v

The reference-to-video (R2V) feature of the wan2.6 series preserves the appearance and voice of the people or objects in a reference video, and supports using multiple reference videos in a single generation.
2025-12-17
Video Generation
Pricing: starting from $0.1/second

API Overview

Tongyi Wanxiang 2.6 introduces a “Video Role-Playing” capability: a reference video of just 5 seconds is enough to create a custom digital avatar that can be reused in any new storyline.

  • Exclusive Feature: The first in China to support “Video Role-Playing”. Upload a reference video and have your character perform in new scenes while keeping a consistent appearance and voice.
  • Upgrade Highlights: Compared with the previous version, Wan 2.5, the maximum video duration increases from 10 seconds to 15 seconds, and intelligent scene segmentation is added, with support for cinematic shot types such as wide shots and close-ups.
  • Competitive Edge: Compared with OpenAI Sora 2, it offers free trials and open APIs, making role-playing more flexible and better suited to Chinese-language creative scenarios.
  • Image Quality Performance: Outputs high-definition video at 1080P/24fps, with more realistic human portrait textures that noticeably reduce the “AI feel”, and lighting and shadows that meet professional aesthetic standards.
  • Applicable Scenarios: Covers short films, commercial ads, and virtual character interactions. Text input alone is enough to generate a coherent narrative video.

───────────────────────────────────────────────────────────────────

Core Capabilities

🎭 Character Identity Cloning: Upload a 5-second video of a real person, animation, or animal, and the AI precisely extracts facial features, expressions, and vocal characteristics.

🔗 Cross-Scene Character Consistency: In newly generated multi-shot short videos, the character’s appearance, lip-sync, and tone stay highly consistent, avoiding a “face-swapped” look.

🎭 Dual-Character Dialogue Generation: Set up two characters at the same time and generate stable, synchronized two-character dialogue scenes.

🎤 Audio-Driven Performance: Input any voice, and the AI automatically drives the character’s lip-sync, expressions, and body movements, creating personalized IP interactions.

📽️ 15-Second Cinematic Storytelling + Full Commercial Licensing: Suitable for uses ranging from short dramas and educational content to brand endorsements.

───────────────────────────────────────────────────────────────────

Effect Demonstrations

Reference Video 1, Reference Video 2, and the generated result (videos not included).

API Reference

API Description                    Request Method    Stability
R2V (Reference-Generated Video)    POST              Stable
Tasks (Get Task Results)           GET               Stable
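
The two APIs listed above suggest an asynchronous flow: a POST starts the R2V job and a GET on the Tasks API retrieves the result. The sketch below illustrates that submit-then-poll pattern only. This page does not show the endpoint paths, request fields, or status values, so `BASE_URL`, `SUBMIT_PATH`, `TASK_PATH`, the JSON field names, and the `"succeeded"`/`"failed"` statuses are all hypothetical placeholders to be replaced with the values from the parameter documentation.

```python
import json
import time
import urllib.request

# Hypothetical values: the real host, paths, field names, and status strings
# are not shown on this page. Replace them with those from the API docs.
BASE_URL = "https://api.example.com"      # placeholder host
SUBMIT_PATH = "/r2v/submit"               # placeholder path for the POST call
TASK_PATH = "/r2v/tasks/{task_id}"        # placeholder path for the GET call


def build_submit_request(api_key, prompt, reference_video_urls):
    """Build the POST request that would start an R2V job."""
    body = json.dumps({
        "model": "wan2.6-r2v",
        "prompt": prompt,                               # assumed field name
        "reference_video_urls": reference_video_urls,   # assumed field name
    }).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + SUBMIT_PATH,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def build_task_request(api_key, task_id):
    """Build the GET request that would fetch a task's result."""
    return urllib.request.Request(
        BASE_URL + TASK_PATH.format(task_id=task_id),
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def send_json(request):
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


def wait_for_result(api_key, task_id, send=send_json, interval=5.0, max_polls=60):
    """Poll the task endpoint until it reports a terminal status."""
    for _ in range(max_polls):
        result = send(build_task_request(api_key, task_id))
        # "succeeded" and "failed" are assumed status values.
        if result.get("status") in ("succeeded", "failed"):
            return result
        time.sleep(interval)
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

Passing the `send` function as a parameter keeps the polling loop testable without network access; in real use the default `send_json` performs the HTTP call.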

API Pricing

Model                          Description           302.AI Price
wan2.6-r2v (Video-to-video)    720p                  $0.1/second
wan2.6-r2v (Video-to-video)    1080p                 $0.15/second
Tasks                          Fetch Task Results    Free
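
As a rough illustration of the per-second pricing above, the following sketch estimates the charge for one generated video. It assumes billing is strictly linear in the output duration with no rounding or minimum charge, which this page does not confirm; `estimate_cost` is a hypothetical helper, not part of the API.

```python
from decimal import Decimal

# Per-second prices in USD, copied from the pricing listed above.
PRICE_PER_SECOND = {
    "720p": Decimal("0.10"),
    "1080p": Decimal("0.15"),
}


def estimate_cost(duration_seconds, resolution):
    """Return the estimated charge in USD for one generated video.

    Assumes price = per-second rate * duration, with no rounding rules.
    """
    if resolution not in PRICE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    # Decimal avoids binary floating-point drift in currency arithmetic.
    return PRICE_PER_SECOND[resolution] * Decimal(str(duration_seconds))


print(estimate_cost(15, "1080p"))  # 2.25
print(estimate_cost(10, "720p"))   # 1.00
```

Under these assumptions, a full 15-second clip at 1080p comes to $2.25, and fetching the task result adds nothing because the Tasks API is free.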