Kling V3 (Image-to-Video)


Kling flagship image-to-video model
2026-02-06
Video Generation
Pricing: starting from $0.084/second

Bulk order? Contact your manager for exclusive deals
Stability: Stable

API Overview

Supports synchronized audio generation (sound=True)


Kling V3.0 Image-to-Video is Kuaishou's product line for generating videos from images, offered in two versions: Standard and Pro. Exposed through an API, it transforms static images into high-quality videos with cinematic camera movements, natural subject motion, and optional synchronized sound effects.

  • Dual-version strategy: The Pro version delivers top-tier visual fidelity, smoother physics simulation, and stronger adherence to prompt instructions; the Standard version costs significantly less while maintaining high availability.
  • Core capabilities: Both versions accept a starting image (image) and, optionally, an ending image (end_image) for controllable transitions. They also support video lengths of 5 or 10 seconds and multiple aspect ratios (16:9 / 9:16 / 1:1).
  • Audio integration: Both versions can enable sound to generate ambient audio that matches the visuals; the Pro version additionally supports up to 2 custom voice entries (voice_list) for character dialogue.
  • Applicable scenarios: Photo animation, product demonstration videos, animated social media covers, extended ad creatives, and story-driven content with sound effects.
  • Cost comparison: At the prices listed on this page, the Pro version costs about one-third more than the Standard version ($0.112 vs. $0.084 per second without sound; $0.168 vs. $0.126 with sound). Choose based on your quality requirements.
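
The options described in the list above could be assembled into a request body along the following lines. This is a minimal sketch, not the documented schema: only image, end_image, sound, and voice_list are named on this page. The prompt, duration, and aspect_ratio field names, the pro flag, and the shape of each voice entry are assumptions made for illustration.

```python
from typing import Any, Dict, List, Optional

# Limits taken from the overview: 5 or 10 seconds, three aspect ratios,
# and at most 2 custom voice entries (Pro only).
ALLOWED_DURATIONS = {5, 10}
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16", "1:1"}
MAX_VOICES = 2


def build_payload(
    image: str,
    prompt: str,
    *,
    pro: bool = False,  # hypothetical flag: True when targeting Kling v3-pro
    end_image: Optional[str] = None,
    duration: int = 5,  # assumed field name
    aspect_ratio: str = "16:9",  # assumed field name
    sound: bool = False,
    voice_list: Optional[List[Dict[str, Any]]] = None,
) -> Dict[str, Any]:
    """Validate the options locally and return a JSON-serializable dict."""
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {sorted(ALLOWED_DURATIONS)}")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio}")
    if voice_list:
        if not pro:
            raise ValueError("voice_list is only described for the Pro version")
        if len(voice_list) > MAX_VOICES:
            raise ValueError(f"at most {MAX_VOICES} voice entries are supported")

    payload: Dict[str, Any] = {
        "image": image,
        "prompt": prompt,
        "duration": duration,
        "aspect_ratio": aspect_ratio,
        "sound": sound,
    }
    # Optional fields are included only when the caller provides them.
    if end_image is not None:
        payload["end_image"] = end_image
    if voice_list:
        payload["voice_list"] = list(voice_list)
    return payload
```

Validating locally before submitting avoids paying for a round trip on a request that the limits above already rule out. Check the actual field names against the parameter documentation before relying on this shape.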

───────────────────────────────────────────────────────────────────

Core Capabilities

🖼️→🎥 Single-image-driven animation

With just one reference image and a text description, you can generate a coherent video, dramatically lowering the barrier to video production.

🔄 Start-and-end frame guidance (optional)

By supplying an end image, you can define the final state of the video, enabling semantic transitions such as "flower blooming → flower wilting" or "stillness → running."

🔊 Synchronized audio and video generation

When sound is enabled, ambient audio is automatically added; in the Pro version, you can also overlay custom voices to create composite audio featuring "character dialogue + background sound."

🎬 Cinematic motion performance (Pro)

The Pro version excels in subject consistency, light and shadow changes, and smooth camera movements, making it ideal for delivering finished products.

🚫 Precise content control

Negative prompts allow you to exclude unwanted elements such as "blur" or "distortion," enhancing the stability of generated results.

───────────────────────────────────────────────────────────────────

API Reference (3)

API            Request Method   Stability
Kling v3-std   POST             Stable
Kling v3-pro   POST             Stable
Fetch          GET              Stable
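
Two POST generation APIs plus a GET Fetch API suggest a submit-then-poll workflow. The sketch below covers only the polling half, under that assumption. The fetch callable, the "status" field, and its "succeeded" and "failed" values are hypothetical, since this page gives neither endpoints nor response schemas. The HTTP call is injected so the loop can be exercised without network access.

```python
import time
from typing import Any, Callable, Dict


def wait_for_task(
    fetch: Callable[[str], Dict[str, Any]],  # hypothetical wrapper around the GET Fetch API
    task_id: str,
    *,
    interval: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch(task_id) until the task finishes or attempts run out."""
    for attempt in range(max_attempts):
        result = fetch(task_id)
        status = result.get("status")  # assumed field name
        if status == "succeeded":  # assumed terminal value
            return result
        if status == "failed":  # assumed terminal value
            raise RuntimeError(f"task {task_id} failed: {result}")
        # Any other status is treated as "still running".
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError(f"task {task_id} not finished after {max_attempts} attempts")
```

In real use, fetch would wrap an authenticated GET request to the Fetch API and return the parsed JSON body. Since Fetch is listed as free, polling at a modest interval should add no cost.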

API Pricing

Model          Description             302.AI Price
Kling v3-std   No synchronized sound   $0.084/second
Kling v3-std   Synchronized sound      $0.126/second
Kling v3-pro   No synchronized sound   $0.112/second
Kling v3-pro   Synchronized sound      $0.168/second
Fetch          Fetch task              Free
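
To budget a job, the per-second prices listed above can be turned into a small estimator. This sketch assumes billing is linear in the requested video length, which the page implies but does not state. The model keys and the function name are illustrative, not API identifiers.

```python
from decimal import Decimal

# Per-second prices in USD, copied from the pricing list on this page.
# Keys are (model, sound_enabled); the key strings are illustrative labels.
PRICE_PER_SECOND = {
    ("v3-std", False): Decimal("0.084"),
    ("v3-std", True): Decimal("0.126"),
    ("v3-pro", False): Decimal("0.112"),
    ("v3-pro", True): Decimal("0.168"),
}


def estimate_cost(model: str, seconds: int, sound: bool = False) -> Decimal:
    """Estimate the price in USD, assuming linear per-second billing."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    try:
        rate = PRICE_PER_SECOND[(model, sound)]
    except KeyError:
        raise ValueError(f"unknown model: {model}") from None
    # Decimal keeps the arithmetic exact for currency amounts.
    return rate * seconds
```

For example, a 10-second Pro clip with sound comes to 10 × $0.168 = $1.68, and a 5-second Standard clip without sound comes to 5 × $0.084 = $0.42.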