qwen-image-edit-plus-2025-12-15

qwen-image-edit-plus-2025-12-15

The Tongyi Qianwen series Image Editing Plus model further optimizes inference performance and system stability based on the initial Edit model.
2026-01-05
Image Processing
Pricing:
$0.03/image
Bulk order? Contact your manager for exclusive deals
稳定性
Stable

API Overview

Qwen-Image-Edit-Plus is an advanced image-editing large model launched by Alibaba’s Tongyi Lab, with a core focus on “high-fidelity text reconstruction + multi-dimensional visual control.” Compared to the basic version, it achieves a qualitative leap in precise text manipulation within images, multi-image logical fusion, and maintaining subject consistency, thanks to its native dual-encoder architecture.

  • Industry-leading text editing: It solves the long-standing challenge of “text distortion” in the AI image generation field. Supporting both Chinese and English, it can accurately identify the original font, size, color, and material texture, enabling seamless text replacement or in-place addition.
  • Multi-image reference (Multi-Image) inference: It can simultaneously understand and integrate 1–3 reference images. For example, it can logically combine the facial features from Image A, specific clothing from Image B, and the background environment from Image C, while automatically handling lighting and occlusion relationships.
  • Extreme identity consistency: When performing pose transformations, filter switching, or stylization (such as converting to 3D or anime style), it perfectly preserves facial features or detailed appearance of specific products, ensuring that the “subject remains unchanged after editing.”
  • Semantic and pixel-level dual control: It not only understands semantic instructions like “replace apples with oranges,” but also integrates structured inputs such as Canny edges and depth maps to impose rigorous pixel-level constraints on composition.
  • Light-and-shadow adaptive reconstruction: When changing backgrounds or compositing objects, the model automatically analyzes the global illumination of the new environment, re-rendering shadows and highlights for the subject to eliminate the awkwardness of “cheap-looking effects.”

───────────────────────────────────────────────────────────────────

Core Capabilities (9 Major Editing Matrices)

👁️ Semantic and visual dual-mode editing: It allows both fine-tuning of local elements (such as adding a cup) and global reconstruction (such as rotating an object by 180° or transforming an IP character).

✍️ Seamless bilingual text rendering: It precisely modifies Chinese and English text in posters and signs, perfectly matching the background lighting and artistic font effects.

👤 Human identity consistency: When changing poses or styles (such as switching to anime style), it accurately locks facial features, solving the “alienation” problem often seen after AI face-swapping.

🎨 Commercial product poster creation: It can instantly transform plain product images into atmospheric commercial masterpieces, maintaining 100% product fidelity.

🖼️ Multi-image logical fusion: It seamlessly combines elements from different images across time and space, automatically handling light, shadow, and occlusion to achieve natural group photos or scene replacements.

📐 Native structural control: By leveraging Canny edges or keypoint skeletal points, it precisely defines human movements and architectural structures.

🌈 Environment and light-and-shadow reconstruction: It intelligently analyzes the background and automatically adjusts the subject’s lighting, such as transforming a midday scene into a neon-lit rainy night, with simultaneous updates to light-and-shadow details.

🕰️ Smart restoration of old photos: It enhances textures and completes details for creased, faded, and low-resolution images, bringing historical moments back to life.

🛠️ Intelligent agent-level visual positioning: It boasts strong spatial awareness and supports targeted editing of specific areas via natural language or coordinate boxes.

───────────────────────────────────────────────────────────────────



API Console

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Qwen-Image-Edit-Plus(Aliyun)
POST
Stable
View Details

API Pricing

$
ModelDescription302.AI Price

qwen-image-edit-plus-2025-12-15

-

$0.03/image