qwen-image-max

qwen-image-max

Alibaba’s flagship image-generation model, featuring top-tier realism and sophisticated text rendering capabilities.
2026-01-05
Image Generations
Pricing:
$0.08/piece
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen-Image-Max is Alibaba’s flagship image-generation API product, primarily positioned as a general-purpose image-generation model with top-tier realism and sophisticated text-rendering capabilities. While maintaining natural compositions, it can accurately handle text at the paragraph level in both Chinese and English, enabling professional-grade mixed-text-and-image layouts.

  • Breakthrough in Realism: Compared to the Plus series, the Max version significantly enhances image realism and naturalness, effectively eliminating any traces of AI synthesis. Particularly in rendering textures of human subjects, skin details, and hair strands, it achieves a “zero-AI feel,” rivaling the quality of photos taken by professional photographers.
  • Exclusive Text Rendering: It boasts industry-leading multi-line text-generation capabilities for both Chinese and English, supporting complex typesetting designs (such as posters and PPT pages). It can precisely render details like horizontal banners and couplets, with font styles that are elegant and natural, addressing the common pain points of blurry or distorted text in traditional AI-generated images.
  • Leading Benchmark Performance: It has achieved SOTA (state-of-the-art) results in multiple authoritative benchmarks, including GenEval and DPG. In blind tests conducted by AI Arena, its performance even surpassed some closed-source commercial models, placing it at the top of the open-source model rankings.
  • Professional Scene Coverage: It perfectly adapts to high-demand scenarios such as e-commerce poster design, complex infographic creation, realistic portrait generation, and the production of PPTs containing large amounts of text.

───────────────────────────────────────────────────────────────────

Core Capabilities

📸 Ultimate Realism: It excels in delivering image quality with a “zero-AI feel”. Details on human faces, fine wrinkles around the eyes, hair strand direction, and natural light and shadow reflections are reproduced with exceptional accuracy, producing visual effects comparable to those of camera-shot photographs.

✍️ High-Fidelity Text Rendering: It uniquely excels at complex text layouts, supporting paragraph-level mixing of Chinese and English texts. It precisely controls font styles and positions, easily generating posters or PPTs with detailed copy.

🎨 Multi-Style Mastery: It covers dozens of styles, including realistic, anime, ink-wash, cyberpunk, and more. Whether it’s Ghibli-style animation or surreal scenes, it can accurately follow instructions and produce outputs that match the desired style.

Fine-Grained Detail Control: It features fine-grained detail-rendering capabilities, accurately understanding and presenting deep semantic nuances such as “the direction of hair strands blown by the wind” or “the texture of blue-and-white porcelain.” Its compositions are elegant and dignified.

───────────────────────────────────────────────────────────────────

Effect Demonstrations

API Console

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Qwen-Image-Max
POST
Stable
View Details

API Pricing

$
ModelDescription302.AI Price

Qwen-Image-Max

Qwen-Image-Max

$0.08/piece