Category

API

App

Tongyi Qianwen
Date
Price

qwen3.6-flash

The Qwen3.6 native vision-language series Flash model delivers significantly improved performance compared to the 3.5-Flash model.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.19/1M tokensstarting from
Output:
$1.13/1M tokensstarting from

qwen3.6-35b-a3b

The Qwen3.6 series 35B-A3B native vision-language model, designed based on a hybrid architecture, integrates linear attention mechanisms with sparse mixture-of-experts models.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.28/1M tokens
Output:
$1.7/1M tokens

qwen3.6-plus

Alibaba’s Qwen3.6 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.3/1M tokensstarting from
Output:
$1.8/1M tokensstarting from

qwen3.5-122b-a10b

Alibaba’s Qwen3.5 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.12/1M tokensstarting from
Output:
$0.92/1M tokensstarting from

qwen3.5-27b

Alibaba’s Qwen3.5 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.09/1M tokensstarting from
Output:
$0.69/1M tokensstarting from

qwen3.5-35b-a3b

Alibaba’s Qwen3.5 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.06/1M tokensstarting from
Output:
$0.46/1M tokensstarting from

qwen3.5-flash

Alibaba’s Qwen3.5 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.03/1M tokensstarting from
Output:
$0.29/1M tokensstarting from

qwen3.5-397b-a17b

Alibaba’s Qwen3.5 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.171/1M tokensstarting from
Output:
$1.03/1M tokensstarting from

qwen3.5-plus

Alibaba’s Qwen3.5 series supports multimodal capabilities.
Model
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.12/1M tokensstarting from
Output:
$0.69/1M tokensstarting from

qwen3-max-2026-01-23

Alibaba’s Qwen3 Max series models feature adaptive tool invocation and scalability during testing.
Model
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.36/1M tokensstarting from
Output:
$1.43/1M tokensstarting from

qwen3-vl-plus-2025-12-19

The Qwen3 series visual understanding models achieve an effective integration of thinking and non-thinking modes.
Model
LLM
Model capability: image
Input:
$0.143/1M tokensstarting from
Output:
$1.43/1M tokensstarting from

qwen-mt-lite

A foundational-level text translation large model comprehensively upgraded based on Qwen3
Model
LLM
Input:
$0.086/1M tokens
Output:
$0.23/1M tokens

qwen-mt-flash

Lightweight large-scale text translation model fully upgraded based on Qwen3
Model
LLM
Input:
$0.1/1M tokens
Output:
$0.28/1M tokens

qwen3-vl-32b-instruct

The non-inference version of the largest Dense model in the Qwen3-VL series, whose overall performance is second only to Qwen3-VL-235B-Instruct.
Model
LLM
Model capability: imageModel capability: function_call
Input:
$0.29/1M tokens
Output:
$1.143/1M tokens

qwen3-vl-32b-thinking

The inference version of the largest Dense model in the Qwen3-VL series, with multimodal reasoning capabilities second only to Qwen3-VL-235B-Thinking.
Model
LLM
Model capability: imageModel capability: function_call
Input:
$0.29/1M tokens
Output:
$2.86/1M tokens

qwen3-vl-flash-2025-10-15

The Qwen3 series of small-scale visual understanding models achieve an effective integration of thinking and non-thinking modes, with fast response speeds.
Model
LLM
Model capability: image
Input:
$0.022/1M tokensstarting from
Output:
$0.22/1M tokensstarting from

qwen3-vl-flash

The Qwen3 series of small-scale visual understanding models achieve an effective integration of thinking and non-thinking modes, with fast response speeds.
Model
LLM
Model capability: image
Input:
$0.022/1M tokensstarting from
Output:
$0.22/1M tokensstarting from