qwen3.6-flash

qwen3.6-flash

The Qwen3.6 native vision-language series Flash model delivers significantly improved performance compared to the 3.5-Flash model.
2026-04-17
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.19/1M tokensstarting from
Output:
$1.13/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3.6-Flash is a lightweight model in the Tongyi Qwen series that emphasizes “instant response” and “ultra-high energy efficiency.” While maintaining the Qwen family’s outstanding semantic understanding and instruction-following capabilities, it achieves inference latency as low as milliseconds through an extremely optimized model-lightweight design.───────────────────────────────────────────────────────────────────Core CapabilitiesUltra-fast Inference Speed: Specifically designed for high-frequency invocation scenarios, this model delivers an ultra-fast experience—generating content in seconds—thanks to its streamlined model architecture and efficient design, making it perfectly suited for real-time chat and streaming interactions.Ultra-high Concurrent Throughput: With its lightweight size, Qwen3.6-Flash can support far more concurrent requests per unit of computing power than models of similar caliber, significantly lowering the operational threshold for large-scale AI applications.High-Level Instruction Following: Despite being the “Flash” version, it still inherits the powerful foundational logical capabilities of the Qwen series, demonstrating remarkable accuracy in tasks such as structured data extraction, content summarization, and rapid question answering.Flexible Application Integration: Thanks to its low resource consumption, Qwen3.6-Flash can seamlessly integrate into various API workflows, mobile applications, and edge devices, meeting diverse product deployment requirements.

Playground

Log in to explore more features! Click to Log In

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
qwen3.6-flash
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen3.6-flash

Input <= 256k
1000000

Input$0.17 / 1M tokens
Output$1.03 / 1M tokens

Input$0.19/ 1M tokens
Output$1.13/ 1M tokens
10%

qwen3.6-flash

Input 256K-1M
1000000

Input$0.69 / 1M tokens
Output$4.11 / 1M tokens

Input$0.76/ 1M tokens
Output$4.52/ 1M tokens
10%