qwen3.6-35b-a3b

qwen3.6-35b-a3b

The Qwen3.6 series 35B-A3B native vision-language model, designed based on a hybrid architecture, integrates linear attention mechanisms with sparse mixture-of-experts models.
2026-04-17
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.28/1M tokens
Output:
$1.7/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3.6-35B-A3B is a standout member of the Tongyi Qwen series that introduces the MoE (Mixture of Experts) architecture. By deeply decoupling its 35 billion parameters (35B), it dynamically activates only 3 billion parameters (3B) when handling complex reasoning tasks. This architectural design enables it to achieve cognitive depth and logical sophistication nearly on par with large models (35B), while maintaining the inference speed and ultra-low latency typical of tiny models (3B). ───────────────────────────────────────────────────────────────────

Core Capabilities

Ultimate “Large-Model Intelligence, Small-Model Power Consumption”: The 35B model capacity endows it with robust knowledge reserves and sophisticated logical parsing capabilities, while the 3B activation mechanism ensures flexibility and lightning-fast performance during runtime. Dynamic Computing Resource Allocation: The MoE architecture can autonomously select and activate different “expert neurons” based on the complexity of the input, enabling precise adaptation and efficient processing across various tasks—including code generation, creative content creation, and logical reasoning. Outstanding Engineering Adaptability: Compared to traditional dense models, 35B-A3B boasts significant advantages in terms of memory usage and inference costs, allowing it to run efficiently in large-scale production environments with lower hardware requirements and maximizing cost-effectiveness. Deep Semantic and Programming DNA: Inheriting the top-tier strengths of the Qwen3.6 series in multilingual understanding, code generation, and tool invocation, this model is particularly well-suited for tasks that demand deep logical reasoning and structured programming.

Playground

Log in to explore more features! Click to Log In

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
qwen3.6-35b-a3b
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen3.6-35b-a3b

Input <= 256k
256000

Input$0.26 / 1M tokens
Output$1.55 / 1M tokens

Input$0.28/ 1M tokens
Output$1.7/ 1M tokens
10%