qwen3-coder-480b-a35b-instruct

qwen3-coder-480b-a35b-instruct

Alibaba has launched a MoE encoding model with 480 billion parameters, specifically designed for complex software engineering tasks.
2025-07-23
LLM
Model capability: function_call
Input:
$0.86/1M tokensstarting from
Output:
$3.43/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-Coder-480B-A35B-Instruct is a MoE coding model with 480 billion parameters launched by Alibaba. Its core mission is to serve as the world’s most powerful open-source agentic coding foundation, natively supporting a context length of 256K (expandable up to 1M), and specifically designed for complex software engineering tasks.

  • Top-tier performance: It has set new records among open-source models in 12 benchmarks, including SWE-bench Verified (69.6%) and WebArena (49.9%), achieving overall performance comparable to Claude Sonnet 4.
  • Ultra-long context understanding: Natively supports a context length of 256K tokens, and can be scaled up to 1 million tokens via YaRN technology, effortlessly handling full repository code analysis.
  • Dual-mode switching: It features a unique “thinking mode” and “non-thinking mode,” dynamically balancing response speed (sub-second) with deep reasoning (multi-round planning).
  • Full-stack toolchain: Comes with the open-source command-line tool Qwen Code (compatible with OpenAI SDK) and seamlessly integrates with mainstream IDEs such as Claude Code and Cline.
  • Exceptional cost-effectiveness: Through FP8 quantization and MoE dynamic activation (only 35B parameters are computed in real time), inference costs are reduced by 60%, with API call prices as low as 0.8 yuan per million tokens.

───────────────────────────────────────────────────────────────────

Core Capabilities

🤖 Agent-based programming: It autonomously handles the entire workflow—from requirement analysis and code generation to tool invocation and debugging—with a measured success rate of 69.6% in fixing GitHub issues after 500 rounds of interaction.

🌐 Cross-language expertise: Deeply proficient in over 20 languages including Python, Java, and C++, ranking first among open-source models in the Aider-Polyglot multilingual programming test with a score of 61.8.

⚡ MoE high-speed engine: With 480 billion parameters, it efficiently processes diverse tasks by activating only 35 billion parameters during inference, perfectly decoupling high performance from low latency (RTX 4090 response time <500ms).

🔧 Zero-friction ecosystem: Natively compatible with tools like VS Code and PyCharm; with the qwen! command, you can launch intelligent programming with one click. It supports private deployment and enterprise-level security compliance.

───────────────────────────────────────────────────────────────────

Test Data

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Tongyi Qianwen)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen3-coder-480b-a35b-instruct

0<Token≤32K
256000

Input$0.86 / 1M tokens
Output$3.43 / 1M tokens

Input$0.86/ 1M tokens
Output$3.43/ 1M tokens
Original Price

qwen3-coder-480b-a35b-instruct

32K<Token≤128K
256000

Input$1.29 / 1M tokens
Output$5.15 / 1M tokens

Input$1.29/ 1M tokens
Output$5.15/ 1M tokens
Original Price

qwen3-coder-480b-a35b-instruct

128K<Token≤200K
256000

Input$2.15 / 1M tokens
Output$8.58 / 1M tokens

Input$2.15/ 1M tokens
Output$8.58/ 1M tokens
Original Price