qwen/qwen3-coder-480b-a35b-instruct

qwen/qwen3-coder-480b-a35b-instruct

Alibaba has launched a MoE encoding model with 480 billion parameters, specifically designed for complex software engineering tasks.
2025-07-23
LLM
Model capability: function_call
Input:
$2.14/1M tokens
Output:
$2.14/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-Coder-480B-A35B-Instruct is a MoE coding model with 480 billion parameters launched by Alibaba. Its core mission is to serve as the world’s most powerful open-source agentic coding foundation, natively supporting a context length of 256K (expandable up to 1M), and specifically designed for complex software engineering tasks.

  • Top-tier performance: It has set new records among open-source models in 12 benchmarks, including SWE-bench Verified (69.6%) and WebArena (49.9%), achieving overall performance comparable to Claude Sonnet 4.
  • Ultra-long context understanding: Natively supports a context length of 256K tokens, and can be scaled up to 1 million tokens via YaRN technology, effortlessly handling full repository code analysis.
  • Dual-mode switching: It features a unique “thinking mode” and “non-thinking mode,” dynamically balancing response speed (sub-second) with deep reasoning (multi-round planning).
  • Full-stack toolchain: Comes with the open-source command-line tool Qwen Code (compatible with OpenAI SDK) and seamlessly integrates with mainstream IDEs such as Claude Code and Cline.
  • Exceptional cost-effectiveness: Through FP8 quantization and MoE dynamic activation (only 35B parameters are computed in real-time), inference costs are reduced by 60%, with API call prices as low as 0.8 yuan per million tokens.

───────────────────────────────────────────────────────────────────

Core Capabilities

🤖 Agent-based programming: It autonomously handles the entire workflow—from requirement analysis and code generation to tool invocation and debugging—with a measured success rate of 69.6% in fixing GitHub issues after 500 rounds of interaction.

🌐 Cross-language expertise: Deeply proficient in over 20 languages including Python, Java, and C++, ranking first among open-source models in the Aider-Polyglot multilingual programming test with a score of 61.8.

⚡ MoE high-speed engine: With 480 billion parameters, it efficiently processes vast amounts of data while activating only 35 billion parameters during inference, perfectly decoupling high performance from low latency (RTX 4090 response time <500ms).

🔧 Zero-friction ecosystem: Natively compatible with tools like VS Code and PyCharm; with the qwen! command, you can launch intelligent programming with one click. It supports private deployment and enterprise-level security compliance.

───────────────────────────────────────────────────────────────────

Test Data

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(PPIO)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen/qwen3-coder-480b-a35b-instruct

-
262144

Input$2.14 / 1M tokens
Output$2.14 / 1M tokens

Input$2.14/ 1M tokens
Output$2.14/ 1M tokens
Original Price