
qwen/qwen3-coder-480b-a35b-instruct
API Overview
Qwen3-Coder-480B-A35B-Instruct is a MoE coding model with 480 billion parameters launched by Alibaba. Its core mission is to serve as the world’s most powerful open-source agentic coding foundation, natively supporting a context length of 256K (expandable up to 1M), and specifically designed for complex software engineering tasks.
- Top-tier performance: It has set new records among open-source models in 12 benchmarks, including SWE-bench Verified (69.6%) and WebArena (49.9%), achieving overall performance comparable to Claude Sonnet 4.
- Ultra-long context understanding: Natively supports a context length of 256K tokens, and can be scaled up to 1 million tokens via YaRN technology, effortlessly handling full repository code analysis.
- Dual-mode switching: It features a unique “thinking mode” and “non-thinking mode,” dynamically balancing response speed (sub-second) with deep reasoning (multi-round planning).
- Full-stack toolchain: Comes with the open-source command-line tool Qwen Code (compatible with OpenAI SDK) and seamlessly integrates with mainstream IDEs such as Claude Code and Cline.
- Exceptional cost-effectiveness: Through FP8 quantization and MoE dynamic activation (only 35B parameters are computed in real-time), inference costs are reduced by 60%, with API call prices as low as 0.8 yuan per million tokens.
───────────────────────────────────────────────────────────────────
Core Capabilities
🤖 Agent-based programming: It autonomously handles the entire workflow—from requirement analysis and code generation to tool invocation and debugging—with a measured success rate of 69.6% in fixing GitHub issues after 500 rounds of interaction.
🌐 Cross-language expertise: Deeply proficient in over 20 languages including Python, Java, and C++, ranking first among open-source models in the Aider-Polyglot multilingual programming test with a score of 61.8.
⚡ MoE high-speed engine: With 480 billion parameters, it efficiently processes vast amounts of data while activating only 35 billion parameters during inference, perfectly decoupling high performance from low latency (RTX 4090 response time <500ms).
🔧 Zero-friction ecosystem: Natively compatible with tools like VS Code and PyCharm; with the qwen! command, you can launch intelligent programming with one click. It supports private deployment and enterprise-level security compliance.
───────────────────────────────────────────────────────────────────
Test Data

Playground
Log in to explore more features! Click to Log In