
Qwen/Qwen3-Next-80B-A3B-Instruct
API Overview
Qwen3-Next-80B-A3B-Instruct is the next-generation foundational model released by Alibaba's Tongyi Qianwen team. Built on the brand-new Qwen3-Next architecture, it is designed to deliver unparalleled training and inference efficiency. The model features an innovative hybrid attention mechanism (Gated DeltaNet and Gated Attention), a highly sparse Mixture-of-Experts (MoE) structure, and several optimizations for enhanced training stability. As a sparse model with a total of 80 billion parameters, it activates only about 3 billion parameters during inference, significantly reducing computational costs. Moreover, when handling long-context tasks involving more than 32K tokens, its inference throughput is over 10 times higher than that of the Qwen3-32B model. This model is an instruction-tuned version specifically tailored for general-purpose tasks and does not support the Thinking chain mode. In terms of performance, it matches the flagship Qwen3-235B model in certain benchmark tests, particularly excelling in ultra-long-context tasks.
Playground
Log in to explore more features! Click to Log In