Qwen/Qwen3-Next-80B-A3B-Instruct

Qwen/Qwen3-Next-80B-A3B-Instruct

Silicon-based, flow-deployed Qwen3-Next-80B-A3B-Instruct
2025-09-10
LLM
Model capability: function_call
Input:
$0.143/1M tokens
Output:
$0.572/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-Next-80B-A3B-Instruct is the next-generation foundational model released by Alibaba's Tongyi Qianwen team. Built on the brand-new Qwen3-Next architecture, it is designed to deliver unparalleled training and inference efficiency. The model features an innovative hybrid attention mechanism (Gated DeltaNet and Gated Attention), a highly sparse Mixture-of-Experts (MoE) structure, and several optimizations for enhanced training stability. As a sparse model with a total of 80 billion parameters, it activates only about 3 billion parameters during inference, significantly reducing computational costs. Moreover, when handling long-context tasks involving more than 32K tokens, its inference throughput is over 10 times higher than that of the Qwen3-32B model. This model is an instruction-tuned version specifically tailored for general-purpose tasks and does not support the Thinking chain mode. In terms of performance, it matches the flagship Qwen3-235B model in certain benchmark tests, particularly excelling in ultra-long-context tasks.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

Qwen/Qwen3-Next-80B-A3B-Instruct

-
256000

Input$0.143 / 1M tokens
Output$0.572 / 1M tokens

Input$0.143/ 1M tokens
Output$0.572/ 1M tokens
Original Price