sophnet/Qwen3-Next-80B-A3B-Instruct

sophnet/Qwen3-Next-80B-A3B-Instruct

Next-generation high-efficiency Mixture of Experts (MoE) instruction fine-tuning model
2025-09-24
LLM
Input:
$0.143/1M tokens
Output:
$0.5715/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-Next-80B-A3B-Instruct is a next-generation efficient Mixture of Experts (MoE) instruction fine-tuning model launched by Alibaba Tongyi Lab, with its core positioning as the mainstay of general large models featuring "high cost-effectiveness, strong instruction following, and enterprise-level usability".

  • Compact MoE Architecture: With a total of 80B parameters and only 3B (A3B) active parameters, it significantly reduces inference costs and latency while maintaining strong language capabilities.
  • Exceptional Instruction Following Ability: After fine-grained alignment training, it performs stably and reliably in complex instruction tasks such as multi-turn conversations, format constraints, and style control.
  • 128K Ultra-Long Context Support: Capable of handling long documents, multi-turn historical conversations, or composite task inputs, with a high recall rate for key information.
  • Multilingual and Code Optimization: Deeply enhances Chinese context understanding, while also supporting mainstream languages such as English, Japanese, and French, as well as programming languages such as Python and JavaScript.

───────────────────────────────────────────────────────────────────

Core Capabilities

High Throughput and Low Latency Response: With fewer active parameters and low memory footprint, a single GPU card can support high-concurrency enterprise-level applications, such as customer service, content generation, and intelligent work.

🧠 Precise Task Execution: Can accurately understand fine-grained instructions such as "summarize in a table", "write in the style of a government official document", and "list advantages and disadvantages point by point".

🌍 Natural Bilingual Expression in Chinese and English: Output is idiomatic and fluent, taking into account both professional terminology and cultural context, suitable for Internationalization business and Localization scenarios.

🛡️ Security Compliance and Controllability: Supports content filtering, sensitive word interception, and audit logs, meeting the deployment requirements of industries such as finance, government affairs, and education.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SophNet)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

sophnet/Qwen3-Next-80B-A3B- Instruct

-
128000

Input$0.143 / 1M tokens
Output$0.5715 / 1M tokens

Input$0.143/ 1M tokens
Output$0.5715/ 1M tokens
Original Price