Qwen/Qwen3-235B-A22B-Instruct-2507

Qwen/Qwen3-235B-A22B-Instruct-2507

Ultra-large-scale Mixture-of-Experts (MoE) Instruction-Finetuned Language Model
2025-07-23
LLM
Model capability: function_call
Input:
$0.36/1M tokens
Output:
$1.43/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-235B-A22B-Instruct-2507 is a super-large-scale Mixture-of-Experts (MoE) instruction-tuned language model released by Alibaba’s Tongyi Lab. Its core positioning is as a flagship AI foundation model featuring “top-tier general capabilities + enterprise-grade reliability.”

  • Ultra-large MoE architecture: With a total of 235 billion parameters, yet activating only 22 billion parameters, it achieves leading performance among open-source models in authoritative benchmarks such as MMLU, C-Eval, GSM8K, and HumanEval.
  • 128K ultra-long context: Natively supports long-text inputs, making it suitable for advanced tasks such as complex document analysis, multi-turn deep conversations, and cross-paragraph reasoning.
  • Extremely optimized instruction following: Fine-tuned based on high-quality human preference data, accurately responding to fine-grained instruction requirements regarding format, style, and multiple constraints.
  • Multi-language and code enhancement: Deeply optimized for understanding Chinese contexts while supporting dozens of languages including English, Japanese, French, and mainstream programming languages for code generation.

───────────────────────────────────────────────────────────────────

Core Capabilities

🧠 Expert-level comprehensive reasoning: Demonstrates expert-like logical chains in complex tasks such as mathematical proofs, technical proposal writing, and legal clause interpretation.

High-throughput, low-latency service: The MoE architecture strikes a balance between high performance and cost-effectiveness, enabling a single GPU to support highly concurrent enterprise-grade API services.

🌍 Globalized content generation: Produces natural, fluent outputs that take cultural adaptation and specialized terminology into account, making it ideal for international business and localized scenarios.

🛡️ Secure, compliant, and controllable: Supports private deployment, content filtering, sensitive word interception, and audit logs, meeting the stringent regulatory requirements of industries such as finance, government, and healthcare.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

Qwen/Qwen3-235B-A22B-Instruct-2507

-
256003

Input$0.36 / 1M tokens
Output$1.43 / 1M tokens

Input$0.36/ 1M tokens
Output$1.43/ 1M tokens
Original Price