qwen2-57b-a14b-instruct

qwen2-57b-a14b-instruct

The 57B-scale MOE model with 14B activation parameters, open-sourced by Tongyi Qianwen 2.
2024-06-07
LLM
Model capability: function_call
Input:
$0.5/1M tokens
Output:
$1/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen2-57B-A14B-Instruct is an efficient mixture-of-experts (MoE) instruction-tuned language model released by Alibaba’s Tongyi Lab. Its core positioning is as a large-capacity, low-activation, cost-effective general-purpose AI foundation for enterprise-level applications.

  • Advanced MoE Architecture: With a total of 57 billion parameters, it activates only 14 billion parameters (A14B), significantly reducing inference costs and latency while maintaining powerful overall capabilities.
  • Ultra-long Context of 128K Tokens: Natively supports ultra-long text inputs, making it ideal for complex document processing, multi-turn deep conversations, and integrating information across paragraphs.
  • Fine-Tuned Instruction Following: Trained on high-quality human preference data, it delivers precise and reliable performance in format control, style imitation, and multi-constraint tasks.
  • Balanced Multilingual and Code Capabilities: It excels at understanding Chinese contexts while supporting mainstream languages such as English, Japanese, and French, as well as programming languages like Python and JavaScript.

───────────────────────────────────────────────────────────────────

Core Capabilities

High Energy-Efficiency Inference: Achieves performance close to that of fully parameterized dense models with only about one-fourth the computational overhead, making it suitable for high-concurrency enterprise-level API services.

🧠 Stable Execution of Complex Tasks: Delivers outstanding performance in benchmarks such as C-Eval, MMLU, GSM8K, and HumanEval, ideal for scenarios like technical solution generation and logical analysis.

🌍 Natural Bilingual Expression in Chinese and English: Produces outputs that are both natural and fluent, balancing technical terminology with cultural context, perfect for content creation, intelligent customer service, and office automation.

🛡️ Secure, Compliant, and Controllable: Supports private deployment, content filtering, and audit logs, meeting regulatory requirements in industries such as finance, government, and education.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Qwen2)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen2-57b-a14b-instruct

-
64000

Input$0.5 / 1M tokens
Output$1 / 1M tokens

Input$0.5/ 1M tokens
Output$1/ 1M tokens
Original Price