qwen/qwen-2.5-72b-instruct

qwen/qwen-2.5-72b-instruct

Alibaba’s flagship open-source language model—a high-performance inference engine designed for enterprise-level scenarios.
2025-06-10
LLM
Input:
$0.4/1M tokens
Output:
$0.4/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen2.5-72B-Instruct is Alibaba’s flagship open-source language model, primarily positioned as a high-performance inference engine for enterprise use. It achieves comprehensive breakthroughs in code generation, mathematical reasoning, and multilingual capabilities, outperforming Llama-3.1 models of similar scale.

  • Outstanding Performance: Surpasses Llama-3.1-70B and Claude-3.5-Sonnet in over 20 benchmark tests, including MMLU-Pro, MATH, and HumanEval; its inference speed is twice as fast as Llama-3.1-70B.
  • Champion for Long Text Processing: Natively supports context windows up to 128K tokens, combined with a uniquely optimized sliding-window attention mechanism, enabling effortless handling of ultra-long document summarization and analysis.
  • Strong Dual Capabilities in Code and Math: Significantly enhanced code-generation abilities, with a math reasoning score reaching 83.1—a top choice for complex logical tasks.
  • Foundation for Multimodal Applications: When paired with Qwen2-VL-72B, it forms a unified framework for vision-and-text understanding, supporting cross-modal tasks.

───────────────────────────────────────────────────────────────────

Core Capabilities

🚀 Ultra-Fast Inference: Employs a proprietary optimized architecture, significantly boosting inference throughput.

⌨️ Native Agent Support: Deeply integrated tool-call capabilities, featuring a built-in Python interpreter and API-call templates, making it easy to build AI agent applications.

🌐 Multilingual Proficiency: Supports over 29 languages, with deep optimizations for Chinese and English, while also covering minor languages such as Arabic and Thai—enabling seamless cross-language understanding without barriers.

🛡️ Enterprise-Grade Security: Includes built-in sensitive-word filtering and data anonymization mechanisms, meeting enterprise-level security compliance requirements and ensuring the safety of business data.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(PPIO)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen/qwen-2.5-72b-instruct

-
32000

Input$0.4 / 1M tokens
Output$0.4 / 1M tokens

Input$0.4/ 1M tokens
Output$0.4/ 1M tokens
Original Price