qwen2.5-14b-instruct-1m

qwen2.5-14b-instruct-1m

Compared to Qwen2, Qwen2.5 has acquired significantly more knowledge and achieved substantial improvements in programming and mathematical abilities. It supports a context length of 1 million tokens.
2025-01-30
LLM
Model capability: function_call
Input:
$0.143/1M tokens
Output:
$0.43/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen2.5-14B-Instruct-1M is a ultra-long-context instruction-tuned language model released by Alibaba’s Tongyi Lab, primarily positioned as an expert in handling long documents with “million-level context + high-precision instruction execution.”

  • Native 1M (1 million) token context: Without the need for external extension mechanisms, it directly supports ultra-long text inputs and maintains high recall and positional awareness even in extreme-length tasks such as “finding a needle in a haystack.”
  • 14B dense architecture—stable and reliable: Fully parameterized activation, free from MoE routing uncertainties, delivering superior output consistency in long-context scenarios compared to mixture-of-experts models.
  • Fine-grained instruction-following alignment: Optimized for complex tasks such as long-document summarization, cross-paragraph question answering, and multi-chapter analysis, accurately responding to formatting and logical constraints.
  • Efficient attention mechanism: Employs an optimized attention computation strategy that controls memory usage and computational costs while maintaining performance, making it suitable for real-world deployment.

───────────────────────────────────────────────────────────────────

Core Capabilities

📚 Ultra-long document understanding: Can fully load and analyze entire novels, technical white papers, legal contracts, or annual financial reports, enabling end-to-end information extraction.

🔍 Precise cross-paragraph retrieval: Accurately locates key information within millions of tokens, answering fine-grained questions such as, “What are the experimental parameters mentioned in Chapter 37?”

🧠 Structured long-range reasoning: Supports inferences, comparisons, and deductions based on the logic of the entire text—for example, “Summarize the strategic trend changes across three quarterly reports.”

🛡️ Enterprise-grade security and compliance: Supports private deployment and content filtering, making it ideal for high-value scenarios such as financial research analysis, legal due diligence, and scientific literature reviews.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Qwen2.5)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen2.5-14b-instruct-1m

-
1000000

Input$0.143 / 1M tokens
Output$0.43 / 1M tokens

Input$0.143/ 1M tokens
Output$0.43/ 1M tokens
Original Price