deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

DeepSeek's open-source language model-level lightweight product, based on knowledge distillation technology, achieves dual breakthroughs in inference efficiency and cost by optimizing Qwen-7B
2025-02-04
LLM
Model capability: function_call
Input:
Free
Output:
Free
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-R1-Distill-Qwen-7B is an open-source, lightweight language model product launched by DeepSeek. Its core positioning is as a high-performance, lightweight model based on knowledge distillation technology, achieving dual breakthroughs in inference efficiency and cost through optimization of Qwen-7B.

  • Technical Principle: It employs knowledge distillation to transfer the capabilities of a complex teacher model (Qwen-7B) to a lightweight student model, boosting inference speed by 40%.
  • Performance Advantages: In benchmarks such as MT-Bench and AlpacaEval 2.0, it outperforms the native Qwen-7B, scoring 82.5 points on the mathematical reasoning task (GSM8K), compared to 78.3 for Qwen-7B.
  • Open-Source License: It adopts the Apache 2.0 license and is compatible with the Hugging Face Transformers framework, providing complete training code and fine-tuning guidelines.
  • Applicable Scenarios: It is well-suited for resource-constrained environments such as edge computing, real-time translation, and lightweight code generation, reducing memory usage by 50%.

───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Ultra-Lightweight: Its proprietary distillation architecture compresses the model size to one-third of its original size, enabling smooth operation even on consumer-grade GPUs like the 3090.

📊 High-Performance Inference: It scores 5.78 on the MT-Bench benchmark (compared to 5.21 for Qwen-7B), with response latency below 80ms.

🔑 Low-Cost Deployment: Memory usage is reduced by 50%, allowing a single A100 GPU to support four concurrent requests, cutting operational costs by 40%.

🌍 Multi-Framework Compatibility: It natively supports Hugging Face and vLLM inference frameworks, enabling API service deployment with just three lines of code.

🛠️ Ready-to-Use: It provides pre-trained weights and domain-specific fine-tuning solutions, covering vertical scenarios such as healthcare and finance.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContext302.AI Price

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

-
64000

InputFree
OutputFree
Original Price