deepseek-v3-aliyun

deepseek-v3-aliyun

DeepSeek-V3 Alibaba Cloud Deployment Version
2025-02-05
LLM
Model capability: function_call
Input:
$0.3/1M tokens
Output:
$1.2/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-V3 is the flagship open-source language model launched by the DeepSeek team. Its core mission is to achieve high performance while significantly reducing training costs through innovative architecture and training techniques.

  • Outstanding Performance: It excels in benchmarks such as MMLU and GPQA, outperforming some closed-source models in coding and math tasks, and delivering superior results in Chinese factual knowledge tasks.
  • Reduced Costs: Training costs have been slashed to 2.788 million H800 GPU hours—less than one-third of traditional approaches.
  • Increased Speed: Inference speed has more than doubled compared to the previous generation.
  • Long-Text Support: It supports long-text processing with a context length of up to 128K tokens.

───────────────────────────────────────────────────────────────────

Core Capabilities

⚙️ Efficient Architecture: Multi-head latent attention reduces inference memory usage, and the DeepSeekMoE architecture achieves load balancing.

🚀 Multi-Token Prediction: Allows the model to predict multiple future tokens at each position, accelerating inference speed by 1.8 times.

💪 FP8 Training: The first-ever validation of feasibility in ultra-large-scale models, reducing memory usage with minimal performance loss.

Parallel Framework: Uses bidirectional pipeline scheduling to minimize communication overhead, bringing training efficiency close to the theoretical maximum.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Deepseek)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

deepseek-v3-aliyun

-
64000

Input$0.3 / 1M tokens
Output$1.2 / 1M tokens

Input$0.3/ 1M tokens
Output$1.2/ 1M tokens
Original Price