Pro/deepseek-ai/DeepSeek-V3.2

Pro/deepseek-ai/DeepSeek-V3.2

The open-source large model with a multi-expert mixture-of-experts (MoE) architecture launched by DeepSeek.
2025-12-01
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.286/1M tokens
Output:
$0.429/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-V3.2 is the flagship open-source general-purpose language model launched by DeepSeek (DeepSeek), with a core focus on delivering exceptional performance that surpasses comparable dense models while maintaining extremely low inference costs, thanks to its state-of-the-art matrix multiplication optimization and multi-expert mixture-of-experts (MoE) architecture.

  • Outstanding cost-effectiveness: Open-sourced under the MIT License, with fully public weights and commercial-use permissions. Its API call costs are exceptionally low, making it one of the most cost-efficient flagship models currently available on the market.
  • Top-tier performance: It comprehensively outperforms Llama-3.1/3.2-405B in multiple benchmark tests, including MMLU and MATH-500, achieving top-notch inference and coding capabilities, with overall performance approaching that of GPT-4o.
  • Ultra-large-scale architecture: Featuring a MoE architecture with 2.168 trillion total parameters and 416 billion activated parameters, it supports a context length of 256k, enabling it to handle massive amounts of information and complex tasks efficiently.
  • High-efficiency inference: Leveraging FP8 quantization technology and extreme matrix multiplication (GEMM) optimizations, it delivers lightning-fast inference speeds, supporting smooth interactions even in high-concurrency scenarios.
  • Multi-language capabilities: Particularly outstanding in Chinese and English tasks, it also boasts powerful multilingual understanding and generation abilities, making it well-suited for globalized application scenarios.

───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Extreme Matrix Optimization

State-of-the-art inference optimization based on FP8 and GEMM. Through deep refinement of underlying operators, it achieves extremely high computational density and throughput, enabling ultra-large-scale models to run efficiently even with limited computing resources.

🧠 Powerful Mixture-of-Experts

Adopting a 2.168T MoE architecture with up to 416B activated parameters, it maintains an enormous model capacity while requiring only a small number of experts to be activated for task completion, striking the perfect balance between "large model" and "low cost."

🌐 Superior Inference and Coding

It sets new SOTA records in MATH-500 and code-generation tasks. With top-tier logical reasoning and coding capabilities, it can deliver high-quality solutions whether tackling complex mathematical proofs or full-stack software development.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

Pro/deepseek-ai/DeepSeek-V3.2

-
160000

Input$0.286 / 1M tokens
Output$0.429 / 1M tokens

Input$0.286/ 1M tokens
Output$0.429/ 1M tokens
Original Price