sophnet/DeepSeek-V3.2

sophnet/DeepSeek-V3.2

The open-source large model with a multi-expert mixture-of-experts (MoE) architecture launched by DeepSeek.
2025-12-02
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.286/1M tokens
Output:
$0.429/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-V3.2 is the flagship open-source general-purpose language model launched by DeepSeek (DeepSeek). Its core positioning is to deliver exceptional performance—surpassing comparable dense models—while maintaining extremely low inference costs, thanks to its state-of-the-art matrix multiplication optimization and multi-expert mixture-of-experts (MoE) architecture.

  • Outstanding cost-effectiveness: Open-sourced under the MIT License, with fully public weights and commercial-use permissions. API call costs are exceptionally low, making it one of the most cost-efficient flagship models currently available on the market.
  • Top-tier performance: It comprehensively outperforms Llama-3.1/3.2-405B in multiple benchmarks, including MMLU and MATH-500. Its inference and coding capabilities rank among the best, with overall performance approaching that of GPT-4o.
  • Ultra-large-scale architecture: Featuring a MoE architecture with 2.168 trillion total parameters and 416 billion activated parameters, it supports a context length of 256k, enabling it to handle massive amounts of information and complex tasks.
  • Efficient inference: Leveraging FP8 quantization technology and extreme matrix multiplication (GEMM) optimizations, it delivers lightning-fast inference speeds and supports smooth interactions even in high-concurrency scenarios.
  • Multi-language capabilities: It excels particularly in Chinese and English tasks, while also demonstrating strong multilingual understanding and generation abilities, making it well-suited for globalized application scenarios.

───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Extreme Matrix Optimization

Based on FP8 and GEMM, this model achieves extreme inference optimization. Through deep refinement of underlying operators, it attains exceptionally high computational density and throughput, enabling ultra-large-scale models to run efficiently even with limited computing resources.

🧠 Powerful Mixture-of-Experts

Adopting a 2.168T MoE architecture with up to 416B activated parameters. While maintaining an enormous model capacity, it only activates a small number of experts to complete tasks, striking the perfect balance between "large model" and "low cost."

🌐 Superior Inference and Coding

It sets new SOTA records in MATH-500 and code-generation tasks. It boasts top-notch logical reasoning and coding abilities, delivering high-quality solutions whether tackling complex mathematical proofs or full-stack software development.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SophNet)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

sophnet/DeepSeek-V3.2

-
128000

Input$0.286 / 1M tokens
Output$0.429 / 1M tokens

Input$0.286/ 1M tokens
Output$0.429/ 1M tokens
Original Price