deepseek-v3.2-exp

deepseek-v3.2-exp

Experimental version of DeepSeek-V3.2
2025-09-29
LLM
Model capability: function_call
Input:
$0.29/1M tokens
Output:
$0.43/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-V3.2-Exp is an experimental language model released by DeepSeek, primarily designed to explore optimizations in long-text training and inference efficiency through the introduction of a sparse attention mechanism.

  • Improved Efficiency: The introduction of the DeepSeek Sparse Attention mechanism significantly boosts the efficiency of both long-text training and inference without compromising output quality.
  • Consistent Performance: On various publicly available benchmark datasets, its performance is essentially on par with V3.1 - Terminus.
  • Open-Source Model: The model has been open-sourced on Huggingface and ModelScope, and the corresponding paper has also been made publicly available.
  • Open-Source Operators: The key operators for both TileLang and CUDA versions have been open-sourced, facilitating research and experimentation.

───────────────────────────────────────────────────────────────────

Core Capabilities

Sparse and Efficient: A fine-grained sparse attention mechanism that delivers high efficiency in processing long texts. 📚 Consistent Results: Its performance on public benchmark datasets is essentially identical to that of V3.1 - Terminus, ensuring reliable results.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (DeepSeek)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

deepseek-chat

-
128000

Input$0.29 / 1M tokens
Output$0.43 / 1M tokens

Input$0.29/ 1M tokens
Output$0.43/ 1M tokens
Original Price