
deepseek/deepseek-v3.2-exp
Experimental version of DeepSeek-V3.2
2025-09-30
Input:
$0.286/1M tokens
Output:
$0.4286/1M tokens
Bulk order? Contact your manager for exclusive deals
API Overview
DeepSeek-V3.2-Exp is an experimental language model released by DeepSeek, primarily designed to explore optimizations in long-text training and inference efficiency through the introduction of a sparse attention mechanism.
- Improved Efficiency: The introduction of the DeepSeek Sparse Attention mechanism significantly boosts the efficiency of both long-text training and inference without compromising output quality.
- Consistent Performance: On various publicly available benchmark datasets, its performance is essentially on par with V3.1 - Terminus.
- Open-Source Model: The model has been open-sourced on Huggingface and ModelScope, and the corresponding paper has also been made publicly available.
- Open-Source Operators: The key operators for both TileLang and CUDA versions have been open-sourced, facilitating research and experimentation.
───────────────────────────────────────────────────────────────────
Core Capabilities
⚡ Sparse and Efficient: A fine-grained sparse attention mechanism that delivers high efficiency in processing long texts. 📚 Consistent Results: Its performance on public benchmark datasets is essentially identical to that of V3.1 - Terminus, ensuring reliable results.
Playground
Log in to explore more features! Click to Log In
API Analytics
API Reference (1)
API Pricing
$¥ 円 ₽