deepseek-v4-flash

deepseek-v4-flash

DeepSeek’s newly released language model, designed for high-performance production scenarios, is specifically optimized for ultimate inference efficiency and response speed.
2026-04-24
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.143/1M tokens
Output:
$0.286/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-V4-Flash is the latest language model released by DeepSeek, designed for high-performance production scenarios. As a core member of the DeepSeek-V4 series, it is specifically tailored for developers and enterprise users who prioritize extreme inference efficiency and response speed, striking a balance between model intelligence and computational cost.

───────────────────────────────────────────────────────────────────

Core Capabilities

Ultra-Fast Response: The Flash version has been specially optimized for response speed and inference throughput, enabling it to handle high-frequency requests with remarkable agility.

Decision-Making Intelligence: Despite its smaller parameter activation scale, the underlying total of 28.4 billion parameters ensures that the model maintains exceptionally high decision-making capabilities when processing complex instructions, logical reasoning, and multi-turn dialogues, making it well-suited to meet most mainstream AI business requirements.

Version Diversity: This model offers two versions—Base (base model) and post-trained Instruct (instruction-fine-tuned model)—to accommodate different deep development needs.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (3)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (DeepSeek)
POST
Stable
View Details
Chat (DeepSeek)
POST
Stable
View Details
Messages (for Claude Code)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

deepseek-chat

-
1000000

Input$0.143 / 1M tokens
Output$0.286 / 1M tokens

Input$0.143/ 1M tokens
Output$0.286/ 1M tokens
Original Price