deepseek-vl2

deepseek-vl2

Open-source multimodal large model with a Mixture of Experts (MoE) architecture
2025-01-20
LLM
Model capability: image
Input:
$0.165/1M tokens
Output:
$0.165/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

DeepSeek-VL2 is a professional-grade vision-language model launched by deepseek-ai, primarily designed to achieve advanced multimodal understanding.

  • Advanced Architecture: Employs a Mixture-of-Experts architecture to enhance multimodal comprehension capabilities.
  • Wide Range of Applications: Suitable for multimodal scenarios that require processing both images and text.
  • Outstanding Performance: Delivers exceptional results in multimodal understanding tasks.

───────────────────────────────────────────────────────────────────

Core Capabilities

📊 Multimodal Understanding: Easily handles both image and text information, enabling deep comprehension of multimodal content.

💪 Mixture-of-Experts Architecture: The Mixture-of-Experts architecture allows the model to flexibly allocate expert networks based on different tasks, improving processing efficiency and effectiveness.

🚀 Efficient Processing: Responds quickly and efficiently to complete understanding and analysis in multimodal tasks.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (DeepSeek-VL2)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

deepseek-vl2

-
4000

Input$0.15 / 1M tokens
Output$0.15 / 1M tokens

Input$0.165/ 1M tokens
Output$0.165/ 1M tokens
10%