mistral-large-2512

mistral-large-2512

Mistral AI’s flagship multimodal language model, whose core positioning is “enterprise-grade efficient inference.”
2025-12-17
LLM
Model capability: imageModel capability: function_call
Input:
$1.1/1M tokens
Output:
$3.3/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Mistral-Large-2512 is Mistral AI’s flagship multimodal multi-expert (MoE) language model, jointly optimized by Unsloth, vLLM, and Red Hat into an NVFP4 quantized version. Its core positioning is “enterprise-grade efficient inference,” balancing a massive 675B parameter scale with practical deployment feasibility.

───────────────────────────────────────────────────────────────────

Core Capabilities

🧠 Granular MoE Intelligence: 41B activation parameters deliver trillion-parameter-level performance, enabling more precise and efficient inference

👁️ Multimodal Understanding: Built-in 2.5B visual encoder capable of parsing image content and integrating it with text-based reasoning

📦 NVFP4 Quantized Deployment: Can run on A100/H100 GPUs, with significantly lower memory requirements than FP8, offering superior cost efficiency

⚡ 256K Long Context: Supports ultra-long documents, codebases, and multi-turn conversations, making enterprise knowledge processing effortless

🤖 Native Agentic Capabilities: Automatic tool selection and structured output, allowing easy construction of AI agents

🌍 Full Multilingual Coverage: Supports dozens of languages including Chinese, with strong instruction adherence and ideal for globalized scenarios

───────────────────────────────────────────────────────────────────

Applicable Scenarios

  • Long-document understanding
  • Powerful everyday driving AI assistant
  • State-of-the-art agent and tool utilization capabilities
  • Enterprise knowledge work
  • General-purpose coding assistant

───────────────────────────────────────────────────────────────────

Benchmark Results

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Mistral)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

mistral-large-2512

-
256000

Input$1 / 1M tokens
Output$3 / 1M tokens

Input$1.1/ 1M tokens
Output$3.3/ 1M tokens
10%