mistral-large-latest

mistral-large-latest

Mistral AI’s flagship multimodal language model,
2025-03-18
LLM
Model capability: function_call
Input:
$2.2/1M tokens
Output:
$6.6/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Mistral-Large-2512 is Mistral AI’s flagship multimodal multi-expert (MoE) language model, jointly optimized by Unsloth, vLLM, and Red Hat into an NVFP4 quantized version. Its core positioning is “enterprise-grade efficient inference,” balancing a massive 675B parameter scale with practical deployment feasibility.

───────────────────────────────────────────────────────────────────

Core Capabilities

🧠 Granular MoE Intelligence: 41B activation parameters deliver trillion-parameter-level performance, enabling more precise and efficient inference

👁️ Multimodal Understanding: Built-in 2.5B visual encoder capable of parsing image content and integrating it with text-based reasoning

📦 NVFP4 Quantized Deployment: Can run on A100/H100 GPUs, with significantly lower memory requirements than FP8, offering superior cost efficiency

⚡ 256K Long Context: Supports ultra-long documents, codebases, and multi-turn conversations, making enterprise knowledge processing effortless

🤖 Native Agentic Capabilities: Automatic tool selection and structured output, allowing easy construction of AI agents

🌍 Full Multilingual Coverage: Supports dozens of languages including Chinese, with strong instruction adherence and ideal for globalized scenarios

───────────────────────────────────────────────────────────────────

Applicable Scenarios

  • Long-document understanding
  • Powerful everyday driving AI assistant
  • State-of-the-art agent and tool utilization capabilities
  • Enterprise knowledge work
  • General-purpose coding assistant

───────────────────────────────────────────────────────────────────

Benchmark Results

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Mistral)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

mistral-large-latest

-
128000

Input$2 / 1M tokens
Output$6 / 1M tokens

Input$2.2/ 1M tokens
Output$6.6/ 1M tokens
10%