mistral-small-2503

mistral-small-2503

Lightweight Multimodal Language Model
2025-03-18
LLM
Model capability: function_call
Input:
$0.55/1M tokens
Output:
$0.55/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Mistral Small 3.1 is a lightweight multimodal language model launched by Mistral AI, designed as an efficient inference engine that strikes a balance between being “small yet powerful, fast yet comprehensive,” while delivering robust capabilities in text, image, multilingual support, and long-context understanding.

  • Outstanding overall performance: In multiple benchmarks—including MMLU, HumanEval, and MMMU—Mistral Small 3.1 outperforms peer models such as Gemma 3 and GPT-4o Mini.
  • Native multimodal understanding: It can directly process image-plus-text inputs and supports visual tasks like chart interpretation and screenshot-based question answering.
  • Ultra-long context of 128K tokens: Easily handles long-document summarization and complex multi-turn dialogues, with consistently reliable retrieval of key information.
  • Deep multilingual support: Covers major language families across Europe, East Asia, the Middle East, and more, significantly improving performance in non-English scenarios.

───────────────────────────────────────────────────────────────────

Core Capabilities

High-speed, low-consumption inference: Generates up to 150 tokens per second, offering fast response times and low costs, making it ideal for high-concurrency business applications. 👁️ Integrated image-and-text analysis: Understands screenshots of interfaces with embedded text, data charts, or product images, and provides precise answers based on contextual understanding. 🌍 Natural cross-language interaction: Outputs responses that align with local linguistic conventions, avoiding mechanical translation and truly enabling global usability. 🧩 Agent-friendly architecture: Supports Function Calling and structured responses, allowing seamless integration into automated workflows or AI assistants.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (20)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(LLaMA3.3)
POST
Stable
View Details
Chat(LLaMA3.2 multimodal)
POST
Stable
View Details
Chat(LLaMA3.1)
POST
Stable
View Details
Chat(Mixtral-8x7B)
POST
Stable
View Details
Chat(Gemma-7B)
POST
Stable
View Details
Chat(Gemma2-9B)
POST
Stable
View Details
Chat(Command R+)
POST
Stable
View Details
Command R
POST
Stable
View Details
Chat(Qwen2)
POST
Stable
View Details
Chat(Qwen2.5)
POST
Stable
View Details
Chat(Llama-3.1-nemotron)
POST
Stable
View Details
Chat(Mistral)
POST
Stable
View Details
Chat(Pixtral-Large-2411multimodal)
POST
Stable
View Details
Chat(QwQ-32B-Preview)
POST
Stable
View Details
Marco-o1
POST
Stable
View Details
QVQ-72B-Preview
POST
Stable
View Details
QwQ-32B
POST
Stable
View Details
Gemma-3-27b-it
POST
Stable
View Details
Qwen3
POST
Stable
View Details
Chat(LLaMA4)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

mistral-small-2503

-
128000

Input$0.5 / 1M tokens
Output$0.5 / 1M tokens

Input$0.55/ 1M tokens
Output$0.55/ 1M tokens
Original Price