MiniMax-Text-01

MiniMax-Text-01

Adopts a hybrid architecture integrating Lightning Attention, Softmax Attention, and Mixture-of-Experts (MoE)
2025-01-15
LLM
Model capability: imageModel capability: function_call
Input:
$0.154/1M tokens
Output:
$1.232/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

MiniMax-Text-01 is a lightweight, high-efficiency large language model launched by MiniMax (Shanghai Xiyu Technology). Its core positioning is as a general-purpose text-processing engine that delivers "low latency, high cost-effectiveness, and strong Chinese-language understanding."

  • Ultimate Inference Efficiency: Optimized specifically for high-frequency, low-latency scenarios, with response times reaching the millisecond level. Ideal for real-time interactions such as customer service, search, and content generation.
  • Deep Optimization for Chinese Semantics: Outstanding performance in understanding Chinese instructions, using idiomatic expressions, and restoring cultural context, resulting in more natural and authentic outputs.
  • Great Capabilities in a Small Model: With a compact parameter size, it can be deployed on consumer-grade GPUs or CPUs, significantly lowering the barrier to entry for enterprises.
  • Ready-to-Use Across Multiple Scenarios: Supports mainstream NLP tasks such as summarization, rewriting, question answering, and content creation.

───────────────────────────────────────────────────────────────────

Core Capabilities

🇨🇳 More Authentic Chinese Expressions: Accurately captures implicit meanings and local expression habits, avoiding stiff, translated tones and generating copy that reads like it was written by a native speaker.

Lightweight and High-Speed Response: A small-footprint architecture delivers high-throughput inference, easily handling high-concurrency business demands with extremely low resource consumption.

🧩 Strong Task Generalization Ability: From marketing copy to official document writing, from logical question answering to emotional support—this single model covers a wide range of text scenarios.

🛡️ Localized, Secure, and Controllable: Full-link support for private deployment meets compliance requirements for data-sensitive industries such as finance, government, and education.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (25)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Baidu ERNIE)
POST
Stable
View Details
Chat (Tongyi Qianwen)
POST
Stable
View Details
Chat (Tongyi Qianwen-VL)
POST
Stable
View Details
Chat (Zhipu GLM-4)
POST
Stable
View Details
Chat (Zhipu GLM-4V)
POST
Stable
View Details
Chat (Baichuan AI)
POST
Stable
View Details
Chat (Moonshot AI)
POST
Stable
View Details
Chat (Moonshot AI-Vision)
POST
Stable
View Details
Chat (01.AI)
POST
Stable
View Details
Chat (01.AI-VL)
POST
Stable
View Details
Chat (DeepSeek)
POST
Stable
View Details
Chat (ByteDance Doubao)
POST
Stable
View Details
Chat (ByteDance Doubao-Vision)
POST
Stable
View Details
Chat (Stepfun Multimodal)
POST
Stable
View Details
Chat (iFLYTEK Spark)
POST
Stable
View Details
Chat (SenseTime)
POST
Stable
View Details
Chat(Minimax)
POST
Stable
View Details
Chat (Tencent Hunyuan)
POST
Stable
View Details
Chat(Tongyi Qianwen)
POST
Stable
View Details
Hunyuan(Text-to-Video)
POST
Stable
View Details
Hunyuan(Obtain Task Results)
GET
Stable
View Details
Chat(Tongyi Qianwen-OCR)
POST
Stable
View Details
GLM-Zero-Preview
POST
Stable
View Details
QwQ-Plus
POST
Stable
View Details
Chat(ByteDance Doubao Image Generation)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

MiniMax-Text-01

-
1000000

Input$0.14 / 1M tokens
Output$1.12 / 1M tokens

Input$0.154/ 1M tokens
Output$1.232/ 1M tokens
10%