abab6.5s-chat

abab6.5s-chat

MoE Architecture General Large Model
2024-04-17
LLM
Model capability: function_call
Input:
$0.154/1M tokens
Output:
$0.154/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

The abab 6.5 series is a general-purpose large language model based on the MoE architecture, launched by MiniMax (Shanghai Xiyu Technology). Its core positioning is as an efficient and intelligent foundation characterized by "trillion parameters, sub-second processing, and precise handling of long texts."

  • Leading MoE Architecture: The first domestically developed model series to extensively adopt the Mixture-of-Experts (MoE) architecture. The abab 6.5 boasts trillion-level total parameters, while the abab 6.5s is even more efficient and streamlined.
  • Ultra-long Context Support: The entire series supports contexts of up to 200K tokens, and in the "needle-in-a-haystack" test, it accurately retrieved key information in all 891 instances.
  • Extreme Processing Speed: The abab 6.5s can process nearly 30,000 characters of text within just one second, meeting enterprise-level requirements for high throughput and low latency.
  • Comprehensive Performance Approaching Top Models: In multiple core benchmarks, its performance closely rivals that of internationally leading models such as GPT-4, Claude-3, and Gemini-1.5.

───────────────────────────────────────────────────────────────────

Core Capabilities

Sub-second Parsing of Long Texts: Even when dealing with hundred-page documents or entire novels, it can quickly pinpoint details, maintain logical coherence, and provide accurate answers to questions.

🧠 High-Density Intelligent Output: By optimizing the data pipeline and training algorithms, it achieves high-quality reasoning with limited activation parameters.

🔍 Robust Contextual Memory: Even when irrelevant content is inserted into ultra-long inputs of up to 200K tokens, it can stably identify and extract the critical "needle" information.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (25)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Baidu ERNIE)
POST
Stable
View Details
Chat (Tongyi Qianwen)
POST
Stable
View Details
Chat (Tongyi Qianwen-VL)
POST
Stable
View Details
Chat (Zhipu GLM-4)
POST
Stable
View Details
Chat (Zhipu GLM-4V)
POST
Stable
View Details
Chat (Baichuan AI)
POST
Stable
View Details
Chat (Moonshot AI)
POST
Stable
View Details
Chat (Moonshot AI-Vision)
POST
Stable
View Details
Chat (01.AI)
POST
Stable
View Details
Chat (01.AI-VL)
POST
Stable
View Details
Chat (DeepSeek)
POST
Stable
View Details
Chat (ByteDance Doubao)
POST
Stable
View Details
Chat (ByteDance Doubao-Vision)
POST
Stable
View Details
Chat (Stepfun Multimodal)
POST
Stable
View Details
Chat (iFLYTEK Spark)
POST
Stable
View Details
Chat (SenseTime)
POST
Stable
View Details
Chat(Minimax)
POST
Stable
View Details
Chat (Tencent Hunyuan)
POST
Stable
View Details
Chat(Tongyi Qianwen)
POST
Stable
View Details
Hunyuan(Text-to-Video)
POST
Stable
View Details
Hunyuan(Obtain Task Results)
GET
Stable
View Details
Chat(Tongyi Qianwen-OCR)
POST
Stable
View Details
GLM-Zero-Preview
POST
Stable
View Details
QwQ-Plus
POST
Stable
View Details
Chat(ByteDance Doubao Image Generation)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

abab6.5s-chat

-
245000

Input$0.14 / 1M tokens
Output$0.14 / 1M tokens

Input$0.154/ 1M tokens
Output$0.154/ 1M tokens
10%