
abab6.5s-chat
API Overview
The abab 6.5 series is a general-purpose large language model based on the MoE architecture, launched by MiniMax (Shanghai Xiyu Technology). Its core positioning is as an efficient and intelligent foundation characterized by "trillion parameters, sub-second processing, and precise handling of long texts."
- Leading MoE Architecture: The first domestically developed model series to extensively adopt the Mixture-of-Experts (MoE) architecture. The abab 6.5 boasts trillion-level total parameters, while the abab 6.5s is even more efficient and streamlined.
- Ultra-long Context Support: The entire series supports contexts of up to 200K tokens, and in the "needle-in-a-haystack" test, it accurately retrieved key information in all 891 instances.
- Extreme Processing Speed: The abab 6.5s can process nearly 30,000 characters of text within just one second, meeting enterprise-level requirements for high throughput and low latency.
- Comprehensive Performance Approaching Top Models: In multiple core benchmarks, its performance closely rivals that of internationally leading models such as GPT-4, Claude-3, and Gemini-1.5.
───────────────────────────────────────────────────────────────────
Core Capabilities
⚡ Sub-second Parsing of Long Texts: Even when dealing with hundred-page documents or entire novels, it can quickly pinpoint details, maintain logical coherence, and provide accurate answers to questions.
🧠 High-Density Intelligent Output: By optimizing the data pipeline and training algorithms, it achieves high-quality reasoning with limited activation parameters.
🔍 Robust Contextual Memory: Even when irrelevant content is inserted into ultra-long inputs of up to 200K tokens, it can stably identify and extract the critical "needle" information.
Playground
Log in to explore more features! Click to Log In