inclusionAI/Ling-mini-2.0

inclusionAI/Ling-mini-2.0

Silicon-based, flow-deployed Ling-mini-2.0
2025-09-09
LLM
Model capability: function_call
Input:
$0.0715/1M tokens
Output:
$0.286/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Ling-mini-2.0 is a small, high-performance large language model based on the MoE architecture. It boasts 16 billion total parameters, yet each token activates only 1.4 billion (with 789 million non-embedding parameters), enabling exceptionally fast generation speeds. Thanks to its efficient MoE design and access to vast, high-quality training data, Ling-mini-2.0 achieves state-of-the-art performance in downstream tasks—performance that rivals dense LLMs with fewer than 10 billion parameters as well as larger-scale MoE models—despite having an active parameter count of just 1.4 billion.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

inclusionAI/Ling-mini-2.0

-
128000

Input$0.0715 / 1M tokens
Output$0.286 / 1M tokens

Input$0.0715/ 1M tokens
Output$0.286/ 1M tokens
Original Price