
ascend-tribe/pangu-pro-moe
Mixture of Experts (MoGE) Architecture Model
2025-07-02
Input:
$0.143/1M tokens
Output:
$0.572/1M tokens
Bulk order? Contact your manager for exclusive deals
API Overview
Pangu-Pro-MoE 72B-A16B is a sparse large language model with 72 billion parameters and 16 billion activated parameters. It is based on the Grouped Mixture of Experts (MoGE) architecture, which groups experts during the expert selection phase and constrains tokens to activate an equal number of experts within each group, thereby achieving expert load balancing and significantly improving the model's deployment efficiency on the Ascend platform.
Playground
Log in to explore more features! Click to Log In
API Analytics
API Reference (1)
API Pricing
$¥ 円 ₽