glm-4.5-x

glm-4.5-x

GLM-4.5 series high-performance, strong inference, ultra-fast response model
2025-07-29
LLM
Model capability: function_call
Input:
$1.143/1M tokensstarting from
Output:
$2.29/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

GLM-4.5 is a MoE large model with 355 billion parameters launched by Zhipu AI. Its core positioning is as a full-stack open-source foundation specifically designed for AI agents, natively integrating inference, programming, and tool-call capabilities.

  • Leading in Open Source: Achieving an overall score of 63.2 across 12 industry-standard benchmarks, GLM-4.5 ranks first among global open-source models and third globally in overall performance.
  • Two Version Configurations: Offering the flagship version GLM-4.5 (355B-A32B) and the lightweight version GLM-4.5-Air (106B-A12B), catering to all scenarios—from cloud to edge devices.
  • Native Agent Support: For the first time, a single model unifies inference, coding, and agent capabilities, enabling it to plan projects, call tools, and execute code just like a human engineer.
  • Exceptional Cost-Effectiveness: With extremely low API call costs (as low as 0.8 yuan per million tokens for input) and support for FP8 quantization, GLM-4.5 significantly lowers the barrier to enterprise deployment.
  • Dual-Mode Switching: Featuring a uniquely designed “Thinking Mode” and “Non-Thinking Mode,” users can freely switch between them based on task complexity, balancing response speed and deep reasoning capabilities.

───────────────────────────────────────────────────────────────────

Core Capabilities

🤖 Agent Brain: Natively supports tool calls and code execution, enabling autonomous completion of full-stack development tasks from requirement analysis to deployment—truly a digital employee.

⚡ MoE Efficiency Revolution: With 355 billion parameters, it seamlessly integrates diverse knowledge sources; during inference, only about 9% of parameters (32 billion) are activated, perfectly decoupling high performance from low cost.

💻 Full-Stack Development: Just one sentence instruction is enough to generate complete web pages, games, or databases; tested results show outstanding performance on code benchmarks such as SWE-bench.

👁️ Visual Reasoning: Combined with GLM-4.5V, it boasts screen understanding and manipulation capabilities, able to recognize icons, locate elements, and even automatically order takeout or book flights for you.

───────────────────────────────────────────────────────────────────

Related Evaluations

“The New King of Domestic Large Models! GLM-4.5 Tops Open Source Charts, Pitting Logical Reasoning Against Grok 4?”

302.AI 基准实验室丨国产大模型新卷王!GLM-4.5 开源登顶,逻辑推理硬刚 Grok 4?

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Zhipu GLM-4)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

glm-4.5-x

Input [0,32k] Output[0,0.2k]
128000

Input$1.143 / 1M tokens
Output$2.29 / 1M tokens

Input$1.143/ 1M tokens
Output$2.29/ 1M tokens
Original Price

glm-4.5-x

Input [0,32k] Output [0.2+k ]
128000

Input$1.714 / 1M tokens
Output$4.571 / 1M tokens

Input$1.714/ 1M tokens
Output$4.571/ 1M tokens
Original Price

glm-4.5-x

Input[32,148k]
128000

Input$2.286 / 1M tokens
Output$9.143 / 1M tokens

Input$2.286/ 1M tokens
Output$9.143/ 1M tokens
Original Price