
glm-4.5-flash
API Overview
GLM-4.5 is a MoE large model with 355 billion parameters launched by Zhipu AI. Its core positioning is as a full-stack open-source foundation specifically designed for AI agents, natively integrating inference, programming, and tool-call capabilities.
- Leading in Open Source: Achieving an overall score of 63.2 across 12 industry-standard benchmarks, GLM-4.5 ranks first among global open-source models and third globally in overall performance.
- Two Version Configurations: Offering the flagship version GLM-4.5 (355B-A32B) and the lightweight version GLM-4.5-Air (106B-A12B), catering to all scenarios—from cloud to edge devices.
- Native Agent Support: For the first time, a single model unifies inference, coding, and agent capabilities, enabling it to plan projects, call tools, and execute code just like a human engineer.
- Exceptional Cost-Effectiveness: With extremely low API call costs (as low as 0.8 yuan per million tokens for input) and support for FP8 quantization, GLM-4.5 significantly lowers the barrier to enterprise deployment.
- Dual-Mode Switching: Featuring a uniquely designed “Thinking Mode” and “Non-Thinking Mode,” users can freely switch between them based on task complexity, balancing response speed and deep reasoning capabilities.
───────────────────────────────────────────────────────────────────
Core Capabilities
🤖 Agent Brain: Natively supports tool calls and code execution, enabling autonomous completion of full-stack development tasks from requirement analysis to deployment—truly a digital employee.
⚡ MoE Efficiency Revolution: With 355 billion parameters, it seamlessly integrates diverse knowledge sources; during inference, only about 9% of parameters (32 billion) are activated, perfectly decoupling high performance from low cost.
💻 Full-Stack Development: Just one sentence instruction is enough to generate complete web pages, games, or databases; tested results show outstanding performance on code benchmarks such as SWE-bench.
👁️ Visual Reasoning: Combined with GLM-4.5V, it boasts screen understanding and manipulation capabilities, able to recognize icons, locate elements, and even automatically order takeout or book flights for you.
Playground
Log in to explore more features! Click to Log In