
glm-5
API Overview
GLM-5 is the next-generation flagship foundation model released by Zhipu AI. Its core positioning is as a “general-purpose agent foundation for agentic engineering,” specifically designed for complex systems engineering and long-term agent tasks. It achieves state-of-the-art performance in coding and agent capabilities among open-source models, with a real programming experience approaching that of Claude Opus 4.5.
- Capability Leap: The model’s parameter scale has expanded from 35.5B (with 32B activated) to 74.4B (with 40B activated), and its pre-training data reaches 28.5T tokens. Combined with the brand-new “Slime” asynchronous reinforcement learning framework and DeepSeek’s sparse attention mechanism, it significantly reduces deployment costs while maintaining context processing performance around 200K tokens.
- Programming Strength: GLM-5 achieves the highest scores among open-source models on SWE-bench-Verified (77.8) and Terminal Bench 2.0 (56.2), surpassing Gemini 3.0 Pro. It can independently handle system-level engineering tasks such as backend refactoring, deep debugging, and full-stack development.
- Agent Capabilities: In multi-tool long-term task evaluations including BrowseComp, MCP-Atlas, and τ²-Bench, GLM-5 ranks first among open-source models, demonstrating strong goal consistency, resource scheduling, and multi-step dependency handling capabilities.
- Office Integration: GLM-5 natively supports GLM in Excel and is compatible with Microsoft’s official AI plugin, enabling spreadsheet automation. It also supports structured outputs (JSON), function calls, MCP tool invocations, and context caching.
- Recommended Scenarios: Agentic Coding, end-to-end agent execution, cross-phase office tasks, immersive role-playing, professional script/storyboard generation, precise translation, structured data extraction, and information quality inspection.
───────────────────────────────────────────────────────────────────
Core Capabilities
🧠 Agentic-Ready Architecture
Supports the leap from “writing code” to “writing engineering,” enabling autonomous planning, execution, verification, and delivery of complete system-level products.
💻 SOTA-Level Programming Capability
Provides a real development environment experience close to Claude Opus 4.5, significantly reducing manual intervention and covering the entire frontend, backend, and data processing pipeline.
🛠️ Powerful Tool Ecosystem Integration
Supports function calls and the MCP protocol, allowing flexible invocation of external tools and data sources to build complex agent workflows.
📊 Deep Adaptation for Office Productivity
Through GLM in Excel and long-context memory, it stably handles multi-step, highly logically connected table and document tasks.
🎭 Highly Consistent Role-Playing
Consistently maintains character settings, emotions, and narrative logic during long-text interactions, enabling evolving, immersive conversations.
📄 Professional Content Generation
Long-text outputs such as scripts and storyboards exhibit production-quality usability, with greatly enhanced character development and plot coherence.
🔍 Precise Information Processing
It can extract structured data from complex texts like contracts, financial reports, and customer service tickets, and automatically perform quality checks and risk identification.
⚡ Efficient Inference Optimization
With a 200K context window, a maximum output of 128K, and a sparse attention mechanism, it balances long-term task capabilities with token efficiency.
Playground
Log in to explore more features! Click to Log In