sophnet/GLM-4.7

sophnet/GLM-4.7

Zhipu’s flagship text model, covering complex task solving and efficient creation across multiple scenarios.
2025-12-24
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.286/1M tokensstarting from
Output:
$1.143/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

GLM-4.7 is Zhipu’s flagship text model, primarily positioned to enhance agentic coding capabilities, covering complex task solving and efficient creation across multiple scenarios, enabling autonomous planning, execution, and validation of code tasks.

  • Key Upgrades: Compared to GLM-4.6, its mathematical and reasoning abilities have improved by 41%, achieving a score of 42.8% on the HLE benchmark—exceeding GPT-5.1; it introduces retained thinking and round-based thinking modes, ensuring more stable execution of complex tasks and allowing users to control inference costs as needed.
  • Applicable Scenarios: It covers multiple professional scenarios, including agentic coding, multimodal interaction app development, frontend design, office productivity, deep research, software development, algorithm implementation, and script automation.
  • Evaluation Data: The τ²-Bench interactive tool invocation evaluation scored 84.7 points (open-source SOTA), and the BrowseComp web task evaluation scored 67 points.
  • Product Value: It generates complete, runnable code frameworks, reducing manual assembly and debugging costs and accelerating the process from idea to implementation.
  • Core Parameters: Context window size is 200K, with a maximum output token limit of 128K, supporting structured format outputs such as JSON.
  • Evaluation Data: It supports over 100 programming languages, averages 3.2 toolchain calls per task, and boasts an autonomous repair rate exceeding 65%.

───────────────────────────────────────────────────────────────────

Core Capabilities

💻 Strong Breakthrough in Agentic Coding: From requirement understanding to solution decomposition, it autonomously completes integration across multiple tech stacks, generating structurally complete, runnable code that covers front-to-back end integration scenarios.

🧠 Controllable Thinking Modes: Supports interleaved, retained, and round-based thinking. For simple tasks, it reduces thinking time to lower latency; for complex tasks, it increases thinking depth to improve accuracy; and for long-term tasks, it enables more efficient caching.

🎨 Enhanced Frontend Aesthetic Quality: It precisely understands UI design standards and provides high-quality default solutions for layout, color schemes, and component styles, significantly reducing the time spent on fine-tuning styles.

📝 Immersive Creation Experience: Text expressions are detailed and vividly visual, character roles maintain consistent personality traits, plot progression feels natural, and office creation adheres closely to mainstream formatting standards.

───────────────────────────────────────────────────────────────────

Test Data

Description

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SophNet)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

sophnet/GLM-4.7

Input < 32k, output < 0.2k
202000

Input$0.286 / 1M tokens
Output$1.143 / 1M tokens

Input$0.286/ 1M tokens
Output$1.143/ 1M tokens
Original Price

sophnet/GLM-4.7

Input < 32k, output >= 0.2k
202000

Input$0.429 / 1M tokens
Output$2 / 1M tokens

Input$0.429/ 1M tokens
Output$2/ 1M tokens
Original Price

sophnet/GLM-4.7

32k <= input <= 202k
202000

Input$0.572 / 1M tokens
Output$2.286 / 1M tokens

Input$0.572/ 1M tokens
Output$2.286/ 1M tokens
Original Price