kat-coder

kat-coder

High-performance AI coding model from Kuaishou StreamLake
2025-10-27
LLM
Model capability: function_call
Input:
$0.3/1M tokens
Output:
$1.2/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

The Kuaishou KAT-Coder series is an agentic coding large model family launched by Kuaishou’s Kwaipilot team. Its core positioning is as a “high-cost-performance AI programming assistant covering all scenarios.” The series includes three differentiated models, catering to diverse needs—from lightweight personal development to complex enterprise-level projects. The Air edition is permanently free, lowering the barrier to entry for users.

  • Differentiated Model Layout: The series comprises three core models: KAT-Coder-Pro V1 (high-performance closed-source version, 73.4% solution rate verified by SWE-Bench, 256K context window, suitable for complex enterprise-level tasks); KAT-Coder-Air V1 (lightweight free version, fifth globally among open-source models of similar size, 128K context window, ultra-fast response time); and KAT-Coder-Exp-72B (open-source experimental version, 74.6% solution rate verified by SWE-Bench, leading the open-source SOTA, designed for academic exploration).
  • Top-Tier Practical Performance: The Pro version generates code that can be run directly—for example, a web-based operating system (complete with features like a text editor and drawing board) and a cyberpunk-style Fruit Ninja HTML5 game (exceeding Grok4 and Claude Sonnet 4.5 in completion level). In the Kilo Code benchmark, it completed 10 features in just 4 minutes with zero tool invocation failures, offering better cost-effectiveness than Haiku 4.5 and GLM 4.6.
  • Wide Tool and Deployment Compatibility: The entire series is compatible with mainstream AI programming tools such as Claude Code, Cline, Kilo Code, and Roo Code. The Pro version uses the Wanqing API; the Air version supports both API and Hugging Face deployment (requiring 1×A100 or 4×RTX4090), while the Exp-72B requires 2×A100 or 8×RTX4090. It supports over 20 mainstream programming languages.

───────────────────────────────────────────────────────────────────

Core Capabilities

💻 End-to-End Code Generation: Covers the entire workflow from requirement analysis and architecture planning through coding implementation and debugging/correction. The Pro version supports project-level development, while the Air version meets basic feature-writing needs.

🔧 Stable Tool Invocation: Features low API invocation error rates and can call Git, compilers, debuggers, and other tools, adapting to complex development workflows. The Kilo Code benchmark showed zero failure rates.

📚 Ultra-Long Context Handling: The Pro version supports 256K context, while the Air and Exp versions handle 128K contexts, easily managing large-scale codebase comprehension and long-text programming tasks.

🌍 Multi-Scenario Adaptation: Covers frontend (HTML/CSS/JS), backend, algorithm implementation, and other scenarios, supporting multiple programming languages. The Pro version is tailored for industrial-grade and enterprise-level engineering needs.

High Efficiency and Low Consumption: The Air version delivers ultra-fast response times, while the Pro version stands out in cost-effectiveness—consuming less resources than models with comparable performance, thus reducing development costs.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(PPIO)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

kat-coder

-
256000

Input$0.3 / 1M tokens
Output$1.2 / 1M tokens

Input$0.3/ 1M tokens
Output$1.2/ 1M tokens
Original Price