kimi-latest

kimi-latest

Kimi dynamic update LLM
2025-02-25
LLM
Model capability: imageModel capability: function_call
Input:
$0.315/1M tokensstarting from
Output:
$1.573/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

Kimi Latest is a dynamically updated large language model launched by Moonshot AI. Its core positioning is “a cutting-edge capability foundation that stays实时 synchronized with the Kimi smart assistant product + an advanced experience channel on the open platform.”

  • Real-time Synchronized Product Model: Directly aligns with the latest model version currently used in the Kimi smart assistant’s web and app interfaces, automatically updating with product iterations to ensure developers can experience new features—such as enhanced online search, improved JSON output, and experimental reasoning modes—at the earliest opportunity.
  • Retains Ultra-long Context of 128K Tokens: Supports context lengths of up to 128,000 tokens and intelligently selects billing tiers of 8K, 32K, or 128K based on actual usage, striking a balance between flexibility and cost efficiency.
  • Comprehensive Multimodal and Tool Capabilities: As a vision-language model, it supports image understanding; at the same time, it fully integrates with core open-platform features such as Tool Calls, JSON Mode, Partial Streaming, and online search.
  • Clear Capability Boundaries Prompt: Suitable for scenarios that prioritize the latest chat experience, early access to cutting-edge features, or alignment with Kimi product behavior. However, since it includes experimental features that are not yet fully stable, it is not recommended for production systems that rely on strict prompt stability.

───────────────────────────────────────────────────────────────────

Core Capabilities

💬 Synchronized Product-Level Interaction: Consistent with the Kimi smart assistant experience, providing responses rich in emotional value, suitable for both everyday chats and anthropomorphic assistant scenarios.

🖼️ Basic Multimodal Support: Equipped with image understanding capabilities, capable of handling mixed text-and-image inputs, expanding interaction dimensions. 📚 Ultra-long Context Processing: A 128K window supports long-document comprehension and multi-turn conversations, automatically adapting to billing specifications to balance performance and cost.

🛠️ Full Functionality Compatibility: Inherits capabilities such as tool calls, online connectivity, and JSON output, seamlessly taking on lightweight automation tasks.

⚡ Low-Cost Cache Optimization: Automatically caches context, reducing costs for repeated interactions and enhancing cost-effectiveness for high-frequency use.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Moonshot AI-Vision)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

kimi-latest

Context length ≤ 8,192 tokens
128000

Input$0.286 / 1M tokens
Output$1.43 / 1M tokens

Input$0.315/ 1M tokens
Output$1.573/ 1M tokens
10%

kimi-latest

8,192 < context length ≤ 32,768 tokens
128000

Input$0.72 / 1M tokens
Output$2.86 / 1M tokens

Input$0.792/ 1M tokens
Output$3.146/ 1M tokens
10%

kimi-latest

32,768 < context length ≤ 131,072 tokens
128000

Input$1.43 / 1M tokens
Output$4.29 / 1M tokens

Input$1.573/ 1M tokens
Output$4.719/ 1M tokens
10%