PROVIDER

Moonshot AI AI Models

Moonshot AI's Kimi K2 — 1T-parameter MoE with 256K context and 58.6% SWE-bench Pro, via one RunAPI key.

1 models · 2 variants · from $0.020
All Moonshot AI models available through RunAPI 1 models
OVERVIEW

Moonshot AI builds the Kimi K2 family — 1 trillion total parameters, 32B active per token, 384 experts per layer — optimized for autonomous coding and multi-agent orchestration. kimi-k2.6 reaches 58.6% on SWE-bench Pro and scales Agent Swarm to 300 sub-agents. Both kimi-k2.5 and kimi-k2.6 are available through RunAPI from the OpenAI and Anthropic SDKs.

  • Single API key shared across all providers
  • No separate %{provider} account required
  • Model skills carry docs, schemas, and setup steps into your workspace
  • Per-call billing in USD, no subscription or minimum spend
  • Failed generations are never charged
  • Switch models by changing one parameter
  • Billing consolidated into one monthly invoice
FEATURES

What stands out

MODELS

All Moonshot AI models available through RunAPI

QUICKSTART

Install a Moonshot AI model skill for your app.

Pick a model and add its skill so your coding tool has docs, schemas, pricing notes, and setup steps. Skills work with Claude Code, Codex, Gemini CLI, Cursor, and VS Code. Install once, then switch models by changing one parameter.

runapi.ai
# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions
curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "kimi-k2.6",
  "messages": [
    {
      "role": "user",
      "content": "Plan and implement a small CLI tool: scaffold the project, write the commands, add tests, and run them until they pass."
    }
  ]
}'
from openai import OpenAI

client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key="your-runapi-key"
)

response = client.chat.completions.create(
    model="kimi-k2.6",
    messages=[{"role": "user", "content": "Plan and implement a small CLI tool: scaffold the project, write the commands, add tests, and run them until they pass."}]
)
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "kimi-k2.6",
  messages: [{ role: "user", content: "Plan and implement a small CLI tool: scaffold the project, write the commands, add tests, and run them until they pass." }]
});
https://runapi.ai /v1/chat/completions
REFERENCE

Every Moonshot AI variant with pricing and model IDs

Full pricing table →
Model Variant Billing From
Kimi
kimi-k2.5 1K tokens $0.020 View →
kimi-k2.6 1K tokens $0.020 View →
FAQ

Frequently asked questions about Moonshot AI

Is this an official Moonshot AI integration?

RunAPI exposes a managed API surface with transparent per-call pricing, fully documented capability and parameters, and clear error behavior. You get the same model output quality without managing a direct provider relationship or provider-side account.

Do I need a Moonshot AI account?

No. Your RunAPI API key is enough for managed access to all Moonshot AI models. You do not need to create a separate account, manage provider-specific credentials, or handle provider-side billing.

What's the latency overhead from proxying through RunAPI?

Typically under 20 ms. RunAPI keeps the proxy layer close to model execution regions to minimize added latency. Media generation time is dominated by the model itself, not the proxy.

Are images / videos cached?

Generated outputs are stored and retrievable by task ID. You can fetch completed results at any time using the task status endpoint or the RunAPI dashboard. Output URLs remain accessible for the retention period shown in the API docs. Inputs are not cached or stored.

Can I bring my own key?

Not currently. Calls use RunAPI-managed access, which simplifies authentication and lets RunAPI handle rate limiting, retries, and billing consolidation on your behalf.

How is billing consolidated?

All API calls across all providers appear on a single monthly USD invoice. There is no per-provider billing, no subscription, and no minimum spend. Failed generations are never charged.

What SDKs can I use with Moonshot AI models?

Official SDKs are available for Python, Node.js, PHP, Java, Ruby, and Go. Each SDK handles authentication, async task polling, and typed responses. For LLM models, the OpenAI and Anthropic SDKs also work by pointing the base URL to RunAPI.

What are model skills and how do they work?

Model skills are installable packages that load a model's docs, typed schemas, pricing notes, and setup steps directly into your coding workspace. Install a skill with one command and your agent has the right context before you write integration code. Skills work with Claude Code, Codex, Gemini CLI, Cursor, and VS Code.

How do I switch between Moonshot AI models?

Change the model parameter in your API request. All Moonshot AI models share the same API key, the same request shape, and the same billing. No code changes, no re-authentication, and no separate billing setup are required when switching between models or between variants of the same model. You can also switch to models from other providers by changing the same parameter — the API surface is unified across the entire catalog.

START NOW

Start building with Moonshot AI models.