Text · DeepSeek

DeepSeek API

DeepSeek API access via RunAPI — flash for fast, low-cost work; pro for complex agentic tasks.

Operational · 2 variants · from $0.010

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions
POST /v1/responses
POST /v1/messages
POST /v1beta/models/{model}:generateContent
POST /v1beta/models/{model}:streamGenerateContent

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "deepseek-v4-flash",
  "max_tokens": 1024,
  "messages": [
    {
      "role": "user",
      "content": "Refactor this Python module for readability, explain each change, then add unit tests for the edge cases."
    }
  ]
}'

from openai import OpenAI

client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key="your-runapi-key"
)

response = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Refactor this Python module for readability, explain each change, then add unit tests for the edge cases."}]
)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "deepseek-v4-flash",
  messages: [{ role: "user", content: "Refactor this Python module for readability, explain each change, then add unit tests for the edge cases." }]
});

https://runapi.ai 5 endpoints

OVERVIEW

DeepSeek is a family of reasoning-first language models. deepseek-v4-flash is the fast, low-cost tier with an optional thinking mode; deepseek-v4-pro targets the hardest reasoning and agentic workloads. Both are available through RunAPI with one key and per-token billing.

Multiple variants for different speed, quality, and cost tiers
Model skill includes docs, schemas, pricing, and setup notes
Works with Claude Code, Codex, Gemini CLI, Cursor, and VS Code
Single API key and unified billing across all variants
Async task management with polling and webhook callbacks
Failed generations are not charged

VARIANTS

Compare all API variants

Variant	Billing	From
deepseek-v4-flash	1K tokens	$0.010	View →
deepseek-v4-pro	1K tokens	$0.010	View →

API

DeepSeek API endpoints

Use the OpenAI or Anthropic SDK with your RunAPI key. No extra SDK required.

Endpoint	Protocol
/v1/chat/completions	OpenAI compatible
/v1/responses	OpenAI Responses
/v1/messages	Anthropic compatible
/v1beta/models/{model}:generateContent	Gemini generateContent
/v1beta/models/{model}:streamGenerateContent	Gemini streamGenerateContent

OpenClaw

Use DeepSeek in OpenClaw

Prompt, curl command, parameters, and FAQ.

Hermes Agent

Use DeepSeek in Hermes Agent

Custom provider setup, API call, and FAQ.

HOW IT WORKS

From model skill to first result in four steps

Choose a model

Browse the model catalog and pick the model and variant that match your output type, quality bar, and latency target. Each variant page shows its model ID, pricing, and parameter constraints so you can compare before committing.

Configure

Set your RunAPI API key as an environment variable and install the model skill in your coding workspace. The skill loads docs, typed schemas, pricing notes, and setup steps so your agent has the right context from the start.

Call

Use the skill instructions to add the model feature inside your application. Send a POST request with your prompt, model ID, and parameters. RunAPI routes the request, manages the async lifecycle, and returns structured JSON.

Receive

Poll by task ID for completion, stream results end-to-end when supported, or configure a webhook callback URL to receive results automatically. The CLI provides a built-in wait command, and the SDKs offer both polling and callback patterns.

CONTEXT

What is the DeepSeek API?

DeepSeek models are large language models tuned for reasoning, code, and agentic tool use. Through RunAPI they share a single API key with pay-as-you-go token billing, and are callable from both the OpenAI and Anthropic SDKs.

Provider

DeepSeek

Modality

Text

WHY RUNAPI

Why route the DeepSeek API through RunAPI

One auth, every provider

A single RunAPI API key unlocks the whole model catalog across all providers. No separate accounts to create, no API keys to rotate per integration, and no credential management overhead. Add a new model to your app by changing one parameter.

Unified pricing & billing

Per-call pricing in USD, billed monthly into a single invoice. No subscription tiers, no minimum spend, and failed generations are never charged. The pricing page and check_pricing API show exact costs before you commit to a model.

Schema-first SDK

Typed schemas, parameter constraints, and setup notes are packaged in the model skill so your implementation starts from the right contract. The skill loads into Claude Code, Codex, Gemini CLI, Cursor, and VS Code — your agent knows the correct request shape before you write a line of code.

FAQ