---
title: "DeepSeek API — Variants, pricing & model skill | RunAPI"
url: "https://runapi.ai/models/deepseek.md"
canonical: "https://runapi.ai/models/deepseek.md"
locale: "en"
model: "DeepSeek"
provider: "DeepSeek"
modality: "text"
variant_count: 2
price_from_cents: 1
---

# DeepSeek API

DeepSeek API access via RunAPI — flash for fast, low-cost work; pro for complex agentic tasks.

**Provider:** DeepSeek
**Modality:** Text
**Catalog:** 2 variants

DeepSeek is a family of reasoning-first language models. deepseek-v4-flash is the fast, low-cost tier with an optional thinking mode; deepseek-v4-pro targets the hardest reasoning and agentic workloads. Both are available through RunAPI with one key and per-token billing.

## Variants

| Version | Variant | Pricing | Billing | URL |
|---|---|---|---|---|
| deepseek-v4-flash | `v4-flash` | $0.010 | 1K tokens | https://runapi.ai/models/deepseek/v4-flash.md |
| deepseek-v4-pro | `v4-pro` | $0.010 | 1K tokens | https://runapi.ai/models/deepseek/v4-pro.md |


## API endpoints

Base URL: `https://runapi.ai`

- `POST /v1/messages`
- `POST /v1/chat/completions`

Use the OpenAI or Anthropic SDK with your RunAPI API key. No extra SDK required.

## Context

DeepSeek models are large language models tuned for reasoning, code, and agentic tool use. Through RunAPI they share a single API key with pay-as-you-go token billing, and are callable from both the OpenAI and Anthropic SDKs.

## FAQ

### Which variant should I start with?

Pick the cheapest variant that meets your quality bar. Most teams start on the fast variant and graduate to pro for production.

### Is there a free tier?

New accounts get free first calls on every model. After that, pay per call.

### Do you stream results?

Where streaming is available, RunAPI streams end-to-end.

### How are failures billed?

Failed generations are not charged.

### Are outputs cached?

Generated outputs are stored and retrievable by task ID. Inputs are not cached.

### Can I use commercially?

Yes — commercial use is included for every variant unless a model license explicitly restricts it, which is called out on the variant page.

### What about rate limits?

Per-key rate limits scale with usage tier. See pricing page for current limits.

### Where can I report issues?

Open an issue on the public GitHub repo or email support.