---
title: "MiniMax API — Variants, pricing & model skill | RunAPI"
url: "https://runapi.ai/models/minimax.md"
canonical: "https://runapi.ai/models/minimax.md"
locale: "en"
model: "MiniMax"
provider: "MiniMax"
modality: "text"
variant_count: 7
price_from_cents: 1
---

# MiniMax API

MiniMax text API access via RunAPI — 230B MoE models from 200K to 1M context, up to 80.5% SWE-bench Verified.

**Provider:** MiniMax
**Modality:** Text
**Catalog:** 7 variants

MiniMax&#39;s M-series are sparse Mixture-of-Experts text models (230B total / ~10B active, 256 experts) built for cost-efficient coding. M2 through M2.7 offer 200K context with progressively stronger agentic capabilities — M2.7 reaches 56.2% on SWE-bench Pro. MiniMax-M3 restores 1M context with a new Sparse Attention architecture, scoring 80.5% on SWE-bench Verified and 59.0% on SWE-bench Pro. Highspeed variants run the same weights at ~100 tokens/sec for latency-sensitive work. All are available through RunAPI with one key and per-token billing.

## Variants

| Version | Variant | Pricing | Billing | URL |
|---|---|---|---|---|
| MiniMax-M2 | `m2` | $0.010 | 1K tokens | https://runapi.ai/models/minimax/m2.md |
| MiniMax-M2.1 | `m2.1` | $0.010 | 1K tokens | https://runapi.ai/models/minimax/m2.1.md |
| MiniMax-M2.5 | `m2.5` | $0.010 | 1K tokens | https://runapi.ai/models/minimax/m2.5.md |
| MiniMax-M2.5-highspeed | `m2.5-highspeed` | $0.020 | 1K tokens | https://runapi.ai/models/minimax/m2.5-highspeed.md |
| MiniMax-M2.7 | `m2.7` | $0.010 | 1K tokens | https://runapi.ai/models/minimax/m2.7.md |
| MiniMax-M2.7-highspeed | `m2.7-highspeed` | $0.020 | 1K tokens | https://runapi.ai/models/minimax/m2.7-highspeed.md |
| MiniMax-M3 | `m3` | $0.020 | 1K tokens | https://runapi.ai/models/minimax/m3.md |


## API endpoints

Base URL: `https://runapi.ai`

- `POST /v1/chat/completions`

Use the OpenAI or Anthropic SDK with your RunAPI API key. No extra SDK required.

## Context

MiniMax M-series text models are 230B MoE LLMs with 200K–1M context, delivering frontier coding scores at a fraction of the cost of dense models. Through RunAPI they share a single API key with pay-as-you-go token billing, callable from the OpenAI Chat Completions, OpenAI Responses, and Anthropic Messages surfaces. These are MiniMax&#39;s text models, distinct from MiniMax Hailuo video generation.

## FAQ

### Which variant should I start with?

Pick the cheapest variant that meets your quality bar. Most teams start on the fast variant and graduate to pro for production.

### Is there a free tier?

New accounts get free first calls on every model. After that, pay per call.

### Do you stream results?

Where streaming is available, RunAPI streams end-to-end.

### How are failures billed?

Failed generations are not charged.

### Are outputs cached?

Generated outputs are stored and retrievable by task ID. Inputs are not cached.

### Can I use commercially?

Yes — commercial use is included for every variant unless a model license explicitly restricts it, which is called out on the variant page.

### What about rate limits?

Per-key rate limits scale with usage tier. See pricing page for current limits.

### Where can I report issues?

Open an issue on the public GitHub repo or email support.