---
title: "Gemini API — All variants, pricing, code samples | RunAPI"
url: "https://runapi.ai/models/gemini.md"
canonical: "https://runapi.ai/models/gemini.md"
locale: "en"
model: "Gemini"
provider: "Google"
modality: "text"
variant_count: 5
price_from_cents: 3
---

# Gemini

Google&#39;s multimodal LLM for chat, code generation, reasoning, and long-context tasks.

**Provider:** Google
**Modality:** Text
**Catalog:** 5 variants

Gemini is Google&#39;s multimodal large language model family supporting text, image, audio, and code understanding. Available in Flash and Pro variants, it handles tasks from quick summarization to deep reasoning and complex coding.

## Variants

| Version | Variant | Pricing | Billing | URL |
|---|---|---|---|---|
| gemini-2.5-flash | `2.5-flash` | $0.030 | 1K tokens | https://runapi.ai/models/gemini/2.5-flash.md |
| gemini-2.5-pro | `2.5-pro` | $0.080 | 1K tokens | https://runapi.ai/models/gemini/2.5-pro.md |
| gemini-3-flash-preview | `3-flash-preview` | $0.030 | 1K tokens | https://runapi.ai/models/gemini/3-flash-preview.md |
| gemini-3-pro-preview | `3-pro-preview` | $0.100 | 1K tokens | https://runapi.ai/models/gemini/3-pro-preview.md |
| gemini-3.1-pro-preview | `3.1-pro-preview` | $0.100 | 1K tokens | https://runapi.ai/models/gemini/3.1-pro-preview.md |


## Install

```sh
npm install @runapi.ai/gemini
# or
gem install runapi-gemini
```

## Quickstart

```sh
curl -X POST https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{ "model": "gemini-2.5-flash", "prompt": "..." }'
```

## Context

Gemini is Google&#39;s flagship multimodal LLM, available in Flash (fast) and Pro (frontier reasoning) variants. Through RunAPI, all Gemini models share the same API shape and billing.

## FAQ

### Which variant should I start with?

Pick the cheapest variant that meets your quality bar. Most teams start on the fast variant and graduate to pro for production.

### Is there a free tier?

New accounts get free first calls on every model. After that, pay per call.

### Do you stream results?

Where streaming is available, RunAPI streams end-to-end.

### How are failures billed?

Failed generations are not charged.

### Are outputs cached?

Generated outputs are stored and retrievable by task ID. Inputs are not cached.

### Can I use commercially?

Yes — commercial use is included for every variant unless a model license explicitly restricts it, which is called out on the variant page.

### What about rate limits?

Per-key rate limits scale with usage tier. See pricing page for current limits.

### Where can I report issues?

Open an issue on the public GitHub repo or email support.