Gemini gemini-3.5-flash API
A model variant exposed through RunAPI's unified AI API.
Operational
·
text
·
Commercial use supported
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/gemini -g
Or paste this prompt to your AI agent:
Install the Gemini skill for me: 1. Clone https://github.com/runapi-ai/gemini 2. Copy the skills/gemini/ directory into your user-level skills directory (e.g. ~/.claude/skills/ for Claude Code, ~/.codex/skills/ for Codex). 3. Verify that SKILL.md is present. 4. Confirm the install path when done.
Switch variant
ÜBERBLICK
Gemini gemini-3.5-flash is available through the same RunAPI auth, SDKs, and agent skill workflow.
- Unified endpoint
- SDK snippets
- Agent install path
- Fehlgeschlagene Generierungen werden nicht berechnet
PREISE
PREISE
Fehlgeschlagene Generierungen werden nicht berechnet
Generate content stream
Input
$0.75
/ 1M tokens
Output
$4.50
/ 1M tokens
DATENBLATT
DATENBLATT
| Model ID | gemini-3.5-flash |
| Anbieter | |
| Modalität | text |
| Task type | synchronous |
| Abrechnung | 1K tokens |
| Endpoint | /v1beta/models/gemini-3.5-flash:streamGenerateContent |
| Commercial | Yes |
| Status | Operational |
QUICKSTART
Quickstart — gemini-3.5-flash
curl -X POST https://runapi.ai/v1beta/models/gemini-3.5-flash:streamGenerateContent \
-H "Authorization: Bearer $RUNAPI_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.5-flash",
"contents": [
{
"parts": [
{
"text": "Analyze this codebase and suggest three performance improvements with before/after examples."
}
]
}
]
}'
import { GeminiClient } from "@runapi.ai/gemini";
const client = new GeminiClient();
const result = await client.streamGenerateContent.run({
model: "gemini-3.5-flash",
contents: [{"parts":[{"text":"Analyze this codebase and suggest three performance improvements with before/after examples."}]}],
});
require "runapi/gemini"
client = RunApi::Gemini::Client.new
result = client.stream_generate_content.run(
model: "gemini-3.5-flash",
contents: [{parts: [{text: "Analyze this codebase and suggest three performance improvements with before/after examples."}]}]
)
SO FUNKTIONIERT ES
How to use gemini-3.5-flash
01
Choose endpoint
Pick the endpoint and copy the SDK snippet.
02
Pass model ID
Use this variant ID in the request body.
03
Run task
Submit the request and store the returned task ID.
04
Collect output
Poll or receive the callback when the task finishes.
UNTERSCHIEDE
How gemini-3.5-flash compares
VS GEMINI-2.5-FLASH
Fast multimodal streaming for high-volume production workloads
Speed/cost optimized; 1M context; older generation baseline
VS GEMINI-2.5-PRO
Fast multimodal streaming for high-volume production workloads
Best reasoning in 2.5 gen; 1M context
VS GEMINI-3-FLASH-PREVIEW
Fast multimodal streaming for high-volume production workloads
gemini-3-flash-preview
ANWENDUNGSFÄLLE
Where to use this variant
Chat
Use LLMs for chat and reasoning.
Code
Generate and review implementation work.
Automation
Connect models into backend tasks.
FAQ
Frequently asked questions about gemini-3.5-flash
How do I select gemini-3.5-flash?
Pass the model ID shown in the quickstart.
Is pricing usage-based?
Yes. Pricing is metered per call or unit.
Other Gemini variants
gemini-2.5-flash
Cheapest
2.5-flash
$0.030 / 1K tokens
gemini-3-flash-preview
Fast
3-flash-preview
$0.030 / 1K tokens
gemini-2.5-pro
Quality
2.5-pro
$0.080 / 1K tokens
gemini-3-pro-preview
Quality
3-pro-preview
$0.100 / 1K tokens
gemini-3.1-pro-preview
Quality
3.1-pro-preview
$0.100 / 1K tokens
Related models
JETZT STARTEN