Gemini Omni gemini-omni-text-to-video API
Same API, same SDK — switch variants by changing one parameter.
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/gemini-omni -g
Install the Gemini Omni skill for me:
1. Clone https://github.com/runapi-ai/gemini-omni
2. Copy the skills/gemini-omni/ directory into your user-level skills directory (e.g. ~/.claude/skills/ for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
gemini-omni-text-to-video balances output quality against per-call cost within the Gemini Omni family.
- Pay-per-call pricing in USD
- Failed generations not charged
- Streaming when supported by the model
- Schema-validated tool calls
Pricing
Technical details
| Model ID | gemini-omni-text-to-video |
| Provider | |
| Modality | video |
| Task type | asynchronous |
| Billing unit | call |
| API endpoint | /api/v1/gemini_omni/text_to_video |
| Commercial license | Yes — included via API |
| Status | Operational |
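Because the task type is asynchronous, a client typically submits a request and then waits for the task to reach a terminal state. The sketch below shows one way to poll for completion. It is not RunAPI's documented flow: the `TaskStatus` values, the `TaskSnapshot` shape, and the `fetchStatus` callback are hypothetical stand-ins, since this page does not specify a status endpoint or its response fields.

```typescript
// Hypothetical task states; the real API may use different names.
type TaskStatus = "pending" | "running" | "succeeded" | "failed";

// Hypothetical shape of a task status response.
interface TaskSnapshot {
  status: TaskStatus;
  outputUrl?: string;
}

// Poll a caller-supplied status function until the task reaches a terminal
// state or the attempt budget runs out.
async function pollUntilDone(
  fetchStatus: () => Promise<TaskSnapshot>,
  maxAttempts = 30,
  intervalMs = 2000,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<TaskSnapshot> {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const snapshot = await fetchStatus();
    if (snapshot.status === "succeeded" || snapshot.status === "failed") {
      return snapshot;
    }
    // Wait between attempts, but not after the final one.
    if (attempt < maxAttempts) {
      await sleep(intervalMs);
    }
  }
  throw new Error(`Task still not finished after ${maxAttempts} attempts`);
}
```

Passing `fetchStatus` in as a function keeps the loop independent of whichever status call the SDK or HTTP API actually exposes. A webhook callback, where available, avoids polling altogether.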
Quickstart — gemini-omni-text-to-video
curl -X POST https://runapi.ai/api/v1/gemini_omni/text_to_video \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-omni-text-to-video",
    "prompt": "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks."
  }'

import { GeminiOmniClient } from "@runapi.ai/gemini-omni";

const client = new GeminiOmniClient();
const result = await client.textToVideo.run({
  model: "gemini-omni-text-to-video",
  prompt: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
});

require "runapi/gemini_omni"

client = RunApi::GeminiOmni::Client.new
result = client.text_to_video.run(
  model: "gemini-omni-text-to-video",
  prompt: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks."
)
Use gemini-omni-text-to-video in four steps
Install
Install the model SDK or agent skill for this model line.
Configure
Set the model field to the full model ID shown on this page.
Call
Send a typed request with your prompt, inputs, and callback settings.
Receive
Read the task response, webhook callback, or cached output URL from RunAPI.
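The Call and Receive steps can be sketched as two small helpers: one that assembles the HTTP request and one that reads the output URL from a task payload. The endpoint, model ID, and Bearer header come from this page. The `callback_url` field and the `TaskResult` shape (`status`, `output_url`) are assumed names for illustration and may not match the real schema.

```typescript
const BASE_URL = "https://runapi.ai";
const ENDPOINT = "/api/v1/gemini_omni/text_to_video";
const MODEL_ID = "gemini-omni-text-to-video";

interface TextToVideoBody {
  model: string;
  prompt: string;
  callback_url?: string; // assumed field name for the webhook target
}

interface PreparedRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Assumed shape of a finished task payload (webhook or task response).
interface TaskResult {
  status: string;
  output_url?: string;
}

// Build the request shown in the Quickstart, optionally with a callback.
function prepareTextToVideo(
  apiKey: string,
  prompt: string,
  callbackUrl?: string,
): PreparedRequest {
  const body: TextToVideoBody = { model: MODEL_ID, prompt };
  if (callbackUrl !== undefined) {
    body.callback_url = callbackUrl;
  }
  return {
    url: BASE_URL + ENDPOINT,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(body),
  };
}

// Return the output URL when the payload carries one, otherwise null.
function readOutputUrl(payload: TaskResult): string | null {
  return typeof payload.output_url === "string" && payload.output_url.length > 0
    ? payload.output_url
    : null;
}
```

The prepared object maps directly onto `fetch(req.url, { method: req.method, headers: req.headers, body: req.body })`. Keeping request construction separate from the network call makes the payload easy to inspect before it is sent.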
What's different about gemini-omni-text-to-video
Prompted multimodal video with image, audio, character, and source-clip references
Synchronous reusable voice resource creation from preset voices
Synchronous reusable character resource creation from one reference image
Best for
Ad & social content
Generate product launch clips and short-form ads from a text brief, cutting production from weeks to hours.
E-learning
Convert lesson scripts into animated explainer videos at scale without a camera or crew.
Creator workflows
Produce short-form content for social platforms directly from a prompt.
Frequently asked questions about gemini-omni-text-to-video
Is the model ID stable across versions?
RunAPI keeps the model ID stable and handles compatible version refreshes without changing your request shape.
What's the rate limit on this variant?
Per-key rate limits scale with your usage tier. See the pricing page for current limits.
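When a key reaches its limit, clients commonly back off and retry. A minimal sketch of an exponential backoff schedule follows. It assumes rate-limited calls are signalled with HTTP 429, which this page does not confirm, and the base delay, cap, and retry count are arbitrary example values.

```typescript
// Delay before retry number `attempt` (0-based): baseMs * 2^attempt, capped at capMs.
function backoffDelayMs(attempt: number, baseMs = 500, capMs = 8000): number {
  return Math.min(capMs, baseMs * 2 ** attempt);
}

// Assumption: status 429 marks a rate-limited request that is worth retrying.
function shouldRetry(httpStatus: number, attempt: number, maxRetries = 5): boolean {
  return httpStatus === 429 && attempt < maxRetries;
}
```

A caller would check `shouldRetry` after each response and wait `backoffDelayMs(attempt)` milliseconds before resending the same request.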
Can I switch variants later?
Yes. The variant is selected by the model parameter, so switching means changing that one value.
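Since the variant is chosen by the `model` field alone, switching can be sketched as copying a request with a different model ID. The second ID below, `gemini-omni-example-variant`, is a placeholder and not a real model name. Check the Gemini Omni model list for actual IDs.

```typescript
interface VideoRequest {
  model: string;
  prompt: string;
}

// Return a copy of the request that targets a different variant.
// Every field except `model` is carried over unchanged.
function withModel(request: VideoRequest, model: string): VideoRequest {
  return { ...request, model };
}

const baseRequest: VideoRequest = {
  model: "gemini-omni-text-to-video",
  prompt: "A neon city tracking shot in the rain.",
};

// Placeholder ID for illustration only.
const switchedRequest = withModel(baseRequest, "gemini-omni-example-variant");
```

Returning a copy rather than mutating the original keeps the base request reusable when the same prompt is sent to several variants for comparison.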
Does it stream?
Where streaming is available, RunAPI streams end-to-end.
Where do I report quality issues?
Open an issue on the public GitHub repo or email support.