Gemini Omni
Voice, character, and multimodal video generation resources for narration, dialogue, and agent media workflows.
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/gemini-omni -g
Install the Gemini Omni skill for me:
1. Clone https://github.com/runapi-ai/gemini-omni
2. Copy the skills/gemini-omni/ directory into your user-level skills directory (e.g. ~/.claude/skills/ for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
Gemini Omni creates reusable voice resources, reusable character resources, and multimodal videos that can combine prompts, reference images, audio IDs, character IDs, and a source video clip.
- Installable agent skill
- Unified API key
- SDK examples included
- Failed generations are not billed
Variants
Use this model from your agent
# User prompt to the agent
"Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks."
// Code generated by the agent via @runapi.ai/gemini-omni
import { GeminiOmniClient } from '@runapi.ai/gemini-omni';

const client = new GeminiOmniClient();
const result = await client.createAudio.run({
  model: 'gemini-omni-audio',
  prompt: 'Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.',
});
How to call this model
Install the skill
Add the model skill to your agent or install the SDK package.
Authenticate once
Use your RunAPI key for every supported model.
Send a request
Use the unified schema for the endpoint you need.
Receive output
Poll by task ID or handle the callback when the generation completes.
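The "Receive output" step can be sketched as a small polling loop. This is a minimal illustration, not the RunAPI SDK's own API: the page does not name the status lookup call or the response shape, so the `TaskSnapshot` type, its status values, and the `pollTask` helper are all hypothetical. The caller supplies the actual lookup as `fetchTask`.

```typescript
// Hypothetical task shape; the page only says tasks are polled by ID.
type TaskStatus = "pending" | "completed" | "failed";

interface TaskSnapshot<T> {
  status: TaskStatus;
  output?: T;
  error?: string;
}

interface PollOptions {
  intervalMs: number; // delay between two polls
  maxAttempts: number; // give up after this many polls
}

// Calls fetchTask until the task leaves "pending" or the attempts run out.
async function pollTask<T>(
  fetchTask: () => Promise<TaskSnapshot<T>>,
  { intervalMs, maxAttempts }: PollOptions,
): Promise<T> {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const task = await fetchTask();
    if (task.status === "completed") {
      if (task.output === undefined) {
        throw new Error("Task completed without output");
      }
      return task.output;
    }
    if (task.status === "failed") {
      throw new Error(task.error ?? "Generation failed");
    }
    // Still pending: wait before the next poll, unless this was the last one.
    if (attempt < maxAttempts) {
      await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Task still pending after ${maxAttempts} attempts`);
}
```

In practice `fetchTask` would wrap whatever task lookup the SDK or REST API exposes for a given task ID. A callback handler avoids the loop entirely, at the cost of hosting an endpoint that RunAPI can reach.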
SDK quickstart
cURL
curl -X POST https://runapi.ai/api/v1/gemini_omni/create_audio \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "audio_id": "achernar",
    "name": "Acher Narrator",
    "voice_description": "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
    "example_dialogue": "Hello, I am achernar"
  }'
TypeScript
import { GeminiOmniClient } from "@runapi.ai/gemini-omni";

const client = new GeminiOmniClient();
const result = await client.createAudio.run({
  audio_id: "achernar",
  name: "Acher Narrator",
  voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
  example_dialogue: "Hello, I am achernar",
});
Ruby
require "runapi/gemini_omni"

client = RunApi::GeminiOmni::Client.new
result = client.create_audio.run(
  audio_id: "achernar",
  name: "Acher Narrator",
  voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
  example_dialogue: "Hello, I am achernar"
)
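For callers who want to skip the SDK, the create_audio request from the quickstart can also be assembled by hand. The sketch below takes its URL, headers, and field names from the curl example on this page. The `CreateAudioPayload` and `HttpRequest` types and the `buildCreateAudioRequest` helper are hypothetical names introduced for illustration, and treating all four fields as required is an assumption.

```typescript
// Field names copied from the create_audio quickstart; required-ness is assumed.
interface CreateAudioPayload {
  audio_id: string;
  name: string;
  voice_description: string;
  example_dialogue: string;
}

interface HttpRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Endpoint taken from the curl example on this page.
const CREATE_AUDIO_URL = "https://runapi.ai/api/v1/gemini_omni/create_audio";

// Builds the request without sending it, so it can be inspected or logged first.
function buildCreateAudioRequest(
  apiKey: string,
  payload: CreateAudioPayload,
): HttpRequest {
  if (apiKey.trim() === "") {
    throw new Error("RunAPI key is required");
  }
  return {
    url: CREATE_AUDIO_URL,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(payload),
  };
}

const request = buildCreateAudioRequest("test-key", {
  audio_id: "achernar",
  name: "Acher Narrator",
  voice_description: "A calm narrator voice.",
  example_dialogue: "Hello, I am achernar",
});
```

The resulting object can be handed to any HTTP client, for example `fetch(request.url, { method: request.method, headers: request.headers, body: request.body })`. Keeping construction separate from sending makes the payload easy to unit test.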
Where Gemini Omni fits
Gemini Omni belongs to the Google catalog on RunAPI and shares the same SDK package, CLI namespace, and billing surfaces across audio, character, and video variants.
Why use Gemini Omni through RunAPI
One API key
Use the same credentials across models and providers.
Agent-ready
Installable skills include docs and schema for tool calls.
Predictable billing
Usage-based pricing is visible before you call.
Frequently asked questions
How do I call this model?
Use the RunAPI SDK, CLI, or REST endpoint shown on this page.
Do failed generations cost money?
No. Failed generations are not billed.
Can agents use it directly?
Yes. Install the model skill and your agent gets the docs, schema, and examples.