Image · Google

Gemini Omni

Voice, character, and multimodal video generation resources for narration, dialogue, and agent media workflows.

Operational · 3 variants · from $0.0000
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/gemini-omni -g
The -g flag installs globally so every project picks it up.
Or paste this prompt to your AI agent:
Install the Gemini Omni skill for me:

1. Clone https://github.com/runapi-ai/gemini-omni
2. Copy the skills/gemini-omni/ directory into your
   user-level skills directory (e.g. ~/.claude/skills/
   for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
OVERVIEW

Gemini Omni creates reusable voice resources, reusable character resources, and multimodal videos that can combine prompts, reference images, audio IDs, character IDs, and a source video clip.

  • Installable agent skill
  • Unified API key
  • SDK examples included
  • Failed generations are not billed
VARIANTS

Variants

Variant                      Billing    From
gemini-omni-audio            per call   $0.0000
gemini-omni-character        per call   $0.0000
gemini-omni-text-to-video    per call   $3.60
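
As a rough illustration of how these starting prices add up, the sketch below totals a planned batch of calls. It assumes the Billing column means a flat charge per call and that the "From" figure is a lower bound. The helper name estimateMinimumCostUsd is hypothetical and is not part of the RunAPI SDK.

```typescript
// Per-call starting prices copied from the variants table.
const PRICE_PER_CALL_USD: Record<string, number> = {
  "gemini-omni-audio": 0.0,
  "gemini-omni-character": 0.0,
  "gemini-omni-text-to-video": 3.6,
};

// Hypothetical helper: sums the "from" prices for a planned batch of calls.
// Actual charges may be higher, since the table lists starting prices only.
function estimateMinimumCostUsd(calls: Record<string, number>): number {
  let total = 0;
  for (const [variant, count] of Object.entries(calls)) {
    const price = PRICE_PER_CALL_USD[variant];
    if (price === undefined) {
      throw new Error(`Unknown variant: ${variant}`);
    }
    total += price * count;
  }
  // Round to cents to avoid floating-point noise.
  return Math.round(total * 100) / 100;
}
```

For example, one voice, one character, and two videos would be passed as { "gemini-omni-audio": 1, "gemini-omni-character": 1, "gemini-omni-text-to-video": 2 }.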
AGENT USAGE

Use this model from your agent

Prompt + SDK example
# User prompt to the agent
"Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks."
// Code generated by the agent via @runapi.ai/gemini-omni
import { GeminiOmniClient } from '@runapi.ai/gemini-omni';

const client = new GeminiOmniClient();
const result = await client.createAudio.run({
  model: 'gemini-omni-audio',
  prompt: 'Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.',
});
@runapi.ai/gemini-omni npm
HOW IT WORKS

How to call this model

01

Install the skill

Add the model skill to your agent or install the SDK package.

02

Authenticate once

Use your RunAPI key for every supported model.

03

Send a request

Use the unified schema for the endpoint you need.

04

Receive output

Poll by task ID or handle the callback when the generation completes.
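
The polling option in step 04 could look roughly like the sketch below. The page does not document the task lookup call or the status values, so the Task shape, the status names, and the pollTask helper are all assumptions. The lookup itself is passed in as a function so that any real client call can be substituted.

```typescript
// Hypothetical task shape: the status values and field names are assumptions.
type TaskStatus = "pending" | "completed" | "failed";

interface Task {
  id: string;
  status: TaskStatus;
}

// Polls until the task leaves "pending" or the attempt budget runs out.
// `fetchTask` is injected so the caller decides how a task is looked up.
async function pollTask(
  taskId: string,
  fetchTask: (id: string) => Promise<Task>,
  intervalMs = 2000,
  maxAttempts = 30,
): Promise<Task> {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const task = await fetchTask(taskId);
    if (task.status !== "pending") return task;
    if (attempt < maxAttempts) {
      // Wait before the next lookup.
      await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Task ${taskId} still pending after ${maxAttempts} attempts`);
}
```

A fixed interval keeps the sketch short. For long video generations, a callback is likely the better fit, since it avoids repeated lookups.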

SDK QUICKSTART

SDK quickstart

runapi.ai
# REST (curl)
curl -X POST https://runapi.ai/api/v1/gemini_omni/create_audio \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "audio_id": "achernar",
  "name": "Acher Narrator",
  "voice_description": "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
  "example_dialogue": "Hello, I am achernar"
}'

// TypeScript
import { GeminiOmniClient } from "@runapi.ai/gemini-omni";

const client = new GeminiOmniClient();
const result = await client.createAudio.run({
    audio_id: "achernar",
    name: "Acher Narrator",
    voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
    example_dialogue: "Hello, I am achernar",
});

# Ruby
require "runapi/gemini_omni"

client = RunApi::GeminiOmni::Client.new
result = client.create_audio.run(
    audio_id: "achernar",
    name: "Acher Narrator",
    voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
    example_dialogue: "Hello, I am achernar"
)
@runapi.ai/gemini-omni v1
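
For environments where the SDK is not installed, the curl call above can be approximated with a plain HTTP request. The sketch below only builds the request. The endpoint, headers, and body fields are copied from the curl example, while the buildCreateAudioRequest helper and the CreateAudioBody type are hypothetical names introduced for illustration.

```typescript
// Request body fields as they appear in the curl example.
interface CreateAudioBody {
  audio_id: string;
  name: string;
  voice_description: string;
  example_dialogue: string;
}

// Endpoint taken from the curl example.
const CREATE_AUDIO_URL = "https://runapi.ai/api/v1/gemini_omni/create_audio";

// Hypothetical helper: builds the URL and fetch options without sending anything.
function buildCreateAudioRequest(apiKey: string, body: CreateAudioBody) {
  if (!apiKey) throw new Error("Missing RunAPI key");
  return {
    url: CREATE_AUDIO_URL,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify(body),
    },
  };
}
```

The returned pair can be passed straight to fetch(url, init), with the key read from the same RUNAPI_KEY environment variable the curl example uses. The response format is not shown on this page, so handling it is left out.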
CONTEXT

Where Gemini Omni fits

Gemini Omni belongs to the Google catalog on RunAPI and shares the same SDK package, CLI namespace, and billing surfaces across audio, character, and video variants.

Provider
Google
Modality
Image
WHY RUNAPI

Why use Gemini Omni through RunAPI

One API key

Use the same credentials across models and providers.

Agent-ready

Installable skills include docs and schema for tool calls.

Predictable billing

Usage-based pricing is visible before you call.

FAQ

Frequently asked questions

How do I call this model?

Use the RunAPI SDK, CLI, or REST endpoint shown on this page.

Do failed generations cost money?

No. Failed generations are not billed.

Can agents use it directly?

Yes. Install the model skill and your agent gets the docs, schema, and examples.

GET STARTED

Start building with Gemini Omni.