VARIANTE · Google / Gemini Omni

Gemini Omni gemini-omni-audio API

A model variant exposed through RunAPI's unified AI API.

Operational · audio_music · Commercial use supported
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/gemini-omni -g
The -g flag installs globally so every project picks it up.
Or paste this prompt to your AI agent:
Install the Gemini Omni skill for me:

1. Clone https://github.com/runapi-ai/gemini-omni
2. Copy the skills/gemini-omni/ directory into your
   user-level skills directory (e.g. ~/.claude/skills/
   for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
Switch variant
ÜBERBLICK

Gemini Omni gemini-omni-audio is available through the same RunAPI auth, SDKs, and agent skill workflow.

  • Unified endpoint
  • SDK snippets
  • Agent install path
  • Fehlgeschlagene Generierungen werden nicht berechnet
PREISE

PREISE

Fehlgeschlagene Generierungen werden nicht berechnet
Create audio
Free / track
DATENBLATT

DATENBLATT

Model ID gemini-omni-audio
Anbieter Google
Modalität audio_music
Task type synchronous
Abrechnung call
Endpoint /api/v1/gemini_omni/create_audio
Commercial Yes
Status Operational
QUICKSTART

Quickstart — gemini-omni-audio

runapi.ai
curl -X POST https://runapi.ai/api/v1/gemini_omni/create_audio \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "audio_id": "achernar",
  "name": "Acher Narrator",
  "voice_description": "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
  "example_dialogue": "Hello, I am achernar"
}'
import { GeminiOmniClient } from "@runapi.ai/gemini-omni";

const client = new GeminiOmniClient();
const result = await client.createAudio.run({
    audio_id: "achernar",
    name: "Acher Narrator",
    voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
    example_dialogue: "Hello, I am achernar",
});
require "runapi/gemini_omni"

client = RunApi::GeminiOmni::Client.new
result = client.create_audio.run(
    audio_id: "achernar",
    name: "Acher Narrator",
    voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
    example_dialogue: "Hello, I am achernar"
)
@runapi.ai/gemini-omni v1
SO FUNKTIONIERT ES

How to use gemini-omni-audio

01

Choose endpoint

Pick the endpoint and copy the SDK snippet.

02

Pass model ID

Use this variant ID in the request body.

03

Run task

Submit the request and store the returned task ID.

04

Collect output

Poll or receive the callback when the task finishes.

UNTERSCHIEDE

How gemini-omni-audio compares

VS GEMINI-OMNI-CHARACTER

Synchronous reusable voice resource creation from preset voices

Synchronous reusable character resource creation from one reference image

VS GEMINI-OMNI-TEXT-TO-VIDEO

Synchronous reusable voice resource creation from preset voices

Prompted multimodal video with image, audio, character, and source-clip references

ANWENDUNGSFÄLLE

Where to use this variant

Music generation

Create tracks and audio assets.

Voice workflows

Build speech and audio pipelines.

Agent creation

Expose audio tools to agents.

FAQ

Frequently asked questions about gemini-omni-audio

How do I select gemini-omni-audio?

Pass the model ID shown in the quickstart.

Is pricing usage-based?

Yes. Pricing is metered per call or unit.

JETZT STARTEN

Start with Gemini Omni today.