Gemini Omni gemini-omni-audio API
A model variant exposed through RunAPI's unified AI API.
Operational
·
audio_music
·
Commercial use supported
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/gemini-omni -g
Or paste this prompt to your AI agent:
Install the Gemini Omni skill for me: 1. Clone https://github.com/runapi-ai/gemini-omni 2. Copy the skills/gemini-omni/ directory into your user-level skills directory (e.g. ~/.claude/skills/ for Claude Code, ~/.codex/skills/ for Codex). 3. Verify that SKILL.md is present. 4. Confirm the install path when done.
Switch variant
개요
Gemini Omni gemini-omni-audio is available through the same RunAPI auth, SDKs, and agent skill workflow.
- Unified endpoint
- SDK snippets
- Agent install path
- 실패한 생성은 과금되지 않습니다
요금
요금
실패한 생성은 과금되지 않습니다
Create audio
Free
/ track
스펙
스펙
| Model ID | gemini-omni-audio |
| 제공사 | |
| 모달리티 | audio_music |
| Task type | synchronous |
| 과금 | call |
| Endpoint | /api/v1/gemini_omni/create_audio |
| Commercial | Yes |
| Status | Operational |
빠른 시작
Quickstart — gemini-omni-audio
curl -X POST https://runapi.ai/api/v1/gemini_omni/create_audio \
-H "Authorization: Bearer $RUNAPI_KEY" \
-H "Content-Type: application/json" \
-d '{
"audio_id": "achernar",
"name": "Acher Narrator",
"voice_description": "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
"example_dialogue": "Hello, I am achernar"
}'
import { GeminiOmniClient } from "@runapi.ai/gemini-omni";
const client = new GeminiOmniClient();
const result = await client.createAudio.run({
audio_id: "achernar",
name: "Acher Narrator",
voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
example_dialogue: "Hello, I am achernar",
});
require "runapi/gemini_omni"
client = RunApi::GeminiOmni::Client.new
result = client.create_audio.run(
audio_id: "achernar",
name: "Acher Narrator",
voice_description: "Create a 1080p neon city tracking shot with a reusable character walking through rain while a calm narrator speaks.",
example_dialogue: "Hello, I am achernar"
)
작동 방식
How to use gemini-omni-audio
01
Choose endpoint
Pick the endpoint and copy the SDK snippet.
02
Pass model ID
Use this variant ID in the request body.
03
Run task
Submit the request and store the returned task ID.
04
Collect output
Poll or receive the callback when the task finishes.
차이
How gemini-omni-audio compares
VS GEMINI-OMNI-CHARACTER
Synchronous reusable voice resource creation from preset voices
Synchronous reusable character resource creation from one reference image
VS GEMINI-OMNI-TEXT-TO-VIDEO
Synchronous reusable voice resource creation from preset voices
Prompted multimodal video with image, audio, character, and source-clip references
사용 사례
Where to use this variant
Music generation
Create tracks and audio assets.
Voice workflows
Build speech and audio pipelines.
Agent creation
Expose audio tools to agents.
FAQ
Frequently asked questions about gemini-omni-audio
How do I select gemini-omni-audio?
Pass the model ID shown in the quickstart.
Is pricing usage-based?
Yes. Pricing is metered per call or unit.
Other Gemini Omni variants
gemini-omni-character
Cheapest
character
$0.0000 / call
gemini-omni-text-to-video
text-to-video
$3.60 / call
Related models
지금 시작