VARIANT · Google / Gemini Omni

Gemini Omni gemini-omni-audio API

A model variant served through RunAPI's unified AI API.

Operational · audio_music · Commercial use allowed

# Install the model skill for app development workflows
npx skills add runapi-ai/gemini-omni -g

Installs docs, schemas, pricing context, and setup notes into your developer workspace.

Or use this setup request in your coding tool:

Install the Gemini Omni skill for this app:

1. Add runapi-ai/gemini-omni with the skills installer.
2. Load SKILL.md in this workspace.
3. Use its docs, schemas, pricing notes, and setup steps when adding model features.
4. Confirm the install path when done.

OVERVIEW

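gemini-omni-audio targets the best balance of quality and cost in the Gemini Omni family.

Pay per call, priced in USD
Failed generations are not charged
Streaming output where the model supports it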

PRICING

Charges

Failed generations are not charged

Create audio

Free / track

Specifications

Technical details

Model ID	gemini-omni-audio
Provider	Google
Modality	audio_music
Task type	synchronous
Billing unit	call
API endpoint	/api/v1/gemini_omni/create_audio
Commercial license	Yes, included via the API
Status	Operational
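
As a rough illustration of how the model ID and endpoint in the table fit together, the sketch below builds (but does not send) an HTTP request using only the Python standard library. The host name, the bearer-token header, and the JSON field names `model` and `prompt` are assumptions made for this example; the installed skill's schemas are the place to check the real request shape.

```python
import json
import urllib.request

# Values taken from the technical details table.
MODEL_ID = "gemini-omni-audio"
ENDPOINT_PATH = "/api/v1/gemini_omni/create_audio"

# ASSUMPTION: placeholder host; replace it with the base URL from your RunAPI account.
BASE_URL = "https://runapi.example"


def build_create_audio_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the create_audio endpoint."""
    # ASSUMPTION: the body field names and bearer-token auth are illustrative only.
    body = json.dumps({"model": MODEL_ID, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url=BASE_URL + ENDPOINT_PATH,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


req = build_create_audio_request("YOUR_API_KEY", "warm lo-fi loop for a podcast intro")
print(req.get_method(), req.full_url)
```

Because the task type is synchronous, a real call would presumably pass `req` to `urllib.request.urlopen` and read the result from the same response.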

SKILLS

Quick start: gemini-omni-audio

Consistent structure · the variant is fixed in the model field

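How it works

Use gemini-omni-audio in four steps

01

Install

Install the model skill for this model family.

02

Configure

Set the model field to the full model ID shown on this page.

03

Call

Send a typed request together with the prompt, inputs, and callback settings.

04

Receive

Read the task response, webhook callback, or cached output URL from RunAPI.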
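
Steps 02 to 04 can be sketched as a small typed request plus a helper that reads the result. Everything about the wire format here is an assumption for illustration: the class name `CreateAudioRequest`, the fields `inputs` and `callback_url`, and the response keys `status` and `output_url` are hypothetical and may not match the RunAPI schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, Optional


@dataclass
class CreateAudioRequest:
    """Hypothetical typed request; field names are assumptions, not the RunAPI schema."""

    prompt: str
    model: str = "gemini-omni-audio"  # step 02: the full model ID from this page
    inputs: Dict[str, Any] = field(default_factory=dict)
    callback_url: Optional[str] = None  # ASSUMPTION: where a webhook callback would go

    def to_json(self) -> str:
        """Serialize the request, leaving out fields that were not set."""
        payload = {k: v for k, v in asdict(self).items() if v is not None}
        return json.dumps(payload)


def extract_output_url(task_response: Dict[str, Any]) -> Optional[str]:
    """Step 04: return the output URL if the task succeeded, otherwise None.

    ASSUMPTION: the response carries 'status' and 'output_url' keys.
    """
    if task_response.get("status") != "succeeded":
        return None
    return task_response.get("output_url")


request = CreateAudioRequest(prompt="calm ambient bed for a product demo")
print(request.to_json())
```

Keeping the request as a dataclass means a missing prompt fails at construction time instead of at the API boundary.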

DIFFERENCES

How gemini-omni-audio differs

VS GEMINI-OMNI-CHARACTER

gemini-omni-audio: Synchronous reusable voice resource creation from preset voices

gemini-omni-character: Synchronous reusable character resource creation from one reference image

VS GEMINI-OMNI-TEXT-TO-VIDEO

gemini-omni-audio: Synchronous reusable voice resource creation from preset voices

gemini-omni-text-to-video: Prompted multimodal video with image, audio, character, and source-clip references

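Use cases

Best for

Podcast and video soundtracks

Generate royalty-free background music that matches the mood of each episode, with no licensing fees.

Game audio

Create adaptive ambient soundscapes and sound effects for procedurally generated levels.

Ad voice-overs and sound effects

Generate custom voice-overs and sound effects for client ads without a recording studio.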

FAQ

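Frequently asked questions about gemini-omni-audio

Does the model ID stay stable across versions?

RunAPI keeps the model ID stable and handles compatible version updates without requiring changes to your request format.

What is the rate limit for this variant?

Rate limits are applied per key and depend on your usage tier. See the pricing page for current limits.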
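
Since limits vary by tier, client code usually benefits from backing off when a limit is hit. The following is a generic exponential-backoff sketch, not RunAPI's documented behaviour: the `RateLimited` exception is a hypothetical signal that your own HTTP layer would raise (for example when it sees an HTTP 429 response, assuming that is how the limit is reported).

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical signal that a call hit a rate limit (e.g. mapped from HTTP 429)."""


def backoff_delays(retries: int, base: float = 1.0, cap: float = 30.0) -> List[float]:
    """Return exponentially growing delays in seconds, capped at `cap`."""
    return [min(cap, base * 2 ** attempt) for attempt in range(retries)]


def call_with_retry(
    fn: Callable[[], T],
    retries: int = 4,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, waiting and retrying whenever it raises RateLimited."""
    for delay in backoff_delays(retries):
        try:
            return fn()
        except RateLimited:
            sleep(delay)
    # Final attempt: let RateLimited propagate to the caller if it still fails.
    return fn()


print(backoff_delays(4))  # [1.0, 2.0, 4.0, 8.0]
```

The `sleep` parameter is injectable so the retry loop can be exercised in tests without real waiting.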

Can I switch variants later?

Yes. The variant is just a flag; change the model parameter to switch.
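
A minimal sketch of that switch, assuming the request body is a plain dictionary with a `model` key (the key name and the helper `with_variant` are illustrative). Only the create_audio endpoint is listed on this page, so other variants may also need a different endpoint and different inputs.

```python
from typing import Any, Dict

# Variant IDs listed on this page.
KNOWN_VARIANTS = (
    "gemini-omni-audio",
    "gemini-omni-character",
    "gemini-omni-text-to-video",
)


def with_variant(payload: Dict[str, Any], model_id: str) -> Dict[str, Any]:
    """Return a copy of payload with its 'model' field set to model_id."""
    if model_id not in KNOWN_VARIANTS:
        raise ValueError(f"unknown variant: {model_id}")
    # Copy instead of mutating so the original request stays reusable.
    return {**payload, "model": model_id}


audio = {"model": "gemini-omni-audio", "prompt": "rainy city ambience"}
video = with_variant(audio, "gemini-omni-text-to-video")
print(video["model"])  # gemini-omni-text-to-video
```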

Is streaming supported?

Where streaming is available, RunAPI streams end to end.

Where should I report quality issues?

Open an issue in the public GitHub repo, or email the support team.

Other Gemini Omni variants

gemini-omni-character (cheapest)

gemini-omni-text-to-video

Alternatives from other models

Image and video generation from text — text-to-image, image-to-video, and editing with audio.

Text and image-to-video at native 1080p with accurate physics simulation and motion.

Text, image, and edit-video generation with 720p and 1080p output, duration control, first-frame image support, ordered reference images for character-guided clips, and source-video editing.

Get started

Start building products with Gemini Omni.

Create a free account · Read the quick start →