What stability and similarity settings should I use for the most natural voice?

Start with stability at 0.5 and similarity_boost at 0.75. Higher stability makes the voice more consistent but less expressive. Higher similarity keeps the voice closer to the original profile. For audiobooks, try stability 0.6-0.8. For conversational content, lower stability (0.3-0.5) adds natural variation.

What is the difference between turbo-v2.5 and multilingual-v2?

Turbo-v2.5 is optimized for low latency and English-first output -- it applies a default voice when none is specified. Multilingual-v2 supports 29 languages and requires an explicit voice ID and optional language_code. Turbo costs roughly half as much per character.

How do I reduce ElevenLabs costs for long-form content like audiobooks?

Use turbo-v2.5 for English content -- it costs roughly half as much per character as multilingual-v2. Break long texts into chunks under 5000 characters per request. Use the RunAPI batch approach to process chapters in parallel rather than sequentially.

How is ElevenLabs billed on RunAPI?

Text-to-speech and dialogue endpoints are billed per character of input text. Speech-to-text is billed per minute of audio. Audio isolation is billed per task. Check the RunAPI pricing page for current rates.

Can I use ElevenLabs dialogue-v3 for multi-speaker audio in OpenClaw?

Yes. Call the text_to_dialogue endpoint with model text-to-dialogue-v3. Pass a dialogue array where each item has a text and a voice ID. The total text across all speakers must be under 5000 characters.

OPENCLAW + ELEVENLABS

OpenClaw で ElevenLabs を使う。

ElevenLabs は RunAPI 経由で6つの音声エンドポイントを提供します——サブ秒のレイテンシを持つ turbo-v2.5 TTS、29言語をカバーする multilingual-v2、複数話者の会話向け dialogue-v3、効果音、音声からの文字起こし、ボーカル分離です。OpenClaw のエージェントは、チャットと同じ RunAPI キーでこれらのいずれも呼び出せます。

API キーを取得ドキュメントを読む

1つの APIキー · テキスト読み上げエンドポイント · 文字単位の課金

RunAPI を使って ElevenLabs のテキスト読み上げで音声を生成します。

要件：
- RUNAPI_API_KEY から API キーを読み込みます。
- POST https://runapi.ai/api/v1/elevenlabs/text_to_speech を呼び出します。
- model を "text-to-speech-turbo-v2.5" に設定します。
- text に読み上げたい内容を設定します。
- 必要に応じて voice を特定の ElevenLabs voice ID に設定します。
- 必要に応じて speed を 0.7〜1.2 の間に設定します。
- このタスクは非同期です。返された task_id を status が "completed" になるまでポーリングします。
- 完了したら、レスポンスの output から音声の URL を読み取ります。

curl -X POST https://runapi.ai/api/v1/elevenlabs/text_to_speech \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "text-to-speech-turbo-v2.5",
    "text": "Welcome to RunAPI. This audio was generated by ElevenLabs turbo v2.5.",
    "speed": 1.0,
    "stability": 0.5,
    "similarity_boost": 0.75
  }'

{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "text-to-speech-turbo-v2.5"
}

curlコマンドをコピーしてテスト elevenlabs

仕組み

OpenClaw で ElevenLabs を使う3ステップ

Configure RunAPI

Set RUNAPI_API_KEY in your environment. If you already configured RunAPI for chat in OpenClaw, the same key works for all ElevenLabs endpoints — TTS, STT, dialogue, sound effects, and audio isolation.

export RUNAPI_API_KEY=runapi_xxx

Call text_to_speech

Send a POST to the text_to_speech endpoint with model set to text-to-speech-turbo-v2.5, the text you want spoken, and optional voice, speed, and stability parameters. For multilingual output, use text-to-speech-multilingual-v2 with a voice and language_code.

POST /api/v1/elevenlabs/text_to_speech

Poll for the result

The endpoint returns a task_id immediately. Poll the task status endpoint until the status is completed, then read the output audio URL from the response.

GET /api/v1/elevenlabs/text_to_speech/tsk_abc123

パラメータ

ElevenLabs text_to_speech API パラメータ

パラメータ	型	説明
`model`	`string`	Required. text-to-speech-turbo-v2.5 (low latency) or text-to-speech-multilingual-v2 (29 languages).
`text`	`string`	Required. The text to convert to speech. Max 5000 characters.
`voice`	`string`	ElevenLabs voice ID. Required for multilingual-v2. Turbo-v2.5 uses a default voice if omitted.
`speed`	`float`	Optional. Playback speed multiplier. Range 0.7 to 1.2.
`stability`	`float`	Optional. Voice consistency. Range 0.0 to 1.0. Lower values add expressiveness.
`similarity_boost`	`float`	Optional. Voice similarity enforcement. Range 0.0 to 1.0.
`style`	`float`	Optional. Style exaggeration. Range 0.0 to 1.0.
`language_code`	`string`	Optional. Target language for multilingual-v2, e.g. en, es, ja.
`callback_url`	`string`	Optional. Webhook URL that receives a POST when the task completes.

OpenClaw上のElevenLabsとは？

ElevenLabsは自然な音声出力のための定番のテキスト→音声APIです。RunAPIを通じてOpenClaw agentはturbo-v2.5（英語サブ秒レイテンシー）・multilingual-v2（29言語）・dialogue-v3（マルチスピーカー会話）・効果音生成・音声認識・ボーカル分離にアクセスできます。