---
title: "Use ElevenLabs in OpenClaw via RunAPI: TTS & Voice API Guide"
url: "https://runapi.ai/ja/openclaw-elevenlabs.md"
canonical: "https://runapi.ai/ja/openclaw-elevenlabs"
locale: "ja"
model: "elevenlabs"
---

# Use ElevenLabs in OpenClaw

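ElevenLabs exposes six audio endpoints through RunAPI: turbo-v2.5 TTS with sub-second latency, multilingual-v2 covering 29 languages, dialogue-v3 for multi-speaker conversations, sound effects, speech-to-text transcription, and vocal isolation. OpenClaw agents can call any of them with the same RunAPI key they use for chat.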

## API example

```bash
curl -X POST https://runapi.ai/api/v1/elevenlabs/text_to_speech \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "text-to-speech-turbo-v2.5",
    "text": "Welcome to RunAPI. This audio was generated by ElevenLabs turbo v2.5.",
    "speed": 1.0,
    "stability": 0.5,
    "similarity_boost": 0.75
  }'
```

### Response

```json
{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "text-to-speech-turbo-v2.5"
}
```

## How it works

1. **Configure RunAPI**: Set `RUNAPI_API_KEY` in your environment. If you already configured RunAPI for chat in OpenClaw, the same key works for all ElevenLabs endpoints: TTS, STT, dialogue, sound effects, and audio isolation.
2. **Call `text_to_speech`**: Send a POST to the `text_to_speech` endpoint with `model` set to `text-to-speech-turbo-v2.5`, the text you want spoken, and optional `voice`, `speed`, and `stability` parameters. For multilingual output, use `text-to-speech-multilingual-v2` with a `voice` and `language_code`.
3. **Poll for the result**: The endpoint returns a `task_id` immediately. Poll the task status endpoint until the status is `completed`, then read the output audio URL from the response.
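
The polling step above can be sketched as a small shell helper. This is a minimal sketch under stated assumptions, not a reference client: the task status URL is passed in as an argument because its path is not given here, the `failed` status value is a guess at how errors are reported, and `json_field` and `wait_for_task` are hypothetical helper names. The `sed`-based extraction only suits flat responses like the one shown under Response; a real JSON parser is the safer choice for anything nested.

```bash
# Print the value of a top-level string field from JSON on stdin.
# $1 = field name. Only suitable for small, flat JSON documents.
json_field() {
  tr -d '\n' |
    sed -n 's/.*"'"$1"'"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}

# Poll a task status URL until the task completes.
# $1 = task status URL (assumption: look up the real path in the API docs)
# $2 = maximum number of polls (default 30)
# $3 = seconds to sleep between polls (default 2)
# Prints the final response body on success; returns non-zero otherwise.
wait_for_task() {
  local url="$1" attempts="${2:-30}" delay="${3:-2}"
  local i=0 body status
  while [ "$i" -lt "$attempts" ]; do
    body=$(curl -sS -H "Authorization: Bearer $RUNAPI_API_KEY" "$url") || return 1
    status=$(printf '%s' "$body" | json_field status)
    case "$status" in
      completed) printf '%s\n' "$body"; return 0 ;;
      failed) printf '%s\n' "$body" >&2; return 1 ;;  # assumed error status
    esac
    i=$((i + 1))
    sleep "$delay"
  done
  echo "task did not complete after $attempts polls" >&2
  return 1
}
```

For example, `json_field task_id` pulls the ID out of the submit response, and `wait_for_task "$status_url"` then prints the completed task body, from which the output audio URL can be read.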

## Parameters

| Parameter | Type | Description |
|-----------|------|-------------|
| `model` | `string` | Required. text-to-speech-turbo-v2.5 (low latency) or text-to-speech-multilingual-v2 (29 languages). |
| `text` | `string` | Required. The text to convert to speech. Max 5000 characters. |
| `voice` | `string` | ElevenLabs voice ID. Required for multilingual-v2. Turbo-v2.5 uses a default voice if omitted. |
| `speed` | `float` | Optional. Playback speed multiplier. Range 0.7 to 1.2. |
| `stability` | `float` | Optional. Voice consistency. Range 0.0 to 1.0. Lower values add expressiveness. |
| `similarity_boost` | `float` | Optional. Voice similarity enforcement. Range 0.0 to 1.0. |
| `style` | `float` | Optional. Style exaggeration. Range 0.0 to 1.0. |
| `language_code` | `string` | Optional. Target language for multilingual-v2, e.g. en, es, ja. |
| `callback_url` | `string` | Optional. Webhook URL that receives a POST when the task completes. |
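
The ranges in the table can be checked client-side before a request is sent. A minimal sketch, assuming the bounds are inclusive; `in_range` and `check_tts_params` are hypothetical helper names, and `awk` does the comparison because POSIX shell arithmetic is integer-only.

```bash
# Succeed when $1 is a plain decimal number with $2 <= $1 <= $3.
in_range() {
  awk -v v="$1" -v lo="$2" -v hi="$3" 'BEGIN {
    ok = (v ~ /^[0-9]+(\.[0-9]+)?$/) && (v + 0 >= lo + 0) && (v + 0 <= hi + 0)
    exit !ok
  }'
}

# Validate the optional tuning parameters from the table.
# $1 = speed, $2 = stability, $3 = similarity_boost, $4 = style
check_tts_params() {
  in_range "$1" 0.7 1.2 || { echo "speed must be within 0.7-1.2" >&2; return 1; }
  in_range "$2" 0.0 1.0 || { echo "stability must be within 0.0-1.0" >&2; return 1; }
  in_range "$3" 0.0 1.0 || { echo "similarity_boost must be within 0.0-1.0" >&2; return 1; }
  in_range "$4" 0.0 1.0 || { echo "style must be within 0.0-1.0" >&2; return 1; }
}
```

Calling `check_tts_params 1.0 0.5 0.75 0.0` succeeds silently, while an out-of-range or non-numeric value prints the offending parameter to stderr and returns non-zero.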

## FAQ

### What stability and similarity settings should I use for the most natural voice?

Start with stability at 0.5 and similarity_boost at 0.75. Higher stability makes the voice more consistent but less expressive. Higher similarity keeps the voice closer to the original profile. For audiobooks, try stability 0.6-0.8. For conversational content, lower stability (0.3-0.5) adds natural variation.

### What is the difference between turbo-v2.5 and multilingual-v2?

Turbo-v2.5 is optimized for low latency and English-first output, and it applies a default voice when none is specified. Multilingual-v2 supports 29 languages, requires an explicit voice ID, and accepts an optional `language_code`. Turbo costs roughly half as much per character.
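
The practical difference shows up when building the request body: the turbo model can be called with text alone, while the multilingual model needs a voice. A minimal sketch, assuming the field names from the Parameters table; `json_escape` and `tts_payload` are hypothetical helpers, and the escaping covers only backslashes and double quotes, not control characters.

```bash
# Escape backslashes and double quotes so $1 can sit inside a JSON string.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Print a text_to_speech request body.
# $1 = model, $2 = text, $3 = voice (optional), $4 = language_code (optional)
tts_payload() {
  local model="$1" text="$2" voice="${3:-}" lang="${4:-}" extra=""
  case "$model" in
    text-to-speech-turbo-v2.5) ;;  # a default voice applies when none is given
    text-to-speech-multilingual-v2)
      if [ -z "$voice" ]; then
        echo "multilingual-v2 requires a voice ID" >&2
        return 1
      fi ;;
    *)
      echo "unknown model: $model" >&2
      return 1 ;;
  esac
  if [ -n "$voice" ]; then
    extra="$extra,\"voice\":\"$(json_escape "$voice")\""
  fi
  if [ -n "$lang" ]; then
    extra="$extra,\"language_code\":\"$(json_escape "$lang")\""
  fi
  printf '{"model":"%s","text":"%s"%s}\n' "$model" "$(json_escape "$text")" "$extra"
}
```

The printed body can be handed to the `curl` call from the API example in place of the inline JSON, for instance with `-d "$(tts_payload text-to-speech-turbo-v2.5 'Hello')"`.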

### How do I reduce ElevenLabs costs for long-form content like audiobooks?

Use turbo-v2.5 for English content, since it costs roughly half as much per character as multilingual-v2. Break long texts into chunks under 5000 characters per request. Use the RunAPI batch approach to process chapters in parallel rather than sequentially.
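
The chunking step can be sketched with `awk`. A minimal sketch, assuming whitespace is an acceptable place to split; `chunk_text` is a hypothetical helper. It packs whole words greedily into chunks of at most 5000 characters (the maximum from the Parameters table), and a stricter limit can be passed as the first argument. Some `awk` implementations count bytes rather than characters, so leave headroom for non-ASCII text.

```bash
# Split text from stdin into chunks of at most $1 characters (default 5000).
# Emits one chunk per output line. Line breaks in the input are treated as
# ordinary whitespace. A single word longer than the limit is emitted as its
# own oversized chunk rather than being cut.
chunk_text() {
  awk -v max="${1:-5000}" '
    {
      for (i = 1; i <= NF; i++) {
        word = $i
        if (buf == "") {
          buf = word                      # first word of a new chunk
        } else if (length(buf) + 1 + length(word) <= max) {
          buf = buf " " word              # word still fits, keep packing
        } else {
          print buf                       # chunk is full, start the next one
          buf = word
        }
      }
    }
    END { if (buf != "") print buf }
  '
}
```

Each output line is one chunk, ready to be sent as the `text` of its own request, for example `chunk_text 5000 < chapter.txt`.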

### How is ElevenLabs billed on RunAPI?

Text-to-speech and dialogue endpoints are billed per character of input text. Speech-to-text is billed per minute of audio. Audio isolation is billed per task. Check the RunAPI pricing page for current rates.

### Can I use ElevenLabs dialogue-v3 for multi-speaker audio in OpenClaw?

Yes. Call the text_to_dialogue endpoint with model text-to-dialogue-v3. Pass a dialogue array where each item has a text and a voice ID. The total text across all speakers must be under 5000 characters.
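
A request body for this endpoint can be assembled and checked against the 5000-character budget before it is sent. A minimal sketch: the JSON keys `dialogue`, `text`, and `voice` are assumptions inferred from the description above and may differ from the real schema, and `json_escape` and `dialogue_payload` are hypothetical helpers. `${#text}` counts characters in bash under a UTF-8 locale, but some shells count bytes.

```bash
# Escape backslashes and double quotes so $1 can sit inside a JSON string.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Print a text_to_dialogue request body.
# Arguments come in pairs: voice_id text voice_id text ...
# Fails when the combined text is not under 5000 characters.
dialogue_payload() {
  local items="" total=0 voice text
  while [ "$#" -ge 2 ]; do
    voice="$1"
    text="$2"
    shift 2
    total=$((total + ${#text}))
    # Key names "voice" and "text" are assumed, not confirmed by the docs.
    items="${items:+$items,}{\"voice\":\"$(json_escape "$voice")\",\"text\":\"$(json_escape "$text")\"}"
  done
  if [ "$total" -ge 5000 ]; then
    echo "dialogue text is $total characters; it must be under 5000" >&2
    return 1
  fi
  printf '{"model":"text-to-dialogue-v3","dialogue":[%s]}\n' "$items"
}
```

For example, `dialogue_payload voice_a "Hi there." voice_b "Hello!"` prints a two-speaker body that can be posted to the `text_to_dialogue` endpoint.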


## Links

- [OpenClaw setup guide →](https://runapi.ai/ja/openclaw)
- [ElevenLabs models →](https://runapi.ai/ja/models/elevenlabs)
- [Model catalog](https://runapi.ai/ja/models)
- [API docs](https://runapi.ai/ja/docs)
