What stability and similarity settings should I use for the most natural voice?

Start with stability at 0.5 and similarity_boost at 0.75. Higher stability makes the voice more consistent but less expressive. Higher similarity keeps the voice closer to the original profile. For audiobooks, try stability 0.6-0.8. For conversational content, lower stability (0.3-0.5) adds natural variation.

What is the difference between turbo-v2.5 and multilingual-v2?

Turbo-v2.5 is optimized for low latency and English-first output -- it applies a default voice when none is specified. Multilingual-v2 supports 29 languages and requires an explicit voice ID and optional language_code. Turbo costs roughly half as much per character.

How do I reduce ElevenLabs costs for long-form content like audiobooks?

Use turbo-v2.5 for English content -- it costs roughly half as much per character as multilingual-v2. Break long texts into chunks under 5000 characters per request. Use the RunAPI batch approach to process chapters in parallel rather than sequentially.

How is ElevenLabs billed on RunAPI?

Text-to-speech and dialogue endpoints are billed per character of input text. Speech-to-text is billed per minute of audio. Audio isolation is billed per task. Check the RunAPI pricing page for current rates.

Can I use ElevenLabs dialogue-v3 for multi-speaker audio in OpenClaw?

Yes. Call the text_to_dialogue endpoint with model text-to-dialogue-v3. Pass a dialogue array where each item has a text and a voice ID. The total text across all speakers must be under 5000 characters.

OPENCLAW + ELEVENLABS

Use ElevenLabs in OpenClaw.

ElevenLabs provides six audio endpoints through RunAPI — turbo-v2.5 TTS with sub-second latency, multilingual-v2 covering 29 languages, dialogue-v3 for multi-speaker conversations, sound effects, speech-to-text transcription, and vocal isolation. OpenClaw agents call any of them with the same RunAPI key used for chat.

Get API Key Read the docs

one API key · text to speech endpoint · per-character billing

Use RunAPI to generate speech audio with ElevenLabs text-to-speech.

Requirements:
- Read the API key from RUNAPI_API_KEY.
- Call POST https://runapi.ai/api/v1/elevenlabs/text_to_speech
- Set model to "text-to-speech-turbo-v2.5".
- Set text to the content you want spoken.
- Optionally set voice to a specific ElevenLabs voice ID.
- Optionally set speed between 0.7 and 1.2.
- The task is async. Poll the returned task_id until status is "completed".
- When done, read the audio URL from the response output.

curl -X POST https://runapi.ai/api/v1/elevenlabs/text_to_speech \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "text-to-speech-turbo-v2.5",
    "text": "Welcome to RunAPI. This audio was generated by ElevenLabs turbo v2.5.",
    "speed": 1.0,
    "stability": 0.5,
    "similarity_boost": 0.75
  }'

{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "text-to-speech-turbo-v2.5"
}

Copy the curl command to test elevenlabs

HOW IT WORKS

Use ElevenLabs in OpenClaw in three steps

Configure RunAPI

Set RUNAPI_API_KEY in your environment. If you already configured RunAPI for chat in OpenClaw, the same key works for all ElevenLabs endpoints — TTS, STT, dialogue, sound effects, and audio isolation.

export RUNAPI_API_KEY=runapi_xxx

Call text_to_speech

Send a POST to the text_to_speech endpoint with model set to text-to-speech-turbo-v2.5, the text you want spoken, and optional voice, speed, and stability parameters. For multilingual output, use text-to-speech-multilingual-v2 with a voice and language_code.

POST /api/v1/elevenlabs/text_to_speech

Poll for the result

The endpoint returns a task_id immediately. Poll the task status endpoint until the status is completed, then read the output audio URL from the response.

GET /api/v1/elevenlabs/text_to_speech/tsk_abc123

PARAMETERS

ElevenLabs text_to_speech API parameters

Parameter	Type	Description
`model`	`string`	Required. text-to-speech-turbo-v2.5 (low latency) or text-to-speech-multilingual-v2 (29 languages).
`text`	`string`	Required. The text to convert to speech. Max 5000 characters.
`voice`	`string`	ElevenLabs voice ID. Required for multilingual-v2. Turbo-v2.5 uses a default voice if omitted.
`speed`	`float`	Optional. Playback speed multiplier. Range 0.7 to 1.2.
`stability`	`float`	Optional. Voice consistency. Range 0.0 to 1.0. Lower values add expressiveness.
`similarity_boost`	`float`	Optional. Voice similarity enforcement. Range 0.0 to 1.0.
`style`	`float`	Optional. Style exaggeration. Range 0.0 to 1.0.
`language_code`	`string`	Optional. Target language for multilingual-v2, e.g. en, es, ja.
`callback_url`	`string`	Optional. Webhook URL that receives a POST when the task completes.

What is ElevenLabs on OpenClaw?

ElevenLabs is the go-to text-to-speech API for natural-sounding voice output. Through RunAPI, OpenClaw agents get access to turbo-v2.5 (sub-second latency for English), multilingual-v2 (29 languages), dialogue-v3 (multi-speaker conversations), sound effects generation, speech-to-text transcription, and vocal isolation. You can tune voice output with stability and similarity boost sliders to control how expressive or consistent the voice sounds.

ElevenLabs use cases

Audiobook and podcast narration

Convert long-form text into spoken audio using consistent character voices. Adjust stability for narrator consistency and similarity boost to keep the voice close to the original profile across hours of content.

Video dubbing into multiple languages

Dub video content into 29 languages using multilingual-v2 with the same voice profile, producing localized versions that maintain the original speaker's vocal characteristics.

Sound effects for video and game production

Generate custom Foley sounds, ambient audio, and sound cues from text descriptions through the text_to_sound endpoint, replacing stock audio library searches with on-demand generation.

FAQ

ElevenLabs + OpenClaw questions

OpenClaw general setup

Not configured yet? Start with the RunAPI setup guide for OpenClaw.

OpenClaw setup guide →

ElevenLabs model catalog

See all ElevenLabs variants, pricing, and API docs.

ElevenLabs on RunAPI →

Try ElevenLabs in OpenClaw today.

Get a free RunAPI key, paste the prompt into OpenClaw, and generate speech audio with ElevenLabs — six endpoints, one API key, per-character billing.

Browse models →