ElevenLabs speech-to-text API
A model variant exposed through RunAPI's unified AI API.
Operational · audio_music · Commercial use supported
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/elevenlabs -g
Or paste this prompt to your AI agent:
Install the ElevenLabs skill for me: 1. Clone https://github.com/runapi-ai/elevenlabs 2. Copy the skills/elevenlabs/ directory into your user-level skills directory (e.g. ~/.claude/skills/ for Claude Code, ~/.codex/skills/ for Codex). 3. Verify that SKILL.md is present. 4. Confirm the install path when done.
Overview
ElevenLabs speech-to-text is available through the same RunAPI auth, SDKs, and agent skill workflow.
- Unified endpoint
- SDK snippets
- Agent install path
- Failed generations are not charged
Pricing
Failed generations are not charged.
Speech to text: $0.04 / minute
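As a rough illustration of the per-minute price above, the sketch below estimates the charge for a recording of a given length. The helper name is hypothetical, and it assumes billing is proportional to duration; the page does not say whether partial minutes are rounded up.

```typescript
// Listed price for this variant: $0.04 per minute of audio.
const SPEECH_TO_TEXT_USD_PER_MINUTE = 0.04;

/**
 * Estimate the charge for one transcription.
 * Assumption: billing is proportional to duration (no rounding up to whole minutes).
 */
function estimateTranscriptionCostUsd(
  durationSeconds: number,
  usdPerMinute: number = SPEECH_TO_TEXT_USD_PER_MINUTE,
): number {
  if (!Number.isFinite(durationSeconds) || durationSeconds < 0) {
    throw new RangeError("durationSeconds must be a non-negative number");
  }
  const raw = (durationSeconds / 60) * usdPerMinute;
  // Round to micro-dollars to hide floating-point noise.
  return Math.round(raw * 1_000_000) / 1_000_000;
}

console.log(estimateTranscriptionCostUsd(90)); // 90 s = 1.5 min, prints 0.06
console.log(estimateTranscriptionCostUsd(45 * 60)); // a 45-minute recording
```

If the service turns out to bill in whole minutes, replacing `durationSeconds / 60` with `Math.ceil(durationSeconds / 60)` would model that instead.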
Specs
| Model ID | speech-to-text |
| Provider | ElevenLabs |
| Modality | audio_music |
| Task type | asynchronous |
| Billing unit | minute |
| Endpoint | /api/v1/elevenlabs/speech_to_text |
| Commercial | Yes |
| Status | Operational |
Quickstart: speech-to-text
# cURL
curl -X POST https://runapi.ai/api/v1/elevenlabs/speech_to_text \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "speech-to-text",
    "audio_url": "https://cdn.runapi.ai/public/samples/voice.mp3"
  }'

// JavaScript / TypeScript SDK
import { ElevenlabsClient } from "@runapi.ai/elevenlabs";

const client = new ElevenlabsClient();
const result = await client.speechToText.run({
  model: "speech-to-text",
  audio_url: "https://cdn.runapi.ai/public/samples/voice.mp3",
});

# Ruby SDK
require "runapi/elevenlabs"

client = RunApi::Elevenlabs::Client.new
result = client.speech_to_text.run(
  model: "speech-to-text",
  audio_url: "https://cdn.runapi.ai/public/samples/voice.mp3"
)
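For environments where the SDK is not installed, the curl request in the quickstart can be mirrored with a plain `fetch` call. The sketch below only builds the request; `buildSpeechToTextRequest` is a hypothetical helper, not part of any RunAPI package, and the response format is not documented on this page, so it is not parsed here.

```typescript
// Endpoint, headers, and body fields are taken from the curl quickstart.
const SPEECH_TO_TEXT_URL = "https://runapi.ai/api/v1/elevenlabs/speech_to_text";

interface SpeechToTextRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Hypothetical helper (not part of any RunAPI SDK): builds the HTTP request only.
function buildSpeechToTextRequest(apiKey: string, audioUrl: string): SpeechToTextRequest {
  if (!apiKey) throw new Error("apiKey is required");
  new URL(audioUrl); // throws a TypeError if audioUrl is not a valid absolute URL
  return {
    url: SPEECH_TO_TEXT_URL,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model: "speech-to-text", audio_url: audioUrl }),
    },
  };
}

// Usage (performs a network call, so it is left commented out):
// const { url, init } = buildSpeechToTextRequest(
//   process.env.RUNAPI_KEY ?? "",
//   "https://cdn.runapi.ai/public/samples/voice.mp3",
// );
// const response = await fetch(url, init);
// console.log(response.status, await response.text());
```

Keeping request construction separate from the network call makes the payload easy to inspect or unit-test before any billable request is sent.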
How it works
How to use speech-to-text
1. Choose endpoint: use /api/v1/elevenlabs/speech_to_text and copy the matching SDK snippet from the quickstart.
2. Pass model ID: set "model" to "speech-to-text" in the request body.
3. Run task: submit the request and store the returned task ID.
4. Collect output: poll for the result, or receive the callback when the task finishes.
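Because the task is asynchronous, the polling option in the last step can be sketched as a small retry loop. This page does not document the status endpoint or the task payload, so the status values below are hypothetical and the lookup is injected by the caller rather than hard-coded.

```typescript
// Hypothetical status values; the real task payload is not documented on this page.
type TaskStatus = "pending" | "completed" | "failed";

interface TaskSnapshot<T> {
  status: TaskStatus;
  output?: T;
  error?: string;
}

const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

/**
 * Poll `fetchTask` until the task leaves the "pending" state.
 * `fetchTask` is supplied by the caller, so this sketch assumes nothing about
 * the status endpoint's URL or response format.
 */
async function waitForTask<T>(
  fetchTask: () => Promise<TaskSnapshot<T>>,
  options: { intervalMs?: number; maxAttempts?: number } = {},
): Promise<T | undefined> {
  const { intervalMs = 2000, maxAttempts = 30 } = options;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const task = await fetchTask();
    if (task.status === "completed") return task.output;
    if (task.status === "failed") throw new Error(task.error ?? "task failed");
    // Wait before the next attempt, but not after the final one.
    if (attempt < maxAttempts) await sleep(intervalMs);
  }
  throw new Error(`task still pending after ${maxAttempts} attempts`);
}
```

A caller would pass a function that looks up the stored task ID and maps the response onto `TaskSnapshot`. Capping the number of attempts keeps a stuck task from polling forever; a callback avoids polling altogether.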
Comparison
How speech-to-text compares
VS AUDIO-ISOLATION
- speech-to-text: Transcription across 29+ languages with speaker diarization
- audio-isolation: Vocal extraction from mixed audio sources
VS SOUND-EFFECT-V2
- speech-to-text: Transcription across 29+ languages with speaker diarization
- sound-effect-v2: Text-to-sound effects for games, video, and podcasts
VS TEXT-TO-DIALOGUE-V3
- speech-to-text: Transcription across 29+ languages with speaker diarization
- text-to-dialogue-v3: Multi-speaker dialogue generation with natural turn-taking
Use cases
Where to use this variant
- Music generation: Create tracks and audio assets.
- Voice workflows: Build speech and audio pipelines.
- Agent creation: Expose audio tools to agents.
FAQ
Frequently asked questions about speech-to-text
How do I select speech-to-text?
Set "model" to "speech-to-text" in the request body, as shown in the quickstart.
Is pricing usage-based?
Yes. Speech to text is billed per minute of audio at $0.04 / minute, and failed generations are not charged.
Other ElevenLabs variants
- text-to-speech-turbo-v2.5: $0.060 / 1K chars (Cheapest)
- audio-isolation: $0.120 / minute
- text-to-speech-multilingual-v2: $0.120 / 1K chars
- text-to-dialogue-v3: $0.140 / 1K chars
- sound-effect-v2: $0.150 / minute
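The variants are billed in different units (minutes of audio versus thousands of characters), so their prices are not directly comparable. The sketch below records the prices listed on this page in a lookup table and estimates a charge for a quantity expressed in the variant's own unit. The names are hypothetical, and it assumes cost scales linearly with no minimum charge.

```typescript
type BillingUnit = "minute" | "1K chars";

// Prices copied from this page; the billing unit differs per variant.
const ELEVENLABS_VARIANT_PRICES: Record<string, { usd: number; unit: BillingUnit }> = {
  "speech-to-text": { usd: 0.04, unit: "minute" },
  "text-to-speech-turbo-v2.5": { usd: 0.06, unit: "1K chars" },
  "audio-isolation": { usd: 0.12, unit: "minute" },
  "text-to-speech-multilingual-v2": { usd: 0.12, unit: "1K chars" },
  "text-to-dialogue-v3": { usd: 0.14, unit: "1K chars" },
  "sound-effect-v2": { usd: 0.15, unit: "minute" },
};

/**
 * Estimate a charge for `quantity` units of the variant's own billing unit
 * (minutes of audio, or thousands of characters).
 * Assumption: cost scales linearly with quantity, with no minimum charge.
 */
function estimateVariantCostUsd(variant: string, quantity: number): number {
  const price = ELEVENLABS_VARIANT_PRICES[variant];
  if (price === undefined) throw new Error(`unknown variant: ${variant}`);
  if (!Number.isFinite(quantity) || quantity < 0) {
    throw new RangeError("quantity must be a non-negative number");
  }
  // Round to micro-dollars to hide floating-point noise.
  return Math.round(price.usd * quantity * 1_000_000) / 1_000_000;
}

console.log(estimateVariantCostUsd("audio-isolation", 10)); // 10 minutes of audio
console.log(estimateVariantCostUsd("text-to-speech-turbo-v2.5", 2.5)); // 2,500 characters
```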