ElevenLabs speech-to-text API

A model variant exposed through RunAPI's unified AI API.

Operational · audio_music · Commercial use supported
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/elevenlabs -g
The -g flag installs globally so every project picks it up.
Or paste this prompt to your AI agent:
Install the ElevenLabs skill for me:

1. Clone https://github.com/runapi-ai/elevenlabs
2. Copy the skills/elevenlabs/ directory into your
   user-level skills directory (e.g. ~/.claude/skills/
   for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
Overview

ElevenLabs speech-to-text is available through the same RunAPI auth, SDKs, and agent skill workflow.

  • Unified endpoint
  • SDK snippets
  • Agent install path
  • Failed generations are not charged
Pricing

Failed generations are not charged.
Speech to text
$0.04 / minute
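
As a rough illustration of the per-minute rate, the sketch below estimates the cost of one transcription from the audio duration. The helper name `estimateCostUsd` is hypothetical, and the page does not say how partial minutes are billed, so both pro-rated and round-up-to-the-minute behavior are shown as assumptions.

```typescript
// Listed rate from the pricing section: $0.04 per minute.
const RATE_USD_PER_MINUTE = 0.04;

// Hypothetical helper. Assumption: billing is pro-rated by the second unless
// roundUpToMinute is true, in which case partial minutes count as whole ones.
function estimateCostUsd(durationSeconds: number, roundUpToMinute = false): number {
  if (!Number.isFinite(durationSeconds) || durationSeconds < 0) {
    throw new RangeError("durationSeconds must be a non-negative finite number");
  }
  const minutes = roundUpToMinute
    ? Math.ceil(durationSeconds / 60)
    : durationSeconds / 60;
  // Round to 4 decimal places to hide floating-point noise.
  return Math.round(minutes * RATE_USD_PER_MINUTE * 10000) / 10000;
}

console.log(estimateCostUsd(90));       // 0.06 (1.5 minutes, pro-rated)
console.log(estimateCostUsd(90, true)); // 0.08 (rounded up to 2 minutes)
console.log(estimateCostUsd(3600));     // 2.4  (one hour of audio)
```

Failed generations are not charged, so an estimate like this applies only to tasks that complete successfully.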
Specs

Model ID: speech-to-text
Provider: ElevenLabs
Modality: audio_music
Task type: asynchronous
Billing: per minute
Endpoint: /api/v1/elevenlabs/speech_to_text
Commercial use: Yes
Status: Operational
Quick start

Quickstart — speech-to-text

cURL

curl -X POST https://runapi.ai/api/v1/elevenlabs/speech_to_text \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "speech-to-text",
  "audio_url": "https://cdn.runapi.ai/public/samples/voice.mp3"
}'

TypeScript

import { ElevenlabsClient } from "@runapi.ai/elevenlabs";

const client = new ElevenlabsClient();
const result = await client.speechToText.run({
    model: "speech-to-text",
    audio_url: "https://cdn.runapi.ai/public/samples/voice.mp3",
});

Ruby

require "runapi/elevenlabs"

client = RunApi::Elevenlabs::Client.new
result = client.speech_to_text.run(
    model: "speech-to-text",
    audio_url: "https://cdn.runapi.ai/public/samples/voice.mp3"
)

@runapi.ai/elevenlabs v1
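
If you would rather not depend on the SDK, the cURL call can be mirrored with plain `fetch`. The sketch below only builds the request: the endpoint, headers, and body fields come from the cURL example, while the helper name `buildSpeechToTextRequest` and its return shape are illustrative assumptions. The response format is not documented on this page, so it is not modeled here.

```typescript
const RUNAPI_BASE_URL = "https://runapi.ai";
const SPEECH_TO_TEXT_PATH = "/api/v1/elevenlabs/speech_to_text";

interface PreparedRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Hypothetical helper: assembles the same request as the cURL example
// so it can be passed directly to fetch(url, init).
function buildSpeechToTextRequest(apiKey: string, audioUrl: string): PreparedRequest {
  if (!apiKey) {
    throw new Error("A RunAPI key is required");
  }
  new URL(audioUrl); // throws a TypeError if audioUrl is not a valid absolute URL
  return {
    url: RUNAPI_BASE_URL + SPEECH_TO_TEXT_PATH,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model: "speech-to-text", audio_url: audioUrl }),
    },
  };
}

// Usage (performs a network call, so it is left commented out):
// const { url, init } = buildSpeechToTextRequest(process.env.RUNAPI_KEY ?? "", "https://cdn.runapi.ai/public/samples/voice.mp3");
// const response = await fetch(url, init);
```

Keeping request construction separate from the network call makes it easy to validate inputs and inspect the payload before anything is sent.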
How it works

How to use speech-to-text

01  Choose endpoint
    Pick the endpoint (/api/v1/elevenlabs/speech_to_text) and copy the SDK snippet.

02  Pass model ID
    Set the model field to this variant's ID, speech-to-text, in the request body.

03  Run task
    Submit the request and store the returned task ID.

04  Collect output
    Poll for the result, or receive the callback when the task finishes.
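
Steps 03 and 04 can be sketched as a generic polling loop. This page does not document the task-status endpoint or its response fields, so the `TaskSnapshot` shape, the status names, and the caller-supplied `fetchTask` function below are all hypothetical; substitute whatever the RunAPI task API actually returns.

```typescript
// Hypothetical task shape; the real field names are not documented on this page.
type TaskStatus = "pending" | "processing" | "succeeded" | "failed";

interface TaskSnapshot {
  id: string;
  status: TaskStatus;
  output?: unknown;
  error?: string;
}

// The caller supplies how a task is looked up (SDK call, fetch, etc.).
type FetchTask = (taskId: string) => Promise<TaskSnapshot>;

interface PollOptions {
  intervalMs?: number;  // delay between attempts
  maxAttempts?: number; // give up after this many lookups
}

async function waitForTask(
  taskId: string,
  fetchTask: FetchTask,
  { intervalMs = 2000, maxAttempts = 60 }: PollOptions = {},
): Promise<TaskSnapshot> {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const task = await fetchTask(taskId);
    if (task.status === "succeeded") return task;
    if (task.status === "failed") {
      throw new Error(`Task ${taskId} failed: ${task.error ?? "unknown error"}`);
    }
    // Still running: wait before the next lookup, unless this was the last one.
    if (attempt < maxAttempts) {
      await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Task ${taskId} did not finish after ${maxAttempts} attempts`);
}
```

Injecting `fetchTask` keeps the loop independent of any particular client, so the same function works with the SDK or with raw HTTP. If your integration can receive callbacks, they avoid the repeated lookups entirely.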

Comparison

How speech-to-text compares

VS AUDIO-ISOLATION

  speech-to-text: Transcription across 29+ languages with speaker diarization
  audio-isolation: Vocal extraction from mixed audio sources

VS SOUND-EFFECT-V2

  speech-to-text: Transcription across 29+ languages with speaker diarization
  sound-effect-v2: Text-to-sound effects for games, video, and podcasts

VS TEXT-TO-DIALOGUE-V3

  speech-to-text: Transcription across 29+ languages with speaker diarization
  text-to-dialogue-v3: Multi-speaker dialogue generation with natural turn-taking

Use cases

Where to use this variant

Music generation

Create tracks and audio assets.

Voice workflows

Build speech and audio pipelines.

Agent creation

Expose audio tools to agents.

FAQ

Frequently asked questions about speech-to-text

How do I select speech-to-text?

Set the model field to "speech-to-text" in the request body, as shown in the quickstart.

Is pricing usage-based?

Yes. Speech to text is metered per minute at $0.04 / minute, and failed generations are not charged.

Get started

Start with ElevenLabs today.