OPENCLAW + INFINITETALK

Use InfiniteTalk in OpenClaw.

InfiniteTalk generates lip-synced talking avatar videos from an audio file and a face image. OpenClaw agents call it through RunAPI with the same API key used for chat: send a request to the audio_to_video endpoint, poll the returned task, and read the video URL from the result.

one API key · audio to video endpoint · async task polling
Use RunAPI to generate a talking avatar video with InfiniteTalk.

Requirements:
- Read the API key from RUNAPI_API_KEY.
- Call POST https://runapi.ai/api/v1/infinitetalk/audio_to_video
- Set model to "infinitetalk-from-audio".
- Set source_audio_url to the URL of the audio file.
- Set source_image_url to the URL of the face image.
- Set prompt to describe the desired output.
- The task is async. Poll the returned task_id until status is "completed".
- When done, read the video URL from the response output.

The same request as a curl command:

curl -X POST https://runapi.ai/api/v1/infinitetalk/audio_to_video \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "infinitetalk-from-audio",
    "source_audio_url": "https://example.com/speech.mp3",
    "source_image_url": "https://example.com/avatar.jpg",
    "prompt": "A professional presenter delivering a product overview"
  }'

The endpoint responds immediately with the new task:

{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "infinitetalk-from-audio"
}
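
To carry the task_id from this response into the polling step of a shell script, you can pull it out of the response body. The following is a minimal sketch: json_string_field is a hypothetical helper, not part of RunAPI or OpenClaw, and its sed pattern only handles flat string fields like the ones in the response above. A real JSON parser such as jq is the sturdier choice if it is available.

```shell
# Hypothetical helper: print the value of one flat string field from JSON on stdin.
# $1 = field name. Does not handle nested objects or escaped quotes inside values.
json_string_field() {
  sed -n 's/.*"'"$1"'"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' | head -n 1
}

# Usage, assuming $response holds the body returned by the curl command above:
#   task_id="$(printf '%s\n' "$response" | json_string_field task_id)"
```
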
HOW IT WORKS

Use InfiniteTalk in OpenClaw in three steps

1. Configure RunAPI

Set RUNAPI_API_KEY in your environment. If you already configured RunAPI for chat in OpenClaw, the same key works for InfiniteTalk, with no extra provider setup.

export RUNAPI_API_KEY=runapi_xxx
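
A script or agent tool can check for the key up front so that a missing variable fails with a clear message instead of an unauthenticated request. This is a small sketch; require_runapi_key is an illustrative name, not a RunAPI or OpenClaw command.

```shell
# Hypothetical guard: return non-zero with a message if RUNAPI_API_KEY is unset or empty.
require_runapi_key() {
  if [ -z "${RUNAPI_API_KEY:-}" ]; then
    echo "RUNAPI_API_KEY is not set" >&2
    return 1
  fi
}

# Usage: stop the script before any request is sent.
#   require_runapi_key || exit 1
```
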
2. Call InfiniteTalk

Send a POST request to the audio_to_video endpoint with the model, audio URL, image URL, and a prompt describing the output. OpenClaw can build and send this request in your agent workflow.

POST /api/v1/infinitetalk/audio_to_video
3. Poll for the result

The endpoint returns a task_id immediately. Poll the task status endpoint until the status is completed, then read the output video URL from the response.

GET /api/v1/infinitetalk/audio_to_video/tsk_abc123
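
The polling step can be sketched as a small shell function. Several details here are assumptions rather than documented behavior: the function name, the RUNAPI_BASE_URL override, the 5-second interval, the attempt limit, and the "failed" status value. Only the endpoint path and the "completed" status come from the steps above. The function prints the final response body and leaves reading the video URL to the caller.

```shell
# Poll an InfiniteTalk task until its status is "completed".
# $1 = task_id, $2 = seconds between polls (default 5), $3 = max attempts (default 120).
# Returns 0 and prints the response body on completion, 1 on error, 2 on timeout.
poll_infinitetalk_task() {
  task_id="$1"; interval="${2:-5}"; max_attempts="${3:-120}"
  # RUNAPI_BASE_URL is a hypothetical override for testing; it defaults to the documented host.
  url="${RUNAPI_BASE_URL:-https://runapi.ai}/api/v1/infinitetalk/audio_to_video/$task_id"
  attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    body="$(curl --silent --show-error --fail \
      -H "Authorization: Bearer $RUNAPI_API_KEY" "$url")" || return 1
    # Strip whitespace so "status": "completed" and "status":"completed" both match.
    compact="$(printf '%s' "$body" | tr -d ' \t\n')"
    case "$compact" in
      *'"status":"completed"'*) printf '%s\n' "$body"; return 0 ;;
      # "failed" is an assumed status name; adjust to what the API actually returns.
      *'"status":"failed"'*) printf '%s\n' "$body" >&2; return 1 ;;
    esac
    sleep "$interval"
    attempt=$((attempt + 1))
  done
  echo "timed out waiting for task $task_id" >&2
  return 2
}

# Usage:
#   result="$(poll_infinitetalk_task tsk_abc123)" && printf '%s\n' "$result"
```

The attempt limit keeps an agent workflow from waiting forever if a task never reaches a terminal status.
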
PARAMETERS

InfiniteTalk API parameters

Parameter         | Type    | Description
------------------|---------|------------
model             | string  | Required. Use infinitetalk-from-audio.
source_audio_url  | string  | Required. URL of the source audio file to lip-sync.
source_image_url  | string  | Required. URL of the face or avatar image to animate.
prompt            | string  | Required. Text description of the desired video output. Max 5000 characters.
output_resolution | string  | Optional. Output video resolution. Accepted values: 480p, 720p. Defaults to 480p.
seed              | integer | Optional. Reproducibility seed. Integer between 10000 and 1000000.
callback_url      | string  | Optional. Webhook URL that receives a POST when the task completes.
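
The constraints in this table can be checked locally before a request is sent. The sketch below builds the JSON body in plain shell; build_infinitetalk_payload and json_escape are hypothetical helpers, the escaping only covers backslashes and double quotes in single-line values, and the prompt length check may count bytes rather than characters in some shells. callback_url is left out for brevity.

```shell
# Hypothetical helper: escape backslashes and double quotes for use inside a JSON string.
# Assumes single-line input without control characters.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Build the request body from the documented parameters, validating them first.
# $1 audio URL, $2 image URL, $3 prompt, $4 output_resolution (optional), $5 seed (optional)
build_infinitetalk_payload() {
  audio_url="$1"; image_url="$2"; prompt="$3"; resolution="${4:-}"; seed="${5:-}"

  if [ -z "$audio_url" ] || [ -z "$image_url" ] || [ -z "$prompt" ]; then
    echo "source_audio_url, source_image_url and prompt are required" >&2; return 1
  fi
  if [ "${#prompt}" -gt 5000 ]; then
    echo "prompt exceeds 5000 characters" >&2; return 1
  fi
  case "$resolution" in
    ""|480p|720p) ;;
    *) echo "output_resolution must be 480p or 720p" >&2; return 1 ;;
  esac
  if [ -n "$seed" ]; then
    case "$seed" in
      *[!0-9]*) echo "seed must be an integer" >&2; return 1 ;;
    esac
    if [ "$seed" -lt 10000 ] || [ "$seed" -gt 1000000 ]; then
      echo "seed must be between 10000 and 1000000" >&2; return 1
    fi
  fi

  payload="{\"model\":\"infinitetalk-from-audio\""
  payload="$payload,\"source_audio_url\":\"$(json_escape "$audio_url")\""
  payload="$payload,\"source_image_url\":\"$(json_escape "$image_url")\""
  payload="$payload,\"prompt\":\"$(json_escape "$prompt")\""
  if [ -n "$resolution" ]; then
    payload="$payload,\"output_resolution\":\"$resolution\""
  fi
  if [ -n "$seed" ]; then
    payload="$payload,\"seed\":$seed"
  fi
  printf '%s}\n' "$payload"
}
```

The output can be passed to curl with -d "$(build_infinitetalk_payload "$audio" "$image" "$prompt" 720p 12345)", where the three variables are your own values.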

What is InfiniteTalk on OpenClaw?

InfiniteTalk is an audio-driven talking avatar model that turns a single photo and an audio file into a lip-synced video with head movement, facial expressions, and body posture. OpenClaw agents call it through RunAPI to produce unlimited-length talking head videos without filming equipment or motion capture rigs.

InfiniteTalk use cases

Podcast-to-video conversion

Feed podcast audio and a host photo to generate a talking avatar video, turning audio-only episodes into visual content for YouTube or social media.

Educational and tutorial videos

Create lecture or tutorial videos from a single instructor photo and recorded narration, producing long-form talking head content without a camera setup.

Digital spokesperson for marketing

Build a branded digital presenter from one reference image, then generate spokesperson videos in multiple languages by swapping the audio track.

FAQ

InfiniteTalk + OpenClaw questions

OpenClaw general setup

Not configured yet? Start with the RunAPI setup guide for OpenClaw.

OpenClaw setup guide →

InfiniteTalk model catalog

See InfiniteTalk variants, pricing, and API docs.

InfiniteTalk on RunAPI →

Try InfiniteTalk in OpenClaw today.

Get a free RunAPI key, paste the prompt into OpenClaw, and generate lip-synced talking avatar videos from any audio file.