Use InfiniteTalk in OpenClaw.
InfiniteTalk generates lip-synced talking avatar videos from an audio file and a face image. OpenClaw agents call it through RunAPI with the same API key used for chat: send a request to the audio_to_video endpoint, poll the task, and receive a video URL.
Use RunAPI to generate a talking avatar video with InfiniteTalk.
Requirements:
- Read the API key from RUNAPI_API_KEY.
- Call POST https://runapi.ai/api/v1/infinitetalk/audio_to_video
- Set model to "infinitetalk-from-audio".
- Set source_audio_url to the URL of the audio file.
- Set source_image_url to the URL of the face image.
- Set prompt to describe the desired output.
- The task is async. Poll the returned task_id until status is "completed".
- When done, read the video URL from the response output.
curl -X POST https://runapi.ai/api/v1/infinitetalk/audio_to_video \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "infinitetalk-from-audio",
    "source_audio_url": "https://example.com/speech.mp3",
    "source_image_url": "https://example.com/avatar.jpg",
    "prompt": "A professional presenter delivering a product overview"
  }'
{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "infinitetalk-from-audio"
}
Use InfiniteTalk in OpenClaw in three steps
Configure RunAPI
Set RUNAPI_API_KEY in your environment. If you already configured RunAPI for chat in OpenClaw, the same key works for InfiniteTalk — no extra provider setup needed.
export RUNAPI_API_KEY=runapi_xxx
Call InfiniteTalk
Send a POST request to the audio_to_video endpoint with the model, audio URL, image URL, and a prompt describing the output. OpenClaw can build and send this request in your agent workflow.
POST /api/v1/infinitetalk/audio_to_video
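For agents that build the request in code rather than with curl, the same call can be sketched in Python using only the standard library. The helper names build_request and submit are hypothetical, and the sketch assumes the response is the JSON object shown earlier (task_id, status, model).

```python
import json
import urllib.request

API_URL = "https://runapi.ai/api/v1/infinitetalk/audio_to_video"


def build_request(audio_url, image_url, prompt, api_key):
    """Build (but do not send) the POST request that creates a task."""
    payload = {
        "model": "infinitetalk-from-audio",
        "source_audio_url": audio_url,
        "source_image_url": image_url,
        "prompt": prompt,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(request):
    """Send the request and return the parsed JSON response."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


# Usage (performs a real network call, so it is left commented out):
#   import os
#   req = build_request(
#       "https://example.com/speech.mp3",
#       "https://example.com/avatar.jpg",
#       "A professional presenter delivering a product overview",
#       os.environ["RUNAPI_API_KEY"],
#   )
#   task = submit(req)
#   print(task["task_id"], task["status"])
```

Keeping request construction separate from sending makes it possible to inspect or log the payload before any credits are spent.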
Poll for the result
The endpoint returns a task_id immediately. Poll the task status endpoint until the status is completed, then read the output video URL from the response.
GET /api/v1/infinitetalk/audio_to_video/tsk_abc123
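The polling step can be sketched as a small loop. This is a minimal illustration under several assumptions: the status endpoint accepts the same Bearer token as the POST call, a "failed" status exists as a terminal state, and the polling interval and attempt limit are arbitrary defaults. The names fetch_task and poll_until_done are hypothetical. The function returns the whole task object so the caller can read the video URL from its output field, whose exact shape is not specified here.

```python
import json
import time
import urllib.request

BASE_URL = "https://runapi.ai/api/v1/infinitetalk/audio_to_video"


def fetch_task(task_id, api_key):
    """GET the current task state (assumes Bearer auth, as for the POST call)."""
    request = urllib.request.Request(
        f"{BASE_URL}/{task_id}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def poll_until_done(fetch, task_id, interval=5.0, max_attempts=120, sleep=time.sleep):
    """Call fetch(task_id) until the task reports status "completed".

    "failed" is an assumed terminal status; adjust it to the statuses
    the API actually returns.
    """
    for _ in range(max_attempts):
        task = fetch(task_id)
        status = task.get("status")
        if status == "completed":
            return task
        if status == "failed":  # assumption, not confirmed by the docs above
            raise RuntimeError(f"task {task_id} failed: {task}")
        sleep(interval)
    raise TimeoutError(f"task {task_id} not completed after {max_attempts} polls")


# Usage (performs real network calls, so it is left commented out):
#   import os
#   key = os.environ["RUNAPI_API_KEY"]
#   task = poll_until_done(lambda tid: fetch_task(tid, key), "tsk_abc123")
#   print(task.get("output"))
```

Passing the fetch function in as an argument keeps the loop testable without network access and lets an agent swap in its own HTTP client.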
InfiniteTalk API parameters
| Parameter | Type | Description |
|---|---|---|
| model | string | Required. Use infinitetalk-from-audio. |
| source_audio_url | string | Required. URL of the source audio file to lip-sync. |
| source_image_url | string | Required. URL of the face or avatar image to animate. |
| prompt | string | Required. Text description of the desired video output. Max 5000 characters. |
| output_resolution | string | Optional. Output video resolution. Accepted values: 480p, 720p. Defaults to 480p. |
| seed | integer | Optional. Reproducibility seed. Integer between 10000 and 1000000. |
| callback_url | string | Optional. Webhook URL that receives a POST when the task completes. |
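The constraints in the parameter table can be checked client-side before a request is sent. The sketch below is one possible validator, not part of RunAPI; the function name validate_payload is hypothetical, and it assumes the seed bounds are inclusive.

```python
MODEL = "infinitetalk-from-audio"
REQUIRED = ("model", "source_audio_url", "source_image_url", "prompt")
RESOLUTIONS = ("480p", "720p")
MAX_PROMPT_CHARS = 5000
SEED_MIN, SEED_MAX = 10000, 1000000  # bounds assumed to be inclusive


def validate_payload(payload):
    """Return a list of problems found in a request body (empty if none)."""
    errors = []
    for name in REQUIRED:
        value = payload.get(name)
        if not isinstance(value, str) or not value:
            errors.append(f"{name} is required and must be a non-empty string")

    model = payload.get("model")
    if isinstance(model, str) and model and model != MODEL:
        errors.append(f"model must be {MODEL}")

    prompt = payload.get("prompt")
    if isinstance(prompt, str) and len(prompt) > MAX_PROMPT_CHARS:
        errors.append(f"prompt exceeds {MAX_PROMPT_CHARS} characters")

    resolution = payload.get("output_resolution")
    if resolution is not None and resolution not in RESOLUTIONS:
        errors.append("output_resolution must be 480p or 720p")

    seed = payload.get("seed")
    if seed is not None:
        # bool is a subclass of int in Python, so reject it explicitly
        is_int = isinstance(seed, int) and not isinstance(seed, bool)
        if not is_int or not SEED_MIN <= seed <= SEED_MAX:
            errors.append(f"seed must be an integer from {SEED_MIN} to {SEED_MAX}")

    callback = payload.get("callback_url")
    if callback is not None and not isinstance(callback, str):
        errors.append("callback_url must be a string")

    return errors
```

Returning a list of messages rather than raising on the first problem lets an agent report every issue in one pass.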
What is InfiniteTalk on OpenClaw?
InfiniteTalk is an audio-driven talking avatar model that turns a single photo and an audio file into a lip-synced video with head movement, facial expressions, and body posture. OpenClaw agents call it through RunAPI to produce unlimited-length talking head videos without filming equipment or motion capture rigs.
InfiniteTalk use cases
Podcast-to-video conversion
Feed podcast audio and a host photo to generate a talking avatar video, turning audio-only episodes into visual content for YouTube or social media.
Educational and tutorial videos
Create lecture or tutorial videos from a single instructor photo and recorded narration, producing long-form talking head content without a camera setup.
Digital spokesperson for marketing
Build a branded digital presenter from one reference image, then generate spokesperson videos in multiple languages by swapping the audio track.
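The multi-language case amounts to reusing one image and prompt while varying only the audio URL. A minimal sketch, with a hypothetical helper name and placeholder example.com URLs:

```python
def payloads_for_tracks(image_url, prompt, audio_urls_by_language):
    """Return one request body per language, varying only source_audio_url."""
    return {
        language: {
            "model": "infinitetalk-from-audio",
            "source_audio_url": audio_url,
            "source_image_url": image_url,
            "prompt": prompt,
        }
        for language, audio_url in audio_urls_by_language.items()
    }


# Placeholder inputs for illustration only.
bodies = payloads_for_tracks(
    "https://example.com/avatar.jpg",
    "A professional presenter delivering a product overview",
    {
        "en": "https://example.com/speech_en.mp3",
        "es": "https://example.com/speech_es.mp3",
    },
)
```

Each body would then be sent as its own POST to the audio_to_video endpoint, producing one task per language.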
InfiniteTalk + OpenClaw questions
OpenClaw agents can call the RunAPI InfiniteTalk endpoint directly. Configure RunAPI as a provider with your RUNAPI_API_KEY, then send a POST to /api/v1/infinitetalk/audio_to_video with the model, audio URL, image URL, and prompt.
InfiniteTalk uses a sparse-frame approach that supports unlimited-length video generation from one reference image. The output length is determined by the duration of your source audio file.
Image-to-video takes a still photo and animates it with lip sync and head movement. Video-to-video takes an existing video and re-animates the face to match new audio, useful for dubbing or replacing dialogue.
InfiniteTalk uses per-task billing. Check the RunAPI pricing page for current rates. Credits are deducted from the same balance used for all RunAPI models.
OpenClaw general setup
Not configured yet? Start with the RunAPI setup guide for OpenClaw.
OpenClaw setup guide →
InfiniteTalk model catalog
See InfiniteTalk variants, pricing, and API docs.
InfiniteTalk on RunAPI →
Try InfiniteTalk in OpenClaw today.
Get a free RunAPI key, paste the prompt into OpenClaw, and generate lip-synced talking avatar videos from any audio file.