Use InfiniteTalk in Hermes Agent.
InfiniteTalk generates lip-synced talking avatar videos from an audio file and a face image. Hermes Agent calls it through the RunAPI custom endpoint — configure custom:runapi once, send audio_to_video, and poll for the finished video.
Use RunAPI to generate a talking avatar video with InfiniteTalk.
Requirements:
- Read the API key from RUNAPI_API_KEY.
- Use the custom:runapi provider with base_url https://runapi.ai/v1.
- Call POST https://runapi.ai/api/v1/infinitetalk/audio_to_video
- Set model to "infinitetalk-from-audio".
- Set source_audio_url to the URL of the audio file.
- Set source_image_url to the URL of the face image.
- Set prompt to describe the desired output.
- The task is async. Poll the returned task_id until status is "completed".
- When done, read the video URL from the response output.
curl -X POST https://runapi.ai/api/v1/infinitetalk/audio_to_video \
-H "Authorization: Bearer $RUNAPI_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "infinitetalk-from-audio",
"source_audio_url": "https://example.com/speech.mp3",
"source_image_url": "https://example.com/avatar.jpg",
"prompt": "A professional presenter delivering a product overview"
}'
{
"task_id": "tsk_abc123",
"status": "pending",
"model": "infinitetalk-from-audio"
}
Use InfiniteTalk in Hermes Agent in three steps
Configure RunAPI
Set RUNAPI_API_KEY in the environment where Hermes Agent runs. If you already added RunAPI as a custom:runapi provider, the same key handles InfiniteTalk requests.
export RUNAPI_API_KEY=runapi_xxx
Call InfiniteTalk
Send a POST request to the audio_to_video endpoint with the model, audio URL, image URL, and a prompt. Hermes Agent can build and send this request through the custom:runapi provider.
POST /api/v1/infinitetalk/audio_to_video
Poll for the result
The endpoint returns a task_id immediately. Poll the task status endpoint until the status is completed, then read the output video URL from the response.
GET /api/v1/infinitetalk/audio_to_video/tsk_abc123
InfiniteTalk API parameters
| Parameter | Type | Description |
|---|---|---|
model |
string |
Required. Use infinitetalk-from-audio. |
source_audio_url |
string |
Required. URL of the source audio file to lip-sync. |
source_image_url |
string |
Required. URL of the face or avatar image to animate. |
prompt |
string |
Required. Text description of the desired video output. Max 5000 characters. |
output_resolution |
string |
Optional. Output video resolution. Accepted values: 480p, 720p. Defaults to 480p. |
seed |
integer |
Optional. Reproducibility seed. Integer between 10000 and 1000000. |
callback_url |
string |
Optional. Webhook URL that receives a POST when the task completes. |
What is InfiniteTalk on Hermes Agent?
InfiniteTalk creates talking avatar videos by driving face animation, mouth sync, and head movement from an audio file and a reference image. Hermes Agent calls it through the custom:runapi provider to build multi-step pipelines -- for example, generating speech with ElevenLabs and then passing the audio URL to InfiniteTalk for a complete text-to-talking-video workflow.
InfiniteTalk use cases
YouTube content with AI presenters
Generate talking head videos from a single photo for YouTube channels, eliminating the need for on-camera filming while keeping a consistent presenter across all videos.
Video dubbing with lip sync
Re-animate faces in existing videos to match new audio tracks in different languages, producing dubbed content where mouth movements match the translated speech.
Long-form lecture and presentation videos
Create unlimited-length talking avatar videos from recorded narration for online courses, webinars, or internal training content using just one instructor photo.
InfiniteTalk + Hermes Agent questions
Yes. Hermes Agent can call the RunAPI InfiniteTalk endpoint through the custom:runapi provider. Set RUNAPI_API_KEY in your environment and send a POST to /api/v1/infinitetalk/audio_to_video with the model, audio URL, and image URL.
Yes. InfiniteTalk uses a sparse-frame approach that supports unlimited-length video generation from one reference image. The output length matches the duration of the source audio file you provide.
Add a custom:runapi provider in your Hermes Agent configuration with base_url set to https://runapi.ai/v1 and your RUNAPI_API_KEY. Hermes Agent then routes InfiniteTalk requests through this provider automatically.
Yes. Hermes Agent supports multi-step workflows. You can chain a TTS model like ElevenLabs to generate audio, then pass the audio URL to InfiniteTalk for avatar video generation, all within a single agent run.
Hermes Agent general setup
Not configured yet? Start with the RunAPI setup guide for Hermes Agent.
Hermes Agent setup guide →InfiniteTalk model catalog
See InfiniteTalk variants, pricing, and API docs.
InfiniteTalk on RunAPI →Try InfiniteTalk in Hermes Agent today.
Get a free RunAPI key, configure the custom:runapi provider, and generate lip-synced talking avatar videos from any audio file.