Can I use InfiniteTalk in Hermes Agent?

Yes. Hermes Agent can call the RunAPI InfiniteTalk endpoint through the custom:runapi provider. Set RUNAPI_API_KEY in your environment and send a POST to /api/v1/infinitetalk/audio_to_video with the model, audio URL, and image URL.

Can InfiniteTalk generate unlimited-length talking videos from a single photo?

Yes. InfiniteTalk uses a sparse-frame approach that supports unlimited-length video generation from one reference image. The output length matches the duration of the source audio file you provide.

How do I route InfiniteTalk requests through the Hermes custom provider?

Add a custom:runapi provider in your Hermes Agent configuration with base_url set to https://runapi.ai/v1 and your RUNAPI_API_KEY. Hermes Agent then routes InfiniteTalk requests through this provider automatically.

Can I chain InfiniteTalk with other RunAPI models in a Hermes Agent workflow?

Yes. Hermes Agent supports multi-step workflows. You can chain a TTS model like ElevenLabs to generate audio, then pass the audio URL to InfiniteTalk for avatar video generation, all within a single agent run.

HERMES + INFINITETALK

Use InfiniteTalk in Hermes Agent.

InfiniteTalk generates lip-synced talking avatar videos from an audio file and a face image. Hermes Agent calls it through the RunAPI custom endpoint — configure custom:runapi once, send audio_to_video, and poll for the finished video.

Get API Key Read the docs

one API key · audio to video endpoint · async task polling

Use RunAPI to generate a talking avatar video with InfiniteTalk.

Requirements:
- Read the API key from RUNAPI_API_KEY.
- Use the custom:runapi provider with base_url https://runapi.ai/v1.
- Call POST https://runapi.ai/api/v1/infinitetalk/audio_to_video
- Set model to "infinitetalk-from-audio".
- Set source_audio_url to the URL of the audio file.
- Set source_image_url to the URL of the face image.
- Set prompt to describe the desired output.
- The task is async. Poll the returned task_id until status is "completed".
- When done, read the video URL from the response output.

curl -X POST https://runapi.ai/api/v1/infinitetalk/audio_to_video \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "infinitetalk-from-audio",
    "source_audio_url": "https://example.com/speech.mp3",
    "source_image_url": "https://example.com/avatar.jpg",
    "prompt": "A professional presenter delivering a product overview"
  }'

{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "infinitetalk-from-audio"
}

Copy the curl command to test infinitetalk

HOW IT WORKS

Use InfiniteTalk in Hermes Agent in three steps

Configure RunAPI

Set RUNAPI_API_KEY in the environment where Hermes Agent runs. If you already added RunAPI as a custom:runapi provider, the same key handles InfiniteTalk requests.

export RUNAPI_API_KEY=runapi_xxx

Call InfiniteTalk

Send a POST request to the audio_to_video endpoint with the model, audio URL, image URL, and a prompt. Hermes Agent can build and send this request through the custom:runapi provider.

POST /api/v1/infinitetalk/audio_to_video

Poll for the result

The endpoint returns a task_id immediately. Poll the task status endpoint until the status is completed, then read the output video URL from the response.

GET /api/v1/infinitetalk/audio_to_video/tsk_abc123

PARAMETERS

InfiniteTalk API parameters

Parameter	Type	Description
`model`	`string`	Required. Use infinitetalk-from-audio.
`source_audio_url`	`string`	Required. URL of the source audio file to lip-sync.
`source_image_url`	`string`	Required. URL of the face or avatar image to animate.
`prompt`	`string`	Required. Text description of the desired video output. Max 5000 characters.
`output_resolution`	`string`	Optional. Output video resolution. Accepted values: 480p, 720p. Defaults to 480p.
`seed`	`integer`	Optional. Reproducibility seed. Integer between 10000 and 1000000.
`callback_url`	`string`	Optional. Webhook URL that receives a POST when the task completes.

What is InfiniteTalk on Hermes Agent?

InfiniteTalk creates talking avatar videos by driving face animation, mouth sync, and head movement from an audio file and a reference image. Hermes Agent calls it through the custom:runapi provider to build multi-step pipelines -- for example, generating speech with ElevenLabs and then passing the audio URL to InfiniteTalk for a complete text-to-talking-video workflow.

InfiniteTalk use cases

YouTube content with AI presenters

Generate talking head videos from a single photo for YouTube channels, eliminating the need for on-camera filming while keeping a consistent presenter across all videos.

Video dubbing with lip sync

Re-animate faces in existing videos to match new audio tracks in different languages, producing dubbed content where mouth movements match the translated speech.

Long-form lecture and presentation videos

Create unlimited-length talking avatar videos from recorded narration for online courses, webinars, or internal training content using just one instructor photo.

FAQ

InfiniteTalk + Hermes Agent questions

Hermes Agent general setup

Not configured yet? Start with the RunAPI setup guide for Hermes Agent.

Hermes Agent setup guide →

InfiniteTalk model catalog

See InfiniteTalk variants, pricing, and API docs.

InfiniteTalk on RunAPI →

Try InfiniteTalk in Hermes Agent today.

Get a free RunAPI key, configure the custom:runapi provider, and generate lip-synced talking avatar videos from any audio file.

Browse models →