Use Nano Banana in Hermes Agent.
Nano Banana is Google's Gemini-native image generation model with industry-leading text rendering in generated images. Pro outputs up to 4K resolution, Nano Banana 2 optimizes for speed with extended aspect ratios, and Edit modifies existing images. Hermes Agent calls all three through the RunAPI custom provider — no ComfyUI or GPU setup needed.
Use RunAPI to generate an image with Google Nano Banana Pro from Hermes Agent.
Requirements:
- Read the API key from RUNAPI_API_KEY. Do not hardcode the key.
- Use the custom:runapi provider already configured in Hermes Agent.
- Send a POST request to https://runapi.ai/api/v1/nano_banana/text_to_image.
- Set the model to "nano-banana-pro".
- Write a descriptive prompt. Nano Banana excels at rendering text within images, so include any text overlays in the prompt.
- Optionally set output_resolution to "4k" for maximum quality.
- The response returns a task_id. Poll the task status endpoint until the task completes.
- When the task is complete, retrieve the image URL from the response.
curl -X POST https://runapi.ai/api/v1/nano_banana/text_to_image \
-H "Authorization: Bearer $RUNAPI_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "nano-banana-pro",
"prompt": "A minimalist product card for a coffee brand, the text DARK ROAST in bold sans-serif across the top, a steaming cup below, warm studio lighting",
"aspect_ratio": "4:3",
"output_resolution": "4k",
"output_format": "png"
}'
{
"task_id": "tsk_abc123",
"status": "pending",
"model": "nano-banana-pro"
}
Use Nano Banana in Hermes Agent in three steps
Configure RunAPI
Set the RUNAPI_API_KEY environment variable. If you already configured RunAPI as a custom:runapi provider in Hermes Agent for chat, the same key and base_url work for Nano Banana image generation.
export RUNAPI_API_KEY=runapi_xxx
Call Nano Banana
Send a POST to the text_to_image endpoint with model set to nano-banana-pro for highest quality, nano-banana-2 for speed, or nano-banana for the base variant. Set output_resolution to 4k on Pro or 2 for maximum detail. For editing, POST to edit_image with nano-banana-edit and include an image_url.
POST /api/v1/nano_banana/text_to_image
Get the result
The response includes a task_id. Poll the task status endpoint until status changes to completed. The finished response contains the generated image URL. RunAPI SDKs and the CLI handle polling automatically.
task_id: tsk_abc123
Nano Banana API parameters
| Parameter | Type | Description |
|---|---|---|
model |
string |
Required. nano-banana-pro (highest quality, 4K), nano-banana-2 (fast, extended ratios), nano-banana (base), or nano-banana-edit (editing). |
prompt |
string |
Text description of the desired image. Include any text you want rendered in the image — Nano Banana handles typography natively. |
image_url |
string |
Source image URL. Required for nano-banana-edit, ignored for text_to_image variants. |
aspect_ratio |
string |
Optional. Output aspect ratio. Pro accepts: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, auto. Nano Banana 2 adds 1:4, 1:8, 4:1, 8:1. |
output_resolution |
string |
Optional. Pro and 2 only. Accepted values: 1k, 2k, 4k. Defaults to 1k. |
output_format |
string |
Optional. Output file format. Accepted values: png, jpeg, jpg. |
callback_url |
string |
Optional. Webhook URL that receives a POST when the task completes. |
What is Nano Banana on Hermes Agent?
Nano Banana is the Google image model that reviewers describe as a "Photoshop killer" for its organic design features and world knowledge integration. It generates images that understand real-world context -- ask for a specific landmark, product type, or design style and it draws from built-in knowledge rather than guessing. Hermes Agent calls all three variants through the RunAPI custom provider.
Nano Banana use cases
E-commerce product catalog imagery
Generate consistent product photography for entire catalogs -- same lighting, same angles, same brand style -- without a physical photo studio.
UI mockups and packaging concepts
Turn written design briefs into visual mockups for apps, packaging, or physical products, getting to visual review faster than manual wireframing.
Blog post illustrations and branded series
Create a consistent visual series for blog posts or social media campaigns, where each image maintains the same style and brand identity across dozens of outputs.
Nano Banana + Hermes Agent questions
Nano Banana Pro delivers the highest image quality with up to 4K output resolution. Nano Banana 2 is optimized for speed and supports extra aspect ratios like 1:4 and 8:1 for banner and tall formats. The base nano-banana variant is the most affordable option at standard resolution. All three share the same text_to_image endpoint.
For text-to-image generation, POST to /api/v1/nano_banana/text_to_image with any generation variant (nano-banana-pro, nano-banana-2, or nano-banana). For editing, POST to /api/v1/nano_banana/edit_image with model nano-banana-edit and include the source image_url. Both endpoints use the same custom:runapi provider and RUNAPI_API_KEY.
Nano Banana Pro pricing varies by output_resolution: 1k and 2k are the same rate, while 4k costs more per image. Nano Banana 2 follows a similar tier structure. Check the RunAPI pricing page for exact per-image rates. There is no monthly minimum.
No. Hermes Agent calls Nano Banana through the RunAPI custom:runapi provider endpoint. RunAPI handles all GPU infrastructure server-side. No ComfyUI workflow, no cloud GPU rental, no local VRAM requirements — send a prompt and receive an image URL.
Configure the custom:runapi provider in Hermes Agent and set the model to the Nano Banana slug. Hermes Agent sends the request to RunAPI's image generation endpoint, which routes to Google's Gemini-native image generation model.
Nano Banana 2 and Pro are competitive with Midjourney V8 on photorealism and significantly ahead on text rendering accuracy. The main advantage is API access -- Midjourney requires Discord or a separate web UI, while Nano Banana runs through a standard API endpoint via RunAPI.