Can I use GPT Image 2 in OpenClaw?

Yes. OpenClaw agents call GPT Image 2 through the RunAPI text_to_image endpoint. Set the model field to gpt-image-2-text-to-image and send the request with the same RUNAPI_API_KEY you use for chat. No extra skills or plugins required.

What is the difference between GPT Image 2 and GPT-4o Image?

GPT Image 2 is OpenAI's dedicated image generation model with higher quality, 4K output, and transparent background support. GPT-4o Image generates images within a chat context but is limited to 1:1, 3:2, or 2:3 aspect ratios. Both are available through RunAPI — use gpt-image-2-text-to-image for standalone generation and gpt-4o-image for chat-integrated image output.

Does GPT Image 2 support transparent backgrounds?

Yes. GPT Image 2 can output images with transparent backgrounds when instructed in the prompt. This is useful for product photos, logos, and UI elements. Specify transparency in your prompt — for example, "product photo with transparent background."

How does GPT Image 2 pricing work on RunAPI?

GPT Image 2 is billed per image based on output resolution: 1k is the lowest cost, 2k is mid-range, and 4k is the most expensive. The same rate applies to both text_to_image and edit_image. Check the RunAPI pricing page for current per-image rates. Failed generations are not charged.

Can I edit an existing image with GPT Image 2?

Yes. Use the edit_image endpoint with model set to gpt-image-2-image-to-image. Pass the source image URLs in source_image_urls and describe the edit in the prompt — for example, "change the background to a beach sunset" or "add a red hat to the person." GPT Image 2 follows natural language editing instructions.

OPENCLAW + GPT IMAGE

OpenClaw で GPT Image を使う。

GPT Image 2 は OpenAI の専用画像生成モデルです——テキスト→画像と指示ベースの画像編集に対応し、出力解像度は最大 4K、透明背景もサポートします。OpenClaw agent は、チャットで使うのと同じ RunAPI キーと /v1 エンドポイントで呼び出し、追加の skill をインストールする必要はありません。

API キーを取得ドキュメントを読む

1つの APIキー · テキスト→画像 + 画像編集 · 最大 4K 出力

RunAPI を使って OpenAI GPT Image 2 で画像を生成します。

要件：
- https://runapi.ai/v1/text_to_image の RunAPI API を使用します。
- RUNAPI_API_KEY 環境変数から API キーを読み込みます。
- model を "gpt-image-2-text-to-image" に設定します。
- 説明的な prompt を書きます。GPT Image 2 は自然言語の指示に忠実に従います——レイアウト、スタイル、テキストオーバーレイ、透明度の要件を記述します。
- 任意で output_resolution を 1k、2k、または 4k に設定します。デフォルトは 1k です。
- レスポンスは task_id を返します。タスクが完了するまでタスクステータスエンドポイントをポーリングし、出力 URL を取得します。

curl -X POST https://runapi.ai/v1/text_to_image \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-text-to-image",
    "prompt": "A product photo of a glass perfume bottle on a marble surface, transparent background, studio lighting, the label reads AURORA in gold serif font",
    "output_resolution": "2k",
    "aspect_ratio": "3:4"
  }'

{
  "task_id": "tsk_abc123",
  "status": "pending",
  "model": "gpt-image-2-text-to-image"
}

curlコマンドをコピーしてテスト gpt-image

仕組み

OpenClaw で GPT Image を使う3ステップ

Configure RunAPI

Set the RUNAPI_API_KEY environment variable in your shell profile. If RunAPI is already configured in OpenClaw for chat, the same key works for GPT Image — no additional setup needed.

export RUNAPI_API_KEY=runapi_xxx

Call GPT Image 2

Send a POST request to the text_to_image endpoint with model set to gpt-image-2-text-to-image. Include a descriptive prompt with layout and style instructions. Set output_resolution to 2k or 4k for higher detail. For editing existing images, use the edit_image endpoint with gpt-image-2-image-to-image and provide source_image_urls.

POST /v1/text_to_image

Get the result

The API returns a task_id immediately. Poll the task status endpoint until the status changes to completed, then retrieve the output image URL from the response. GPT Image 2 typically completes within 10–30 seconds depending on resolution.

task_id: tsk_abc123

パラメータ

GPT Image API パラメータ

パラメータ	型	説明
`model`	`string`	Required. gpt-image-2-text-to-image for generation, gpt-image-2-image-to-image for editing.
`prompt`	`string`	Required. Natural language description of the desired image. Supports detailed instructions for layout, text overlays, and style.
`output_resolution`	`string`	Optional. Output resolution — 1k (default), 2k, or 4k. Higher resolution costs more per image.
`aspect_ratio`	`string`	Optional. Defaults to auto. Supports 1:1, 3:2, 2:3, 4:3, 3:4, 16:9, 9:16, and more.
`source_image_urls`	`array`	Required for edit_image endpoint. One or more URLs of source images to edit.

OpenClaw上のGPT Imageとは？

GPT Image 2はOpenAIの専用画像モデルで、キーワード駆動のジェネレーターよりも構造化されたデザインアシスタントとして機能します。制作ブリーフ——レイアウト・テキスト配置・スタイル制約——を与えると、指示に厳密に従います。OpenClaw agentはチャットと同じRunAPIエンドポイントを通じて呼び出します。