---
title: &quot;RunAPI 経由で Hermes Agent で GPT を使う — 生成AI API ガイド&quot;
url: &quot;https://runapi.ai/ja/hermes-gpt.md&quot;
canonical: &quot;https://runapi.ai/ja/hermes-gpt&quot;
locale: &quot;ja&quot;
model: &quot;gpt&quot;
---

# Hermes Agent で GPT を使う。

GPT-5.5 は OpenAI のフラッグシップ生成AIで、RunAPI を通じて公式のトークン単価の半額で利用できます。Hermes Agent は chat_completions モードで custom:runapi プロバイダーを介して接続します——1つの設定ブロックですべての GPT バリアント（5.5、5.4、5.4-mini、5.3-codex）がストリーミング、関数呼び出し、構造化出力とともに解放されます。

## API example

```bash
curl -X POST https://runapi.ai/v1/chat/completions \
  -H &quot;Authorization: Bearer $RUNAPI_API_KEY&quot; \
  -H &quot;Content-Type: application/json&quot; \
  -d &#39;{
    &quot;model&quot;: &quot;gpt-5.5&quot;,
    &quot;messages&quot;: [
      {&quot;role&quot;: &quot;system&quot;, &quot;content&quot;: &quot;You are a concise coding assistant.&quot;},
      {&quot;role&quot;: &quot;user&quot;, &quot;content&quot;: &quot;Write a Python function that merges two sorted lists in O(n) time.&quot;}
    ],
    &quot;temperature&quot;: 0.3,
    &quot;max_tokens&quot;: 1024
  }&#39;

```

### Response

```json
{
  &quot;id&quot;: &quot;chatcmpl-abc123&quot;,
  &quot;object&quot;: &quot;chat.completion&quot;,
  &quot;model&quot;: &quot;gpt-5.5&quot;,
  &quot;choices&quot;: [
    {
      &quot;index&quot;: 0,
      &quot;message&quot;: {
        &quot;role&quot;: &quot;assistant&quot;,
        &quot;content&quot;: &quot;def merge_sorted(a, b):\n    result = []\n    i = j = 0\n    while i &lt; len(a) and j &lt; len(b):\n        if a[i] &lt;= b[j]:\n            result.append(a[i]); i += 1\n        else:\n            result.append(b[j]); j += 1\n    result.extend(a[i:])\n    result.extend(b[j:])\n    return result&quot;
      },
      &quot;finish_reason&quot;: &quot;stop&quot;
    }
  ],
  &quot;usage&quot;: {
    &quot;prompt_tokens&quot;: 38,
    &quot;completion_tokens&quot;: 95,
    &quot;total_tokens&quot;: 133
  }
}

```

## How it works

1. **Add RunAPI as a custom provider** — If the custom:runapi provider is already configured in Hermes Agent, the same key works for GPT. Otherwise, add a custom provider with base_url https://runapi.ai/v1, key_env set to RUNAPI_API_KEY, and api_mode set to chat_completions.
2. **Select a GPT model** — Set the default model to gpt-5.5 for the flagship, gpt-5.4 or gpt-5.4-mini for lower cost, or gpt-5.3-codex for code-heavy tasks. The /v1/chat/completions endpoint returns a standard OpenAI response with usage counts and finish_reason.
3. **Use streaming or function calling** — Hermes Agent forwards stream, tools, and response_format parameters through the custom:runapi provider. All standard OpenAI Chat Completions parameters work through RunAPI without modification.

## Parameters

| Parameter | Type | Description |
|-----------|------|-------------|
| `model` | `string` | Required. gpt-5.5, gpt-5.4, gpt-5.4-mini, gpt-5.4-nano, gpt-5.3-codex, or gpt-5.2. |
| `messages` | `array` | Required. Array of message objects with role (system, user, assistant) and content fields. |
| `temperature` | `number` | Optional. Sampling temperature between 0 and 2. Lower values produce more deterministic output. Defaults to 1. |
| `max_tokens` | `integer` | Optional. Maximum number of tokens to generate in the completion. |
| `stream` | `boolean` | Optional. When true, returns server-sent events with incremental token deltas. Defaults to false. |
| `tools` | `array` | Optional. Array of tool definitions for function calling. Each tool has a type, function name, description, and parameters schema. |
| `response_format` | `object` | Optional. Set type to &quot;json_object&quot; or &quot;json_schema&quot; for structured JSON output. |
| `reasoning_effort` | `string` | Optional. Controls thinking depth for supported models. Accepted values are low, medium, high. |

## FAQ

### Can I use GPT-5.5 in Hermes Agent through RunAPI?

Yes. Hermes Agent supports custom OpenAI-compatible providers. Add RunAPI as custom:runapi with base_url https://runapi.ai/v1, key_env set to RUNAPI_API_KEY, and api_mode set to chat_completions. Set the default model to gpt-5.5.

### How does RunAPI GPT pricing compare to official OpenAI pricing?

RunAPI charges 50% of the official OpenAI per-token rate for all GPT models. The discount applies to both input and output tokens. Check the RunAPI pricing page for exact per-million-token rates.

### Which GPT model should I use -- 5.5 vs 5.4 vs mini vs codex?

GPT-5.5 for complex reasoning and hard problems. GPT-5.4 for everyday tasks at lower cost. GPT-5.4-mini for high-volume cheap work like classification. GPT-5.3-codex for code generation and editing. Switch between them by changing only the model field -- no provider reconfiguration needed.

### Does the Responses API work through RunAPI in Hermes Agent?

Yes. RunAPI also proxies the OpenAI Responses API at /v1/responses. If Hermes Agent supports the Responses API surface, set the endpoint to https://runapi.ai/v1/responses. The same API key and custom provider work for both endpoints.

### How do I use structured outputs to guarantee valid JSON from GPT?

Set response_format to json_schema with a schema definition in your request. GPT will constrain its output to match your schema exactly. RunAPI forwards the schema parameter unchanged. This works for data extraction, form parsing, and any task where you need predictable JSON structure.

### Can Hermes Agent switch between GPT models dynamically per request?

Yes. Set the model parameter per request. Hermes Agent can route simple tasks to GPT-5.4-mini for cost efficiency and complex reasoning to GPT-5.5 for quality, all through the same RunAPI provider.


## Links

- [Hermes Agent セットアップガイド →](https://runapi.ai/ja/hermes-agent)
- [GPT on RunAPI →](https://runapi.ai/ja/models/gpt)
- [Model catalog](https://runapi.ai/ja/models)
- [API docs](https://runapi.ai/ja/docs)
