Can I use GPT-5.5 in Hermes Agent through RunAPI?

Yes. Hermes Agent supports custom OpenAI-compatible providers. Add RunAPI as custom:runapi with base_url https://runapi.ai/v1, key_env set to RUNAPI_API_KEY, and api_mode set to chat_completions. Set the default model to gpt-5.5.

How does RunAPI GPT pricing compare to official OpenAI pricing?

RunAPI charges 50% of the official OpenAI per-token rate for all GPT models. The discount applies to both input and output tokens. Check the RunAPI pricing page for exact per-million-token rates.

Which GPT model should I use -- 5.5 vs 5.4 vs mini vs codex?

GPT-5.5 for complex reasoning and hard problems. GPT-5.4 for everyday tasks at lower cost. GPT-5.4-mini for high-volume cheap work like classification. GPT-5.3-codex for code generation and editing. Switch between them by changing only the model field -- no provider reconfiguration needed.

Does the Responses API work through RunAPI in Hermes Agent?

Yes. RunAPI also proxies the OpenAI Responses API at /v1/responses. If Hermes Agent supports the Responses API surface, set the endpoint to https://runapi.ai/v1/responses. The same API key and custom provider work for both endpoints.

How do I use structured outputs to guarantee valid JSON from GPT?

Set response_format to json_schema with a schema definition in your request. GPT will constrain its output to match your schema exactly. RunAPI forwards the schema parameter unchanged. This works for data extraction, form parsing, and any task where you need predictable JSON structure.

Can Hermes Agent switch between GPT models dynamically per request?

Yes. Set the model parameter per request. Hermes Agent can route simple tasks to GPT-5.4-mini for cost efficiency and complex reasoning to GPT-5.5 for quality, all through the same RunAPI provider.

HERMES + GPT

在 Hermes Agent 中使用 GPT。

GPT-5.5 是 OpenAI 的旗舰大模型，通过 RunAPI 可以官方每 token 价格的一半使用。Hermes Agent 通过 custom:runapi 提供商以 chat_completions 模式连接——一个配置块即可解锁每个 GPT 版本（5.5、5.4、5.4-mini、5.3-codex），并支持流式传输、函数调用和结构化输出。

获取 API Key 阅读文档

一个 API 密钥 · OpenAI 兼容 · 流式响应

使用 RunAPI 通过 OpenAI 兼容的 Chat Completions 端点调用 GPT-5.5。

要求：
- 从 RUNAPI_API_KEY 读取 API 密钥。
- 使用 custom:runapi 提供商，base_url 为 https://runapi.ai/v1。
- 调用 POST https://runapi.ai/v1/chat/completions
- 将 model 设置为 "gpt-5.5"。
- 包含一个至少有一条 user 消息的 messages 数组。
- 响应是同步的——补全结果在同一个 HTTP 响应中返回。
- 如需流式传输，设置 "stream": true 以接收 server-sent events。
- 如需使用 Responses API，改为调用 POST https://runapi.ai/v1/responses。

curl -X POST https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "messages": [
      {"role": "system", "content": "You are a concise coding assistant."},
      {"role": "user", "content": "Write a Python function that merges two sorted lists in O(n) time."}
    ],
    "temperature": 0.3,
    "max_tokens": 1024
  }'

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "model": "gpt-5.5",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "def merge_sorted(a, b):\n    result = []\n    i = j = 0\n    while i < len(a) and j < len(b):\n        if a[i] <= b[j]:\n            result.append(a[i]); i += 1\n        else:\n            result.append(b[j]); j += 1\n    result.extend(a[i:])\n    result.extend(b[j:])\n    return result"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 38,
    "completion_tokens": 95,
    "total_tokens": 133
  }
}

复制 curl 命令进行测试 gpt

工作原理

三步在 Hermes Agent 中使用 GPT

Add RunAPI as a custom provider

If the custom:runapi provider is already configured in Hermes Agent, the same key works for GPT. Otherwise, add a custom provider with base_url https://runapi.ai/v1, key_env set to RUNAPI_API_KEY, and api_mode set to chat_completions.

export RUNAPI_API_KEY=runapi_xxx

Select a GPT model

Set the default model to gpt-5.5 for the flagship, gpt-5.4 or gpt-5.4-mini for lower cost, or gpt-5.3-codex for code-heavy tasks. The /v1/chat/completions endpoint returns a standard OpenAI response with usage counts and finish_reason.

default: gpt-5.5

Use streaming or function calling

Hermes Agent forwards stream, tools, and response_format parameters through the custom:runapi provider. All standard OpenAI Chat Completions parameters work through RunAPI without modification.

"stream": true

参数

GPT Chat Completions 参数

参数	类型	说明
`model`	`string`	Required. gpt-5.5, gpt-5.4, gpt-5.4-mini, gpt-5.4-nano, gpt-5.3-codex, or gpt-5.2.
`messages`	`array`	Required. Array of message objects with role (system, user, assistant) and content fields.
`temperature`	`number`	Optional. Sampling temperature between 0 and 2. Lower values produce more deterministic output. Defaults to 1.
`max_tokens`	`integer`	Optional. Maximum number of tokens to generate in the completion.
`stream`	`boolean`	Optional. When true, returns server-sent events with incremental token deltas. Defaults to false.
`tools`	`array`	Optional. Array of tool definitions for function calling. Each tool has a type, function name, description, and parameters schema.
`response_format`	`object`	Optional. Set type to "json_object" or "json_schema" for structured JSON output.
`reasoning_effort`	`string`	Optional. Controls thinking depth for supported models. Accepted values are low, medium, high.

Hermes Agent 上的 GPT 是什么？

GPT 是 OpenAI 的 LLM 系列，通过 RunAPI 的 custom:runapi provider 以官方单 token 价格的一半提供。Hermes Agent 使用标准 chat_completions API 模式连接，因此 GPT-5.5、5.4、5.4-mini 和 5.3-codex 均支持流式输出、函数调用、结构化 JSON 输出和视觉输入——全部通过与 Claude 或 Gemini 相同的 provider 配置。