---
title: &quot;Hermes Agent에서 RunAPI로 GPT 사용하기 — LLM API 가이드&quot;
url: &quot;https://runapi.ai/ko/hermes-gpt.md&quot;
canonical: &quot;https://runapi.ai/ko/hermes-gpt&quot;
locale: &quot;ko&quot;
model: &quot;gpt&quot;
---

# Hermes Agent에서 GPT를 사용하세요.

GPT-5.5는 OpenAI의 주력 LLM으로, 공식 토큰당 가격의 절반으로 RunAPI를 통해 이용할 수 있습니다. Hermes Agent는 chat_completions 모드를 사용하는 custom:runapi 공급자를 통해 연결하며 — 하나의 설정 블록으로 스트리밍, 함수 호출, 구조화된 출력을 지원하는 모든 GPT 변형(5.5, 5.4, 5.4-mini, 5.3-codex)을 사용할 수 있습니다.

## API example

```bash
curl -X POST https://runapi.ai/v1/chat/completions \
  -H &quot;Authorization: Bearer $RUNAPI_API_KEY&quot; \
  -H &quot;Content-Type: application/json&quot; \
  -d &#39;{
    &quot;model&quot;: &quot;gpt-5.5&quot;,
    &quot;messages&quot;: [
      {&quot;role&quot;: &quot;system&quot;, &quot;content&quot;: &quot;You are a concise coding assistant.&quot;},
      {&quot;role&quot;: &quot;user&quot;, &quot;content&quot;: &quot;Write a Python function that merges two sorted lists in O(n) time.&quot;}
    ],
    &quot;temperature&quot;: 0.3,
    &quot;max_tokens&quot;: 1024
  }&#39;

```

### Response

```json
{
  &quot;id&quot;: &quot;chatcmpl-abc123&quot;,
  &quot;object&quot;: &quot;chat.completion&quot;,
  &quot;model&quot;: &quot;gpt-5.5&quot;,
  &quot;choices&quot;: [
    {
      &quot;index&quot;: 0,
      &quot;message&quot;: {
        &quot;role&quot;: &quot;assistant&quot;,
        &quot;content&quot;: &quot;def merge_sorted(a, b):\n    result = []\n    i = j = 0\n    while i &lt; len(a) and j &lt; len(b):\n        if a[i] &lt;= b[j]:\n            result.append(a[i]); i += 1\n        else:\n            result.append(b[j]); j += 1\n    result.extend(a[i:])\n    result.extend(b[j:])\n    return result&quot;
      },
      &quot;finish_reason&quot;: &quot;stop&quot;
    }
  ],
  &quot;usage&quot;: {
    &quot;prompt_tokens&quot;: 38,
    &quot;completion_tokens&quot;: 95,
    &quot;total_tokens&quot;: 133
  }
}

```

## How it works

1. **RunAPI를 커스텀 공급자로 추가** — Hermes Agent에 이미 custom:runapi 공급자가 설정되어 있다면 동일한 key로 GPT를 사용할 수 있습니다. 그렇지 않은 경우 base_url을 https://runapi.ai/v1로, key_env를 RUNAPI_API_KEY로, api_mode를 chat_completions로 설정하여 커스텀 공급자를 추가하세요.
2. **GPT 모델 선택** — 주력 모델로 gpt-5.5를, 비용 절감을 위해 gpt-5.4 또는 gpt-5.4-mini를, 코드 집중 작업에는 gpt-5.3-codex를 기본 모델로 설정하세요. /v1/chat/completions 엔드포인트는 사용량 카운트와 finish_reason이 포함된 표준 OpenAI 응답을 반환합니다.
3. **스트리밍 또는 함수 호출 사용** — Hermes Agent가 custom:runapi 공급자를 통해 stream, tools, response_format 파라미터를 전달합니다. 모든 표준 OpenAI Chat Completions 파라미터가 RunAPI를 통해 수정 없이 작동합니다.

## Parameters

| Parameter | Type | Description |
|-----------|------|-------------|
| `model` | `string` | 필수. gpt-5.5, gpt-5.4, gpt-5.4-mini, gpt-5.4-nano, gpt-5.3-codex, 또는 gpt-5.2. |
| `messages` | `array` | 필수. role(system, user, assistant)과 content 필드가 있는 메시지 객체 배열. |
| `temperature` | `number` | 선택 사항. 0~2 사이의 샘플링 온도. 낮을수록 결정론적 출력이 생성됩니다. 기본값 1. |
| `max_tokens` | `integer` | 선택 사항. 완성에서 생성할 최대 토큰 수. |
| `stream` | `boolean` | 선택 사항. true이면 증분 토큰 델타가 포함된 서버 전송 이벤트를 반환합니다. 기본값 false. |
| `tools` | `array` | 선택 사항. 함수 호출용 도구 정의 배열. 각 도구에는 type, 함수 이름, 설명, 파라미터 스키마가 있습니다. |
| `response_format` | `object` | 선택 사항. 구조화된 JSON 출력을 위해 type을 &quot;json_object&quot; 또는 &quot;json_schema&quot;로 설정하세요. |
| `reasoning_effort` | `string` | 선택 사항. 지원 모델의 사고 깊이를 제어합니다. 허용 값은 low, medium, high. |

## FAQ

### RunAPI를 통해 Hermes Agent에서 GPT-5.5를 사용할 수 있나요?

네. Hermes Agent는 OpenAI 호환 커스텀 공급자를 지원합니다. base_url을 https://runapi.ai/v1로, key_env를 RUNAPI_API_KEY로, api_mode를 chat_completions로 설정하여 RunAPI를 custom:runapi로 추가하세요. 기본 모델을 gpt-5.5로 설정하세요.

### RunAPI GPT 가격은 공식 OpenAI 가격과 어떻게 다른가요?

RunAPI는 모든 GPT 모델에 대해 공식 OpenAI 토큰당 요금의 50%를 청구합니다. 할인은 입력 및 출력 토큰 모두에 적용됩니다. 정확한 백만 토큰당 요금은 RunAPI 가격 페이지를 확인하세요.

### 공급자 설정을 변경하지 않고 GPT 변형 간에 전환할 수 있나요?

네. custom:runapi 공급자는 모든 GPT 변형에서 작동합니다. 요청의 model 필드만 변경하면 됩니다 — gpt-5.5, gpt-5.4, gpt-5.4-mini, 또는 gpt-5.3-codex. base URL, API key, api_mode는 동일하게 유지됩니다.

### Hermes Agent에서 RunAPI를 통해 Responses API가 작동하나요?

네. RunAPI는 /v1/responses에서 OpenAI Responses API도 프록시합니다. Hermes Agent가 Responses API 인터페이스를 지원하는 경우 엔드포인트를 https://runapi.ai/v1/responses로 설정하세요. 동일한 API key와 커스텀 공급자가 두 엔드포인트 모두에서 작동합니다.

### RunAPI에서 GPT로 함수 호출과 구조화된 출력을 사용할 수 있나요?

네. 함수 호출을 위해 tools 배열을 전달하거나 구조화된 출력을 위해 response_format을 json_schema로 설정하세요. RunAPI가 이 파라미터들을 GPT 모델로 전달하고 표준 OpenAI 응답 형식으로 tool_calls 또는 구조화된 JSON을 반환합니다.


## Links

- [Hermes Agent 설정 가이드 →](https://runapi.ai/ko/hermes-agent)
- [RunAPI의 GPT →](https://runapi.ai/ko/models/gpt)
- [Model catalog](https://runapi.ai/ko/models)
- [API docs](https://runapi.ai/ko/docs)