---
title: &quot;GPT in Hermes Agent via RunAPI verwenden — LLM-API-Anleitung&quot;
url: &quot;https://runapi.ai/de/hermes-gpt.md&quot;
canonical: &quot;https://runapi.ai/de/hermes-gpt&quot;
locale: &quot;de&quot;
model: &quot;gpt&quot;
---

# GPT in Hermes Agent verwenden.

GPT-5.5 ist OpenAIs Flagship-LLM, verfügbar über RunAPI zum halben offiziellen Token-Preis. Hermes Agent verbindet sich über den custom:runapi-Provider im chat_completions-Modus — ein Konfigurationsblock schaltet jede GPT-Variante (5.5, 5.4, 5.4-mini, 5.3-codex) mit Streaming, Function Calling und strukturiertem Output frei.

## API example

```bash
curl -X POST https://runapi.ai/v1/chat/completions \
  -H &quot;Authorization: Bearer $RUNAPI_API_KEY&quot; \
  -H &quot;Content-Type: application/json&quot; \
  -d &#39;{
    &quot;model&quot;: &quot;gpt-5.5&quot;,
    &quot;messages&quot;: [
      {&quot;role&quot;: &quot;system&quot;, &quot;content&quot;: &quot;You are a concise coding assistant.&quot;},
      {&quot;role&quot;: &quot;user&quot;, &quot;content&quot;: &quot;Write a Python function that merges two sorted lists in O(n) time.&quot;}
    ],
    &quot;temperature&quot;: 0.3,
    &quot;max_tokens&quot;: 1024
  }&#39;

```

### Response

```json
{
  &quot;id&quot;: &quot;chatcmpl-abc123&quot;,
  &quot;object&quot;: &quot;chat.completion&quot;,
  &quot;model&quot;: &quot;gpt-5.5&quot;,
  &quot;choices&quot;: [
    {
      &quot;index&quot;: 0,
      &quot;message&quot;: {
        &quot;role&quot;: &quot;assistant&quot;,
        &quot;content&quot;: &quot;def merge_sorted(a, b):\n    result = []\n    i = j = 0\n    while i &lt; len(a) and j &lt; len(b):\n        if a[i] &lt;= b[j]:\n            result.append(a[i]); i += 1\n        else:\n            result.append(b[j]); j += 1\n    result.extend(a[i:])\n    result.extend(b[j:])\n    return result&quot;
      },
      &quot;finish_reason&quot;: &quot;stop&quot;
    }
  ],
  &quot;usage&quot;: {
    &quot;prompt_tokens&quot;: 38,
    &quot;completion_tokens&quot;: 95,
    &quot;total_tokens&quot;: 133
  }
}

```

## How it works

1. **RunAPI als Custom-Provider hinzufügen** — Wenn der custom:runapi-Provider bereits in Hermes Agent konfiguriert ist, funktionieren derselbe Key und dieselbe /v1-Base-URL für GPT. Andernfalls einen Custom-Provider mit base_url https://runapi.ai/v1, key_env RUNAPI_API_KEY und api_mode chat_completions hinzufügen.
2. **Ein GPT-Modell auswählen** — Das Standardmodell auf gpt-5.5 für das Flaggschiff, gpt-5.4 oder gpt-5.4-mini für geringere Kosten oder gpt-5.3-codex für code-intensive Aufgaben setzen. Der /v1/chat/completions-Endpunkt gibt eine Standard-OpenAI-Antwort mit Usage-Counts und finish_reason zurück.
3. **Streaming oder Function Calling verwenden** — Hermes Agent leitet stream-, tools- und response_format-Parameter über den custom:runapi-Provider weiter. Alle Standard-OpenAI-Chat-Completions-Parameter funktionieren über RunAPI ohne Änderungen.

## Parameters

| Parameter | Type | Description |
|-----------|------|-------------|
| `model` | `string` | Erforderlich. gpt-5.5, gpt-5.4, gpt-5.4-mini, gpt-5.4-nano, gpt-5.3-codex oder gpt-5.2. |
| `messages` | `array` | Erforderlich. Array von Message-Objekten mit role (system, user, assistant) und content-Feldern. |
| `temperature` | `number` | Optional. Sampling-Temperatur zwischen 0 und 2. Niedrigere Werte liefern deterministischeren Output. Standard 1. |
| `max_tokens` | `integer` | Optional. Maximale Anzahl zu generierender Token in der Antwort. |
| `stream` | `boolean` | Optional. Wenn true, werden Server-Sent Events mit inkrementellen Token-Deltas zurückgegeben. Standard false. |
| `tools` | `array` | Optional. Array von Tool-Definitionen für Function Calling. Jedes Tool hat type, function name, description und parameters schema. |
| `response_format` | `object` | Optional. type auf &quot;json_object&quot; oder &quot;json_schema&quot; für strukturierten JSON-Output setzen. |
| `reasoning_effort` | `string` | Optional. Steuert die Denktiefe für unterstützte Modelle. Akzeptierte Werte sind low, medium, high. |

## FAQ

### Kann ich GPT-5.5 in Hermes Agent über RunAPI verwenden?

Ja. Hermes Agent unterstützt benutzerdefinierte OpenAI-kompatible Provider. RunAPI als custom:runapi mit base_url https://runapi.ai/v1, key_env RUNAPI_API_KEY und api_mode chat_completions hinzufügen. Das Standardmodell auf gpt-5.5 setzen.

### Wie unterscheidet sich der GPT-Preis auf RunAPI vom offiziellen OpenAI-Preis?

RunAPI berechnet 50 % des offiziellen OpenAI-Token-Preises für alle GPT-Modelle. Der Rabatt gilt für Input- und Output-Token. Aktuelle Preise pro Million Token auf der RunAPI-Preisseite.

### Kann ich zwischen GPT-Varianten wechseln, ohne die Provider-Konfiguration zu ändern?

Ja. Der custom:runapi-Provider funktioniert mit allen GPT-Varianten. Nur das model-Feld im Request ändern — gpt-5.5, gpt-5.4, gpt-5.4-mini oder gpt-5.3-codex. Base URL, API-Key und api_mode bleiben gleich.

### Funktioniert die Responses API über RunAPI in Hermes Agent?

Ja. RunAPI leitet auch die OpenAI Responses API unter /v1/responses weiter. Wenn Hermes Agent die Responses-API-Oberfläche unterstützt, den Endpunkt auf https://runapi.ai/v1/responses setzen. Derselbe API-Key und Custom-Provider funktionieren für beide Endpunkte.

### Kann ich Function Calling und strukturierten Output mit GPT auf RunAPI verwenden?

Ja. Ein tools-Array für Function Calling übergeben oder response_format auf json_schema für strukturierten Output setzen. RunAPI leitet diese Parameter an das GPT-Modell weiter und gibt tool_calls oder strukturiertes JSON im OpenAI-Standardformat zurück.


## Links

- [Hermes Agent Einrichtungsanleitung →](https://runapi.ai/de/hermes-agent)
- [GPT on RunAPI →](https://runapi.ai/de/models/gpt)
- [Model catalog](https://runapi.ai/de/models)
- [API docs](https://runapi.ai/de/docs)