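Use Claude in Hermes Agent.
Anthropic's Claude family offers Opus 4.8 for maximum capability (200K context, extended thinking), Sonnet 4.6 for balanced performance, and Haiku 4.5 for speed. Hermes Agent calls Claude through the custom:runapi provider at 50% of Anthropic's official per-token rate, using the same key and base_url you already configured for chat.
Send a Claude chat completion request from Hermes Agent through RunAPI.
Requirements:
- Use the custom:runapi provider already configured in Hermes Agent
- Call the RunAPI chat completions endpoint https://runapi.ai/v1/chat/completions
- Set model to "claude-opus-4.8"
- Authorize with the RUNAPI_API_KEY environment variable
- The response is synchronous: the assistant message is returned directly in the response body
- To stream, set "stream": true to receive server-sent events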
curl -X POST https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-opus-4.8",
    "max_tokens": 1024,
    "messages": [
      {"role": "user", "content": "Explain the difference between a mutex and a semaphore in three sentences."}
    ]
  }'
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "model": "claude-opus-4.8",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "A mutex is a locking mechanism that allows only one thread to access a resource at a time..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 24,
    "completion_tokens": 87,
    "total_tokens": 111
  }
}
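For callers who prefer a script over curl, the same request can be sketched in Python with only the standard library. The endpoint, headers, model ID, and response fields come from the example above; the function names (`build_chat_request`, `extract_reply`, `send`) are hypothetical helpers introduced here for illustration, not part of Hermes Agent or RunAPI.

```python
import json
import urllib.request

# Endpoint taken from the requirements list above.
RUNAPI_CHAT_URL = "https://runapi.ai/v1/chat/completions"


def build_chat_request(api_key, prompt, model="claude-opus-4.8", max_tokens=1024):
    """Build (but do not send) the POST request shown in the curl example."""
    payload = {
        "model": model,
        "max_tokens": max_tokens,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        RUNAPI_CHAT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_reply(response_body):
    """Return (assistant text, total token count) from a parsed response body."""
    message = response_body["choices"][0]["message"]
    return message["content"], response_body["usage"]["total_tokens"]


def send(request):
    """Send the request and parse the JSON body. This performs a network call."""
    with urllib.request.urlopen(request) as resp:
        return json.load(resp)
```

Keeping request construction separate from `send` lets the payload and headers be inspected or unit-tested without contacting the API.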
Use Claude in Hermes Agent in three steps
Configure RunAPI
Set the RUNAPI_API_KEY environment variable. If you already added RunAPI as a custom:runapi provider in Hermes Agent, the same key and base_url work for Claude — switch the model parameter to claude-opus-4.8 in your Hermes config or use the /model command.
export RUNAPI_API_KEY=runapi_xxx
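A script that calls RunAPI outside of Hermes Agent might read the same variable and fail early when it is missing. The following is a minimal sketch; `load_runapi_key` is a hypothetical helper name, and the optional `env` argument exists only so the function can be exercised without touching the real environment.

```python
import os


def load_runapi_key(env=None):
    """Return RUNAPI_API_KEY from the environment, or raise a clear error."""
    env = os.environ if env is None else env
    key = env.get("RUNAPI_API_KEY", "").strip()
    if not key:
        raise RuntimeError("RUNAPI_API_KEY is not set; export it before calling RunAPI")
    return key
```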
Call Claude
Send a POST request to /v1/chat/completions with model set to claude-opus-4.8. Include a messages array with at least one user message. Set max_tokens to control response length. Add "stream": true for token-by-token SSE output in your Hermes session.
POST /v1/chat/completions
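When "stream" is true, the body arrives as server-sent events rather than one JSON object. The sketch below shows one way to pull text out of such a stream. It assumes the OpenAI-style chunk layout (`choices[0].delta.content`) and a `data: [DONE]` end marker, which is the usual convention for OpenAI-compatible endpoints but is not confirmed here for RunAPI; `iter_text_deltas` is a hypothetical helper name.

```python
import json


def iter_text_deltas(lines):
    """Yield text fragments from an iterable of decoded SSE lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":  # assumed OpenAI-style end-of-stream marker
            break
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text
```

Joining the yielded fragments, for example with `"".join(iter_text_deltas(lines))`, reconstructs the full assistant message.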
Read the response
The endpoint returns the assistant message synchronously — no task polling needed. Hermes Agent displays the response inline. Token usage counts are included in the response for billing transparency. Streaming responses arrive as SSE events for real-time display.
usage.total_tokens: 111
Claude API parameters (OpenAI-compatible)
| Parameter | Type | Description |
|---|---|---|
| model | string | Required. claude-opus-4.8, claude-sonnet-4.6, claude-haiku-4.5, or any Claude variant listed in the RunAPI catalog. |
| messages | array | Required. Array of message objects with role (system, user, assistant) and content fields. |
| max_tokens | integer | Maximum number of tokens in the response. Defaults vary by model; set it explicitly for predictable billing. |
| stream | boolean | When true, returns server-sent events with incremental token deltas instead of a single JSON response. |
| temperature | float | Sampling temperature between 0 and 1. Lower values produce more deterministic output. |
| top_p | float | Nucleus sampling cutoff. Alternative to temperature; use one or the other, not both. |
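The constraints in the parameter table can be checked on the client before a request is sent. The following is a sketch under that reading of the table; `build_payload` is a hypothetical helper, and the server may enforce additional rules not listed here.

```python
ALLOWED_ROLES = {"system", "user", "assistant"}


def build_payload(model, messages, max_tokens=None, stream=False,
                  temperature=None, top_p=None):
    """Assemble a chat completions payload, enforcing the documented rules."""
    if not model:
        raise ValueError("model is required")
    if not messages:
        raise ValueError("messages must contain at least one message")
    for msg in messages:
        if msg.get("role") not in ALLOWED_ROLES or "content" not in msg:
            raise ValueError("each message needs a valid role and a content field")
    if temperature is not None and top_p is not None:
        raise ValueError("set temperature or top_p, not both")
    if temperature is not None and not 0 <= temperature <= 1:
        raise ValueError("temperature must be between 0 and 1")

    payload = {"model": model, "messages": list(messages)}
    if max_tokens is not None:
        payload["max_tokens"] = max_tokens
    if stream:
        payload["stream"] = True
    if temperature is not None:
        payload["temperature"] = temperature
    if top_p is not None:
        payload["top_p"] = top_p
    return payload
```

Optional fields are omitted unless set, so the model-specific defaults mentioned in the table still apply.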
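What is Claude on Hermes Agent?
Claude is Anthropic's LLM. Hermes Agent calls it through the custom:runapi provider at half of Anthropic's official per-token price. All three tiers, Opus 4.8 (200K context, extended thinking), Sonnet 4.6 (balanced speed and quality), and Haiku 4.5 (fast and low-cost), share the same provider configuration. To switch tiers per request, change only the model field; no reconfiguration is needed.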
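Because only the model field differs between tiers, per-request switching reduces to swapping one value in an otherwise unchanged payload. A minimal sketch follows; the tier labels (`opus`, `sonnet`, `haiku`) and the `with_tier` helper are illustrative names, while the model IDs are the ones listed in this document.

```python
CLAUDE_TIERS = {
    "opus": "claude-opus-4.8",      # strongest capability
    "sonnet": "claude-sonnet-4.6",  # balanced speed and quality
    "haiku": "claude-haiku-4.5",    # fast and low-cost
}


def with_tier(payload, tier):
    """Return a copy of the payload that targets the given tier."""
    if tier not in CLAUDE_TIERS:
        raise ValueError(f"unknown tier: {tier!r}")
    return {**payload, "model": CLAUDE_TIERS[tier]}
```

The original payload is left untouched, so one base request can be reused across tiers.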
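Claude use cases
Build AI agents with tool calling and MCP
Use Claude's function calling and Model Context Protocol support in Hermes Agent to build multi-step automated workflows that read files, query databases, and act on their reasoning.
Code generation and review
Route programming tasks to Claude through Hermes Agent: Opus 4.8 for complex architecture decisions and multi-file refactors, Sonnet 4.6 for everyday PR review and test generation.
Content generation with prompt caching
Use prompt caching to generate marketing copy, documentation, or reports in bulk, cutting costs when the system prompt and context stay the same across requests.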
Claude + Hermes Agent FAQ
Yes. Configure RunAPI as a custom:runapi provider in Hermes Agent with base_url https://runapi.ai/v1 and api_mode chat_completions. Set model to claude-opus-4.8 or any other Claude variant. The same RUNAPI_API_KEY handles chat, image, video, and music models.
RunAPI charges 50% of Anthropic's official rate. Opus 4.8 is $7.50/$37.50 per million input/output tokens through RunAPI versus $15/$75 direct. With prompt caching enabled, cached input tokens cost even less. No subscription or volume commitment required.
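As a worked example of these rates, the usage block from the sample response (24 prompt tokens, 87 completion tokens) can be priced directly. The sketch below uses only the Opus 4.8 prices quoted in this answer and ignores prompt caching discounts, whose exact rate is not stated; `estimate_cost_usd` is a hypothetical helper.

```python
# Opus 4.8 prices in USD per million tokens, as quoted above.
RUNAPI_OPUS = {"input": 7.50, "output": 37.50}
DIRECT_OPUS = {"input": 15.00, "output": 75.00}


def estimate_cost_usd(usage, rates):
    """Estimate the cost of one response from its usage block (no caching)."""
    input_cost = usage["prompt_tokens"] * rates["input"]
    output_cost = usage["completion_tokens"] * rates["output"]
    return (input_cost + output_cost) / 1_000_000


usage = {"prompt_tokens": 24, "completion_tokens": 87, "total_tokens": 111}
runapi_cost = estimate_cost_usd(usage, RUNAPI_OPUS)  # (24*7.5 + 87*37.5) / 1e6
direct_cost = estimate_cost_usd(usage, DIRECT_OPUS)  # (24*15 + 87*75) / 1e6
```

For this response the estimate is roughly $0.0034 through RunAPI versus $0.0069 direct. Output tokens dominate the total, which is why setting max_tokens explicitly matters for billing.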
No. Change only the model parameter in your Hermes config or use the /model command during a session. The custom:runapi provider, base_url, and API key stay the same across all Claude variants -- Opus 4.8, Sonnet 4.6, Haiku 4.5, and dated snapshots.
RunAPI exposes both /v1/chat/completions (OpenAI-compatible, used by Hermes Agent's chat_completions mode) and /v1/messages (native Anthropic format). The native endpoint supports extended thinking and Anthropic-specific features. For Hermes Agent, the OpenAI-compatible path covers standard chat and streaming.
Include a cache_control breakpoint on your system prompt or large context blocks. Subsequent requests that share the same cached prefix pay a reduced input token rate. This is especially effective for agent loops where the system prompt and tool definitions repeat across many turns.
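A request body with a cache breakpoint on the system prompt might look like the sketch below. It follows Anthropic's native Messages format, where the system prompt is a list of content blocks and the block to cache carries `cache_control` of type `ephemeral`. That RunAPI's /v1/messages endpoint accepts this schema unchanged, and that the model ID is the same on that endpoint, are assumptions; `build_cached_payload` is a hypothetical helper.

```python
def build_cached_payload(system_prompt, user_text,
                         model="claude-opus-4.8", max_tokens=1024):
    """Build a native Messages payload whose system prompt is marked cacheable."""
    return {
        "model": model,
        "max_tokens": max_tokens,
        "system": [
            {
                "type": "text",
                "text": system_prompt,
                # Breakpoint: content up to and including this block is
                # eligible for caching (Anthropic Messages format).
                "cache_control": {"type": "ephemeral"},
            }
        ],
        "messages": [{"role": "user", "content": user_text}],
    }
```

In an agent loop, the system prompt stays byte-for-byte identical across turns while only the messages array changes, so later requests can reuse the cached prefix.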
Yes. Pass the extended thinking parameters in your request body. Hermes Agent forwards them to the RunAPI Claude endpoint, which supports the same extended thinking configuration as the direct Anthropic API.
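One possible shape for those parameters is sketched below, using the `thinking` object from Anthropic's Messages API: a type of `enabled` plus a `budget_tokens` value that is at least 1024 and smaller than max_tokens. Whether Hermes Agent passes this field through untouched in chat_completions mode is not verified here, so treat the helper (`with_extended_thinking`, a hypothetical name) as an illustration of the request body only.

```python
MIN_THINKING_BUDGET = 1024  # minimum budget in Anthropic's Messages API


def with_extended_thinking(payload, budget_tokens=2048):
    """Return a copy of the payload with extended thinking enabled."""
    max_tokens = payload.get("max_tokens")
    if max_tokens is None:
        raise ValueError("set max_tokens before enabling extended thinking")
    if budget_tokens < MIN_THINKING_BUDGET:
        raise ValueError("budget_tokens must be at least 1024")
    if budget_tokens >= max_tokens:
        raise ValueError("budget_tokens must be smaller than max_tokens")
    return {
        **payload,
        "thinking": {"type": "enabled", "budget_tokens": budget_tokens},
    }
```

The thinking budget counts toward max_tokens, so max_tokens has to leave room for the visible answer on top of the budget.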
Try Claude in Hermes Agent now.
Get a free RunAPI key, configure the custom:runapi provider, and start using Claude at 50% of Anthropic's official rate.