VARIANT · Z.ai / GLM

GLM glm-4.5-air API

A model variant served through RunAPI's unified AI API.

Operational · text · Commercial use allowed

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "glm-4.5-air",
  "messages": [
    {
      "role": "user",
      "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."
    }
  ]
}'

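import os

from openai import OpenAI

# Point the OpenAI SDK at RunAPI's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key=os.environ["RUNAPI_KEY"],  # same variable the curl example uses
)

response = client.chat.completions.create(
    model="glm-4.5-air",
    messages=[{"role": "user", "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."}]
)

# Print the assistant's reply.
print(response.choices[0].message.content)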

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "glm-4.5-air",
  messages: [{ role: "user", content: "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause." }]
});


Switch variant

glm-4.5 glm-4.6 glm-4.7 glm-5 glm-5-turbo glm-5.1

OVERVIEW

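glm-4.5-air targets the best balance of quality and cost in the GLM family.

Pay per use, priced in USD
Failed generations are not charged
Streaming output where the model supports it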
Model skill setup

PRICING

Charges

Failed generations are not charged

Chat completion

Input $0.10 / 1M tokens

Output $0.55 / 1M tokens

Cache read $0.02

Cache write 5m Free
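
As a rough illustration of how the per-token rates above translate into a charge, the sketch below multiplies token counts by the listed input and output prices. The helper name `estimate_cost_usd` is hypothetical, and cache pricing is left out because the page does not state a unit for the cache read rate.

```python
from decimal import Decimal

# Rates copied from the pricing list above, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.10")
OUTPUT_USD_PER_MILLION = Decimal("0.55")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the charge for one successful chat completion (hypothetical helper)."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Example: a request with 20,000 prompt tokens and 2,000 completion tokens.
example_cost = estimate_cost_usd(20_000, 2_000)
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift. Failed generations are not charged, so only successful calls would be counted.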

Specifications

Technical details

Model ID	glm-4.5-air
Provider	Z.ai
Modality	text
Task type	synchronous
Billing unit	1K tokens
API endpoint	/v1/chat/completions
Commercial license	Yes, included via the API
Status	Operational

SKILLS

Quick start: glm-4.5-air

Consistent structure · variant is pinned in the model field

Endpoint	Protocol
/v1/chat/completions	OpenAI compatible

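How it works

Use glm-4.5-air in four steps

01

Install

Install the model skill for this model family.

02

Configure

Set the model field to the full model ID shown on this page.

03

Call

Send a typed request with the prompt, inputs, and callback settings.

04

Receive

Read the task response, webhook callback, or cached output URL from RunAPI.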
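
The configure, call, and receive steps can be sketched with the Python standard library alone. This is a minimal sketch that mirrors the curl example on this page. It covers only the synchronous response path, because the page does not document the callback or webhook fields. The helper names `build_chat_request` and `send_chat_request` are hypothetical, and the response shape is assumed to follow the OpenAI chat completions format.

```python
import json
import urllib.request

BASE_URL = "https://runapi.ai"      # base URL from this page
ENDPOINT = "/v1/chat/completions"   # endpoint from this page
MODEL_ID = "glm-4.5-air"            # full model ID from this page


def build_chat_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build, but do not send, a chat completion request (hypothetical helper)."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(request: urllib.request.Request) -> str:
    """Send the request and return the reply text (hypothetical helper).

    Assumes an OpenAI-compatible body: choices[0].message.content.
    """
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

A call would look like `send_chat_request(build_chat_request("Summarize this changelog.", os.environ["RUNAPI_KEY"]))`, with `os` imported by the caller.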

DIFFERENCES

What's different about glm-4.5-air

VS GLM-4.5

Lighter GLM-4.5 tier for fast, lower-cost everyday work

355B / 32B active; 128K context; flagship open-weight MoE baseline

VS GLM-4.6

Lighter GLM-4.5 tier for fast, lower-cost everyday work

200K context; first GLM on Cambricon chips; sharper code generation

VS GLM-4.7

Lighter GLM-4.5 tier for fast, lower-cost everyday work

200K context; 73.8% SWE-bench; persistent thinking across turns

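Use cases

Best for

Customer support

Answer customer questions from a private knowledge base and reduce ticket volume.

Document analysis

Draft contract summaries and flag key clauses for lawyer review.

Code generation

Automatically generate unit tests, code reviews, and refactoring suggestions in CI.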

FAQ

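Frequently asked questions about glm-4.5-air

Does the model ID stay stable across versions?

RunAPI keeps the model ID stable and handles compatible version updates, so you do not need to change your request format.

What is the rate limit for this variant?

Rate limits are set per key and depend on your usage tier. See the pricing page for current limits.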
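
Because limits vary by tier, a client may want to retry when it is throttled. The sketch below shows one common pattern, exponential backoff with jitter. The helper `call_with_backoff` is hypothetical and not part of RunAPI or any SDK, and the page does not say how a rate-limit error is reported, so the exception types to retry on are left to the caller.

```python
import random
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_backoff(
    call: Callable[[], T],
    retry_on: Tuple[Type[BaseException], ...],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Retry `call` with exponential backoff and jitter (hypothetical helper)."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error to the caller
            # The delay doubles each attempt (base, 2x, 4x, ...), plus up to 50% jitter.
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay / 2))
    raise RuntimeError("max_attempts must be at least 1")
```

With the OpenAI Python SDK, one option is to pass `retry_on=(openai.RateLimitError,)` and wrap the `client.chat.completions.create(...)` call in a lambda. The SDK client also accepts a `max_retries` argument, which may be enough on its own.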

Can I switch variants later?

Yes. The variant is just a flag: change the model parameter to switch.
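
To make the point concrete, the sketch below builds the same request body for two variants. Only the `model` value differs. The helper `chat_payload` is hypothetical, and the variant list is copied from the variant switcher on this page.

```python
from typing import Any, Dict, List

# Variant IDs listed on this page.
GLM_VARIANTS: List[str] = [
    "glm-4.5-air",
    "glm-4.5",
    "glm-4.6",
    "glm-4.7",
    "glm-5",
    "glm-5-turbo",
    "glm-5.1",
]


def chat_payload(model: str, prompt: str) -> Dict[str, Any]:
    """Build a chat completion body for the chosen variant (hypothetical helper)."""
    if model not in GLM_VARIANTS:
        raise ValueError(f"unknown GLM variant: {model!r}")
    return {"model": model, "messages": [{"role": "user", "content": prompt}]}


# Same prompt, two variants: the bodies differ only in the "model" field.
air_body = chat_payload("glm-4.5-air", "Explain this stack trace.")
newer_body = chat_payload("glm-4.6", "Explain this stack trace.")
```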

Is streaming supported?

Where streaming is available, RunAPI streams end to end.
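
A minimal sketch of consuming a streamed reply with the OpenAI Python SDK follows. It assumes RunAPI accepts the standard `stream=True` flag on its OpenAI-compatible endpoint, which this page implies but does not spell out. The helper names `collect_stream_text` and `stream_reply` are hypothetical.

```python
from typing import Iterable


def collect_stream_text(chunks: Iterable) -> str:
    """Join the text deltas of a streamed chat completion (hypothetical helper)."""
    parts = []
    for chunk in chunks:
        # Some chunks, such as a trailing usage chunk, carry no choices.
        if not chunk.choices:
            continue
        delta = chunk.choices[0].delta.content
        if delta:  # role-only and final chunks carry no text
            parts.append(delta)
    return "".join(parts)


def stream_reply(prompt: str) -> str:
    """Request a streamed completion; needs the `openai` package and RUNAPI_KEY."""
    import os

    from openai import OpenAI

    client = OpenAI(base_url="https://runapi.ai/v1", api_key=os.environ["RUNAPI_KEY"])
    stream = client.chat.completions.create(
        model="glm-4.5-air",
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # assumption: supported for this variant
    )
    return collect_stream_text(stream)
```

In an interactive application, each delta would typically be printed or forwarded as it arrives instead of being joined at the end.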

Where should I report quality issues?

Open an issue in the public GitHub repo, or email the support team.

Other GLM variants

$0.020 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

glm-5-turbo (fastest)

$0.020 / 1K tokens

$0.030 / 1K tokens

Alternatives from other models

Anthropic's LLM for complex reasoning, code, analysis, and extended-context tasks.

Reasoning-first LLMs via RunAPI — flash for fast, low-cost work; pro for complex agentic tasks.

OpenAI text embeddings for semantic search, retrieval, clustering, and ranking workflows.

Get started

Start building products with GLM.

Create a free account
Read the quick start →