PROVIDER

Z.ai

Q: Is dit een officiële Z.ai-integratie?

RunAPI biedt een beheerde API-laag met transparante prijzen, mogelijkheden en foutgedrag.

Q: Heb ik een Z.ai-account nodig?

Nee — je RunAPI-sleutel is genoeg voor beheerde toegang.

Q: Wat is de extra latency door proxying?

Meestal onder de 20 ms. RunAPI houdt de proxylaag dicht bij de regio’s waar het model draait.

Q: Worden afbeeldingen / video’s gecached?

Genereerde output wordt opgeslagen en is op te halen via task ID. Invoer wordt niet gecached.

Q: Kan ik mijn eigen key gebruiken?

Nog niet — calls gebruiken RunAPI-beheerde toegang.

Z.ai's GLM — MIT-licensed MoE LLMs from 128K to 200K context, top open-weight SWE-bench scores, via one RunAPI key.

1 models · 7 variants · vanaf $0.010

Alle modellen bekijken API-documentatie →

Alle modellen van Z.ai 1 models

GLM Text

7 variants · from $0.010

OVERVIEW

Z.ai builds the GLM family of MIT-licensed Mixture-of-Experts language models for coding and agentic workflows. The line spans GLM-4.5 (355B / 32B active, 128K context) through GLM-5.1 (754B / 40B active, 200K context), which holds the top open-weight SWE-bench Pro score at 58.4%. All are available through RunAPI from the OpenAI and Anthropic SDKs with per-token billing.

Eén API-key gedeeld tussen providers
Modelskills brengen docs en schema's naar je workspace
Betalen per call, zonder verplichting
Mislukte generaties worden niet in rekening gebracht

FEATURES

Wat opvalt

FASTEST

GLM

P50 ~ <1s

Een van de meest gebruikte model-API's van Z.ai.

FRONTIER

GLM

Frontier tier

Een van de meest gebruikte model-API's van Z.ai.

CHEAPEST

GLM

from $0.010

Laagste instapprijs in de Z.ai-catalogus.

MODELS

Alle modellen van Z.ai

GLM

Z.ai

Text

Z.ai GLM API access via RunAPI — MIT-licensed MoE models with up to 200K context, leading open-weight coding benchmarks.

/v1/chat/completions endpoint

vanaf $0.010 / 1K tokens Bekijken →

QUICKSTART

Installeer een Z.ai modelskill.

Kies een model en voeg de skill toe zodat je codeertool docs, schema's, prijsnotities en setupstappen heeft.

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "glm-5.1",
  "messages": [
    {
      "role": "user",
      "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."
    }
  ]
}'

from openai import OpenAI

client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key="your-runapi-key"
)

response = client.chat.completions.create(
    model="glm-5.1",
    messages=[{"role": "user", "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."}]
)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "glm-5.1",
  messages: [{ role: "user", content: "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause." }]
});

https://runapi.ai /v1/chat/completions

REFERENCE

Alle varianten van Z.ai

Volledige prijstabel →

Model	Variant	Billing	From
GLM	glm-4.5	1K tokens	$0.020	Bekijken →
	glm-4.5-air	1K tokens	$0.010	Bekijken →
	glm-4.6	1K tokens	$0.020	Bekijken →
	glm-4.7	1K tokens	$0.020	Bekijken →
	glm-5	1K tokens	$0.020	Bekijken →
	glm-5-turbo	1K tokens	$0.020	Bekijken →
	glm-5.1	1K tokens	$0.030	Bekijken →