VARIANT · Z.ai / GLM

GLM glm-5 API

Een modelvariant beschikbaar via de uniforme AI-API van RunAPI.

Operationeel · text · Commercieel toegestaan

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "glm-5",
  "messages": [
    {
      "role": "user",
      "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."
    }
  ]
}'

from openai import OpenAI

client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key="your-runapi-key"
)

response = client.chat.completions.create(
    model="glm-5",
    messages=[{"role": "user", "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."}]
)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "glm-5",
  messages: [{ role: "user", content: "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause." }]
});

https://runapi.ai /v1/chat/completions

Wissel variant

glm-4.5 glm-4.5-air glm-4.6 glm-4.7 glm-5-turbo glm-5.1

OVERVIEW

glm-5 biedt de ideale balans tussen kwaliteit en kosten binnen de GLM-familie.

Prijs per call in USD
Mislukte generaties worden niet gefactureerd
Streaming wanneer ondersteund door het model
Modelskill-setup

PRICING

Prijzen

Mislukte generaties worden niet in rekening gebracht

Chat completion

Input $0.50 / 1M tokens

Output $1.60 / 1M tokens

Cache read $0.10

Cache write 5m Free

SPECIFICATIEBLAD

Technische details

Model-ID	glm-5
Provider	Z.ai
Modaliteit	text
Taaktype	synchronous
Facturatie-eenheid	1K tokens
API endpoint	/v1/chat/completions
Commerciële licentie	Ja — inbegrepen via API
Status	Operationeel

SKILLS

Snelstart — glm-5

Zelfde structuur · variant vastgelegd in model

Endpoint	Protocol
/v1/chat/completions	OpenAI compatible

HOE HET WERKT

Gebruik glm-5 in vier stappen

01

Installeren

Installeer de modelskill voor deze modelreeks.

02

Configureren

Zet het modelveld op de volledige model-ID die op deze pagina staat.

03

Aanroepen

Stuur een getypeerde request met je prompt, inputs en callback-instellingen.

04

Ontvangen

Lees de task-response, webhook-callback of cached output-URL van RunAPI.

DIFFERENCES

Wat is er anders aan glm-5

VS GLM-4.5

744B / 40B active; 200K context; 77.8% SWE-bench Verified

355B / 32B active; 128K context; flagship open-weight MoE baseline

VS GLM-4.5-AIR

744B / 40B active; 200K context; 77.8% SWE-bench Verified

Lighter GLM-4.5 tier for fast, lower-cost everyday work

VS GLM-4.6

744B / 40B active; 200K context; 77.8% SWE-bench Verified

200K context; first GLM on Cambricon chips; sharper code generation

USE CASES

Ideaal voor

Klantenservice

Beantwoord klantvragen vanuit een privékennisbank en verlaag zo het aantal tickets.

Documentanalyse

Stel samenvattingen van contracten op en markeer belangrijke clausules voor beoordeling door een jurist.

Codegeneratie

Genereer automatisch unittests, code reviews en refactoringsuggesties in CI.

FAQ

Veelgestelde vragen over glm-5

Is de model-ID stabiel tussen versies?

RunAPI houdt de model-ID stabiel en verwerkt compatibele versie-updates zonder de opzet van je request te wijzigen.

Wat is de rate limit voor deze variant?

Rate limits per key schalen mee met je usage tier. Bekijk de prijzenpagina voor de actuele limieten.

Kan ik later van variant wisselen?

Ja — variant is een flag. Wissel door de modelparameter aan te passen.

Ondersteunt het streaming?

Waar streaming beschikbaar is, streamt RunAPI end-to-end.

Waar meld ik kwaliteitsproblemen?

Open een issue in de publieke GitHub-repo of mail support.

Andere varianten van GLM

glm-4.5-air goedkoopst

$0.010 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

glm-5-turbo snel

$0.020 / 1K tokens

$0.030 / 1K tokens

Alternatieven van andere modellen

Anthropic's LLM for complex reasoning, code, analysis, and extended-context tasks.

Reasoning-first LLMs via RunAPI — flash for fast, low-cost work; pro for complex agentic tasks.

OpenAI text embeddings for semantic search, retrieval, clustering, and ranking workflows.

NU BEGINNEN

Begin met bouwen met GLM.

Gratis account aanmaken Lees de quickstart →