VARIANT · Z.ai / GLM

GLM glm-4.5 API

Model variant available through RunAPI's unified AI API.

Operational · text · Commercial use supported

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "glm-4.5",
  "messages": [
    {
      "role": "user",
      "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."
    }
  ]
}'

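import os

from openai import OpenAI

# Point the OpenAI SDK at RunAPI and read the key from the environment,
# as the curl example does with $RUNAPI_KEY.
client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key=os.environ["RUNAPI_KEY"],
)

response = client.chat.completions.create(
    model="glm-4.5",
    messages=[{"role": "user", "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."}],
)

# Print the assistant's reply.
print(response.choices[0].message.content)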

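import OpenAI from "openai";

// Point the OpenAI SDK at RunAPI and read the key from the environment,
// as the curl example does with $RUNAPI_KEY.
const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: process.env.RUNAPI_KEY,
});

const response = await client.chat.completions.create({
  model: "glm-4.5",
  messages: [{ role: "user", content: "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause." }],
});

// Print the assistant's reply.
console.log(response.choices[0].message.content);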

Switch variant

glm-4.5-air glm-4.6 glm-4.7 glm-5 glm-5-turbo glm-5.1

SUMMARY

GLM glm-4.5 is available with the same RunAPI authentication, model skill workflow, and usage-based pricing.

Unified API key
Model skill setup
Model ID reference
Failed generations are not charged

PRICING

Failed generations are not charged

Chat completion

Input $0.30 / 1M tokens

Output $1.10 / 1M tokens

Cache read $0.06

Cache write 5m Free
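
To make the rates above concrete, here is a minimal cost-estimate sketch. It makes two assumptions that the page does not confirm: the cache-read rate of $0.06 is per 1M tokens like the input and output rates, and cached tokens are a subset of the input tokens billed at the cache-read rate instead of the input rate. The function name `estimate_cost` is hypothetical.

```python
# Rates copied from the pricing list above, in USD.
# Assumption: every rate is per 1M tokens, including the cache-read rate,
# whose unit is not stated on the page.
INPUT_PER_M = 0.30
OUTPUT_PER_M = 1.10
CACHE_READ_PER_M = 0.06


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one chat completion.

    Assumption: cached_tokens is the part of input_tokens served from cache,
    billed at the cache-read rate instead of the full input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHE_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    ) / 1_000_000
    return round(total, 6)


if __name__ == "__main__":
    # A 20,000-token prompt with a 2,000-token reply and no cache hits.
    print(estimate_cost(input_tokens=20_000, output_tokens=2_000))
```

Treat the result as a rough budget figure only; the billed amount is whatever RunAPI reports for the request.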

SPEC SHEET

Model ID	glm-4.5
Provider	Z.ai
Modality	text
Task type	synchronous
Billing	1K tokens
Endpoint	/v1/chat/completions
Commercial	Yes
Status	Operational

MODELS

Model skill: glm-4.5

Install the skill once, then use the variant ID from this page while you build.

Endpoint	Protocol
/v1/chat/completions	OpenAI compatible

HOW IT WORKS

How to use glm-4.5

01

Choose a model

Select the model and variant that fit your app's workflow.

02

Pass the model ID

Use this variant ID in the request body.

03

Run the task

Send the request and save the returned task ID.

04

Collect the output

Poll for the result or receive the callback when the task finishes.
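
The steps above can be sketched with the Python standard library. The spec sheet lists this variant's task type as synchronous and the endpoint as OpenAI compatible, so this sketch assumes the reply arrives in the response body in the OpenAI chat completion shape rather than through a task ID or callback. The helper names `build_payload`, `extract_text`, and `run_chat` are hypothetical, and the key is read from the `RUNAPI_KEY` environment variable used in the curl example.

```python
import json
import os
import urllib.request

BASE_URL = "https://runapi.ai"      # from the Base URL section
ENDPOINT = "/v1/chat/completions"   # from the Endpoints section


def build_payload(model_id: str, prompt: str) -> dict:
    """Steps 01-02: put the chosen variant ID in the request body."""
    return {"model": model_id, "messages": [{"role": "user", "content": prompt}]}


def extract_text(response_body: dict) -> str:
    """Step 04: read the reply, assuming the OpenAI chat completion shape."""
    return response_body["choices"][0]["message"]["content"]


def run_chat(model_id: str, prompt: str) -> str:
    """Step 03: send the request and return the reply text."""
    request = urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(build_payload(model_id, prompt)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {os.environ['RUNAPI_KEY']}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return extract_text(json.load(response))
```

Calling `run_chat("glm-4.5", "Summarize this changelog.")` would perform one round trip. Keeping the payload builder and the response parser as separate functions makes them easy to test without network access.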

DIFFERENCES

glm-4.5 comparison

VS GLM-4.5-AIR

glm-4.5: 355B total / 32B active parameters; 128K context; flagship open-weight MoE baseline

glm-4.5-air: lighter GLM-4.5 tier for fast, lower-cost everyday work

VS GLM-4.6

glm-4.5: 355B total / 32B active parameters; 128K context; flagship open-weight MoE baseline

glm-4.6: 200K context; first GLM on Cambricon chips; sharper code generation

VS GLM-4.7

glm-4.5: 355B total / 32B active parameters; 128K context; flagship open-weight MoE baseline

glm-4.7: 200K context; 73.8% SWE-bench; persistent thinking across turns
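
One practical use of the comparison above is filtering variants by the context a request needs. The sketch below is illustrative only: it reads "128K" and "200K" as 128,000 and 200,000 tokens, which is an assumption, and it leaves out glm-4.5-air because the page gives no context size for it. The function name `variants_that_fit` is hypothetical.

```python
# Context windows as stated in the comparison above.
# Assumption: "128K" and "200K" mean 128,000 and 200,000 tokens.
# glm-4.5-air is omitted because the page states no context size for it.
CONTEXT_WINDOWS = {
    "glm-4.5": 128_000,
    "glm-4.6": 200_000,
    "glm-4.7": 200_000,
}


def variants_that_fit(required_tokens: int) -> list[str]:
    """Return variant IDs whose stated context window covers required_tokens."""
    return [
        model_id
        for model_id, window in CONTEXT_WINDOWS.items()
        if window >= required_tokens
    ]


if __name__ == "__main__":
    print(variants_that_fit(150_000))
```

Any returned ID can be passed as the `model` field in the request body.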

USE CASES

Where to use this variant

Chat

Use LLMs for chat and reasoning.

Code

Generate and review implementation work.

Automation

Connect models to backend tasks.

FAQ

Frequently asked questions about glm-4.5

How do I select glm-4.5?

Pass the model ID shown in the quickstart.

Is pricing usage-based?

Yes. Pricing is metered per call or per unit; for glm-4.5 the unit is tokens.

Other GLM variants

glm-4.5-air Cheaper

$0.010 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

glm-5-turbo Fast

$0.020 / 1K tokens

$0.030 / 1K tokens

Related models

Claude API access for Anthropic's LLM, for complex reasoning, code, analysis, and long-context tasks.

DeepSeek API access through RunAPI: flash for fast, low-cost work; pro for complex agentic tasks.

OpenAI text embeddings for semantic search, retrieval, clustering, and ranking workflows.

GET STARTED

Start with GLM today.

Create a free account · Read the quickstart →