---
title: "Modelli AI Z.ai — accesso API tramite RunAPI"
url: "https://runapi.ai/it/providers/z-ai.md"
canonical: "https://runapi.ai/it/providers/z-ai.md"
locale: "it"
provider: "Z.ai"
line_count: 1
variant_count: 7
---

# Z.ai

Z.ai&#39;s GLM — MIT-licensed MoE LLMs from 128K to 200K context, top open-weight SWE-bench scores, via one RunAPI key.

Z.ai builds the GLM family of MIT-licensed Mixture-of-Experts language models for coding and agentic workflows. The line spans GLM-4.5 (355B / 32B active, 128K context) through GLM-5.1 (754B / 40B active, 200K context), which holds the top open-weight SWE-bench Pro score at 58.4%. All are available through RunAPI from the OpenAI and Anthropic SDKs with per-token billing.

## Models

- [GLM](https://runapi.ai/it/models/glm.md) — Text, 7 variants · from $0.010

## Variants

| Model | Variant | Billing | Starting price | URL |
|---|---|---|---|---|
| GLM | `4.5` | 1K tokens | $0.020 | https://runapi.ai/it/models/glm/4.5.md |
| GLM | `4.5-air` | 1K tokens | $0.010 | https://runapi.ai/it/models/glm/4.5-air.md |
| GLM | `4.6` | 1K tokens | $0.020 | https://runapi.ai/it/models/glm/4.6.md |
| GLM | `4.7` | 1K tokens | $0.020 | https://runapi.ai/it/models/glm/4.7.md |
| GLM | `5` | 1K tokens | $0.020 | https://runapi.ai/it/models/glm/5.md |
| GLM | `5-turbo` | 1K tokens | $0.020 | https://runapi.ai/it/models/glm/5-turbo.md |
| GLM | `5.1` | 1K tokens | $0.030 | https://runapi.ai/it/models/glm/5.1.md |

## LLM API endpoints

Base URL: `https://runapi.ai`

- `POST /v1/chat/completions` — OpenAI compatible
- `POST /v1/messages` — Anthropic compatible

Use the OpenAI or Anthropic SDK with your RunAPI API key.


## FAQ

### È un&#39;integrazione ufficiale di %{provider}?

RunAPI espone una superficie API gestita con prezzi, capacità e comportamento degli errori trasparenti.

### Mi serve un account %{provider}?

No — la tua chiave RunAPI è sufficiente per l’accesso gestito.

### Qual è l’overhead di latenza del proxying?

In genere sotto i 20 ms. RunAPI mantiene il layer proxy vicino alle regioni di esecuzione del modello.

### Le immagini / i video vengono cachati?

Gli output generati vengono salvati e possono essere recuperati tramite task ID. Gli input non vengono cachati.

### Posso usare una mia chiave?

Al momento no — le chiamate usano l’accesso gestito da RunAPI.