변형 · Z.ai / GLM

GLM glm-4.6 API

RunAPI의 통합 AI API로 제공되는 모델 변형입니다.

운영 중 · text · 상업적 사용 지원

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/chat/completions

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "glm-4.6",
  "messages": [
    {
      "role": "user",
      "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."
    }
  ]
}'

from openai import OpenAI

client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key="your-runapi-key"
)

response = client.chat.completions.create(
    model="glm-4.6",
    messages=[{"role": "user", "content": "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause."}]
)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "glm-4.6",
  messages: [{ role: "user", content: "Read this multi-file repository, find the failing integration test, and propose a patch with an explanation of the root cause." }]
});

https://runapi.ai /v1/chat/completions

변형 전환

glm-4.5 glm-4.5-air glm-4.7 glm-5 glm-5-turbo glm-5.1

개요

GLM glm-4.6는 같은 RunAPI 인증, model skill 워크플로, 사용량 기반 가격으로 사용할 수 있습니다.

통합 API key
Model skill 설정
모델 ID 참조
실패한 생성은 과금되지 않습니다

요금

요금

실패한 생성은 과금되지 않습니다

Chat completion

Input $0.30 / 1M tokens

Output $1.10 / 1M tokens

Cache read $0.06

Cache write 5m Free

스펙

스펙

모델 ID	glm-4.6
제공사	Z.ai
모달리티	text
작업 유형	synchronous
과금	1K tokens
Endpoint	/v1/chat/completions
상업적 사용	예
상태	운영 중

MODELS

모델 스킬 — glm-4.6

스킬을 한 번 설치한 후, 이 페이지의 변형 ID를 사용해 개발을 시작하세요.

Endpoint	Protocol
/v1/chat/completions	OpenAI compatible

작동 방식

glm-4.6 사용 방법

01

모델 선택

앱 워크플로에 맞는 모델과 변형을 고릅니다.

02

모델 ID 전달

요청 본문에 이 변형 ID를 사용합니다.

03

작업 실행

요청을 보내고 반환된 task ID를 저장합니다.

04

출력 수집

작업이 끝나면 조회하거나 callback으로 결과를 받습니다.

차이

glm-4.6 비교

VS GLM-4.5

200K context; first GLM on Cambricon chips; sharper code generation

355B / 32B active; 128K context; flagship open-weight MoE baseline

VS GLM-4.5-AIR

200K context; first GLM on Cambricon chips; sharper code generation

Lighter GLM-4.5 tier for fast, lower-cost everyday work

VS GLM-4.7

200K context; first GLM on Cambricon chips; sharper code generation

200K context; 73.8% SWE-bench; persistent thinking across turns

사용 사례

이 변형을 쓰는 곳

Chat

LLM을 채팅과 추론에 사용합니다.

코드

구현 작업을 생성하고 리뷰합니다.

자동화

모델을 backend 작업에 연결합니다.

FAQ

glm-4.6 자주 묻는 질문

glm-4.6는 어떻게 선택하나요?

quickstart에 표시된 모델 ID를 전달하세요.

가격은 사용량 기반인가요?

네. 호출 또는 단위 기준으로 측정됩니다.

다른 GLM 변형

glm-4.5-air 최저가

$0.010 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

$0.020 / 1K tokens

glm-5-turbo 빠름

$0.020 / 1K tokens

$0.030 / 1K tokens

관련 모델

복잡한 추론, 코드, 분석, 확장 컨텍스트 작업을 위한 Anthropic의 LLM Claude API 액세스.

RunAPI를 통한 DeepSeek API 액세스 — 빠르고 저렴한 작업에는 flash, 복잡한 에이전트 작업에는 pro를 사용합니다.

OpenAI text embeddings for semantic search, retrieval, clustering, and ranking workflows.

지금 시작

오늘 GLM로 시작하세요.

무료 계정 만들기 빠른 시작 읽기 →