Z.ai: GLM 5.2

GPU TEE

Chat

z-ai/glm-5.2

Created Jun 16, 2026|1M context|$1.40/M input tokens|$4.40/M output tokens|$0.50/M cache read

Intel TDXNVIDIA CC

GLM-5.2 is Z.ai's flagship model for the era of long-horizon tasks. With a truly usable 1M-token context window, it can handle project-level engineering context and execute long-running tasks more reliably. Served as a text-only TEE deployment via Phala.

Providers for Z.ai: GLM 5.2

RedPill routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

phala

Total Context

Input

$1.40/M

Output

$4.40/M

Cache Read

$0.50/M

API

RedPill provides an OpenAI-compatible completion API to all models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

fetch("https://api.redpill.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer <YOUR-REDPILL-API-KEY>",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "z-ai/glm-5.2",
    "messages": [
      {
        "role": "user",
        "content": "What is the meaning of life?"
      }
    ]
  })
})

Privacy-first AI solutions that keep your data secure and confidential.

Z.ai: GLM 5.2

Providers for Z.ai: GLM 5.2

API

Products

Developers

Resources