RedPill
z-ai logo

Z.ai: GLM 5.2

GPU TEE
Chat
z-ai/glm-5.2
Created Jun 16, 2026|1M context|$1.40/M input tokens|$4.40/M output tokens|$0.50/M cache read
Intel TDXNVIDIA CC

GLM-5.2 is Z.ai's flagship model for the era of long-horizon tasks. With a truly usable 1M-token context window, it can handle project-level engineering context and execute long-running tasks more reliably. Served as a text-only TEE deployment via Phala.

Providers for Z.ai: GLM 5.2

RedPill routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

phala logo
phala
Total Context
1M
Input
$1.40/M
Output
$4.40/M
Cache Read
$0.50/M

API

RedPill provides an OpenAI-compatible completion API to all models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

fetch("https://api.redpill.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer <YOUR-REDPILL-API-KEY>",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "z-ai/glm-5.2",
    "messages": [
      {
        "role": "user",
        "content": "What is the meaning of life?"
      }
    ]
  })
})