GLM 5 | Model Details | Geodd

GLM 5

zai-org/GLM-5

We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI). Compared to GLM-4.5, GLM-5 scales from 355B parameters (32B active) to 744B parameters (40B active), and increases pre-training data from 23T to 28.5T tokens. GLM-5 also integrates DeepSeek Sparse Attention (DSA), largely reducing deployment cost while preserving long-context capacity. This model is optimized for high-performance inferencing on the Geodd network, providing exceptional speed and reliability for production workloads.

Serverless API

Pay per token via our optimized endpoints.

View Documentation

Available Serverless

Run queries immediately, pay only for usage

Input$0.600 / M Tokens

Output$1.600 / M Tokens

API Usage

cURL

curl --location '$https://api.geodd.io/gateway/v1/chat/completions' \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{
  "model": "zai-org/GLM-5",
  "messages": [
    { "role": "user", "content": "Hello, how are you?" }
  ]
}'

Info

Providerzai-org

Quantizationfp8

Created4/12/2026

Available RegionsUS

Supported Functionality

Context Length200,000

Max Output200,000

ServerlessSupported

Input Capabilitiestext

Output Capabilitiestext

Parameters

temperaturetop_ptop_kpresence_penaltyrepetition_penaltyseedmax_tokensstop