GLM 5 | Model Details | Geodd AI

GLM 5

zai-org/GLM-5
API Docs

We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI). Compared to GLM-4.5, GLM-5 scales from 355B parameters (32B active) to 744B parameters (40B active), and increases pre-training data from 23T to 28.5T tokens. GLM-5 also integrates DeepSeek Sparse Attention (DSA), largely reducing deployment cost while preserving long-context capacity. This model is optimized for high-performance inferencing on the Geodd network, providing exceptional speed and reliability for production workloads.

Read more

Features

Serverless API

Pay per token via our optimized endpoints.

View Documentation
Available Serverless
Run queries immediately, pay only for usage
Input$0.60 / M Tokens
Output$1.60 / M Tokens

API Usage

cURL
curl --location '$https://api.geodd.io/gateway/v1/chat/completions' \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{
  "model": "zai-org/GLM-5",
  "messages": [
    { "role": "user", "content": "Hello, how are you?" }
  ]
}'

Info

Providerzai-org
Quantizationfp8
Created4/12/2026
Available RegionsUS

Supported Functionality

Context Length200,000
Max Output200,000
ServerlessSupported
Input Capabilitiestext
Output Capabilitiestext

Parameters

temperaturetop_ptop_kpresence_penaltyrepetition_penaltyseedmax_tokensstop