Gemma 4 31B | Model Details | Geodd AI
Model Library/Gemma 4 31B

Gemma 4 31B

google/gemma-4-31B-it
API Docs

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on small models) and generating text output. This release includes open-weights models in both pre-trained and instruction-tuned variants. Gemma 4 features a context window of up to 256K tokens and maintains multilingual support in over 140 languages. This model is optimized for high-performance inferencing on the Geodd network, providing exceptional speed and reliability for production workloads.

Read more

Features

Serverless API

Pay per token via our optimized endpoints.

View Documentation
Available Serverless
Run queries immediately, pay only for usage
Input$0.13 / M Tokens
Output$0.37 / M Tokens

API Usage

cURL
curl --location '$https://api.geodd.io/gateway/v1/chat/completions' \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{
  "model": "google/gemma-4-31B-it",
  "messages": [
    { "role": "user", "content": "Hello, how are you?" }
  ]
}'

Info

Providergoogle
Quantizationfp4
Created5/24/2026
Available RegionsUS

Supported Functionality

Context Length262,144
Max Output262,144
ServerlessSupported
Input Capabilitiestext, image, video
Output Capabilitiestext

Parameters

temperaturetop_ptop_kfrequency_penaltypresence_penaltyrepetition_penaltymax_tokens