Gemma 4 31B | Model Details | Geodd

Gemma 4 31B

google/gemma-4-31B-it

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on small models) and generating text output. This release includes open-weights models in both pre-trained and instruction-tuned variants. Gemma 4 features a context window of up to 256K tokens and maintains multilingual support in over 140 languages. This model is optimized for high-performance inferencing on the Geodd network, providing exceptional speed and reliability for production workloads.

Serverless API

Pay per token via our optimized endpoints.

View Documentation

Available Serverless

Run queries immediately, pay only for usage

Input$0.130 / M Tokens

Output$0.370 / M Tokens

API Usage

cURL

curl --location '$https://api.geodd.io/gateway/v1/chat/completions' \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{
  "model": "google/gemma-4-31B-it",
  "messages": [
    { "role": "user", "content": "Hello, how are you?" }
  ]
}'

Info

Providergoogle

Quantizationfp4

Created5/24/2026

Available RegionsUS

Supported Functionality

Context Length262,144

Max Output262,144

ServerlessSupported

Input Capabilitiestext, image, video

Output Capabilitiestext

Parameters

temperaturetop_ptop_kfrequency_penaltypresence_penaltyrepetition_penaltymax_tokens