DeepSeek V4 Flash | Model Details | Geodd

DeepSeek V4 Flash

deepseek-ai/DeepSeek-V4-Flash

DeepSeek-V4-Flash with 284B parameters (13B activated) — both supporting a context length of one million tokens. This model is optimized for high-performance inferencing on the Geodd network, providing exceptional speed and reliability for production workloads.

Serverless API

Pay per token via our optimized endpoints.

View Documentation

Available Serverless

Run queries immediately, pay only for usage

Input$0.140 / M Tokens

Output$0.300 / M Tokens

API Usage

cURL

curl --location '$https://api.geodd.io/gateway/v1/chat/completions' \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{
  "model": "deepseek-ai/DeepSeek-V4-Flash",
  "messages": [
    { "role": "user", "content": "Hello, how are you?" }
  ]
}'

Info

Providerdeepseek-ai

Quantizationfp8

Created5/19/2026

Available RegionsUS

Supported Functionality

Context Length1,048,576

Max Output393,216

ServerlessSupported

Input Capabilitiestext

Output Capabilitiestext

Parameters

temperaturetop_ptop_kmin_pfrequency_penaltypresence_penaltymax_tokensseedstop