DeepSeek V4 Flash | Model Details | Geodd AI
Model Library/DeepSeek V4 Flash

DeepSeek V4 Flash

deepseek-ai/DeepSeek-V4-Flash
API Docs

This including two strong Mixture-of-Experts (MoE) language models — DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated) — both supporting a context length of one million tokens. This model is optimized for high-performance inferencing on the Geodd network, providing exceptional speed and reliability for production workloads.

Read more

Features

Serverless API

Pay per token via our optimized endpoints.

View Documentation
Available Serverless
Run queries immediately, pay only for usage
Input$0.14 / M Tokens
Output$0.30 / M Tokens

API Usage

cURL
curl --location '$https://api.geodd.io/gateway/v1/chat/completions' \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{
  "model": "deepseek-ai/DeepSeek-V4-Flash",
  "messages": [
    { "role": "user", "content": "Hello, how are you?" }
  ]
}'

Info

Providerdeepseek-ai
Quantizationfp8
Created5/19/2026
Available RegionsUS

Supported Functionality

Context Length1,048,576
Max Output393,216
ServerlessSupported
Input Capabilitiestext
Output Capabilitiestext

Parameters

temperaturetop_ptop_kmin_pfrequency_penaltypresence_penaltymax_tokensseedstop