DeepSeek V4 Flash by Deep Infra | AI model information

The fastest GLM 5.2, Kimi K2.6 and MiniMax M3The fastest GLM 5.2, Kimi K2.6 and MiniMax M3 Discuss

Model details

DeepSeek V4 Flash

Deep Infradeepseek-ai/DeepSeek-V4-Flashdeepseek-flash

Open provider page Provider docs

Quick Info

Provider: Deep Infra
Model key: deepseek-ai/DeepSeek-V4-Flash
Release date: Apr 24, 2026

Cost

Input token cost: $0.10
Output token cost: $0.20

Limits

Output tokens: 16,384 tokens
Context window

Latest news about DeepSeek V4 Flash

No articles yet. Fetch the latest news to show it here.

Videos about DeepSeek V4 Flash

Recent tweets and retweets from Deep Infra

Jun 22, 2026, 7:55 PMUTC

deepinfra.com/zai-org/GLM-5.… Link zai-org/GLM-5.2 - Demo - DeepInfra GLM-5.2 is Z-AI's latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers…

Jun 18, 2026, 9:55 PMUTC

GLM 5.2 prices just dropped 🔥 $1.40 → $1.20 input $4.40 → $4.20 output $0.25 → $0.20 cached cached tokens down 20% - huge for long-context workloads. same model, less spend. 👇

Jun 18, 2026, 5:58 PMUTC

Your own AI agent, always on, from $13/mo. 📷 Deep Infra Hosted Agents: OpenClaw (web dashboard) or Hermes (SSH/terminal). One-click setup & updates. Pre-wired for fast inference from second one. Auto-backups + restore. Stop it and pay $0 while idle. Spin one up →…

Jun 16, 2026, 11:26 PMUTC

📉 Price cut on @NVIDIAAI Nemotron 3 Ultra. $0.50 in / $2.20 out / $0.10 cached per 1M — output down 12%, cached down 33%. 550B/55B MoE for agentic reasoning and deep research. Multimodal, function calling, 256K…

Recent tweets and retweets from Deep Infra

More models around DeepSeek V4 Flash

Browse family

Also served by

This exact model name is also listed by 26 other providers.

1 provider

DeepSeek V4 Flash 6bit

Currently listed through:

routing.run

Discuss this model

Powered byHyvor Talk

Add corrections, implementation notes, pricing changes, or usage caveats for other readers.