Gemma 4 26B A4B IT by Deep Infra | AI model information


Model details

Gemma 4 26B A4B IT


Quick Info

Provider: Deep Infra
Model key: google/gemma-4-26B-A4B-it
Release date: Apr 2, 2026

Cost

Input token cost: $0.07
Output token cost: $0.34
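
The two prices above can be combined into a rough per-request cost estimate. The sketch below is illustrative only: the page does not state the billing unit, so it assumes the prices are in USD per 1,000,000 tokens, and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Prices as listed on this page.
# ASSUMPTION: the unit is USD per 1,000,000 tokens (the page does not say).
INPUT_PRICE_PER_M = Decimal("0.07")
OUTPUT_PRICE_PER_M = Decimal("0.34")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated request cost in USD under the assumption above."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token completion.
    print(estimate_cost(10_000, 2_000))
```

Decimal arithmetic is used here so that small per-token amounts are not distorted by binary floating-point rounding.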

Limits

Output tokens: 32,768 tokens
Context window

Recent tweets and retweets from Deep Infra

Jun 22, 2026, 7:55 PM UTC

deepinfra.com/zai-org/GLM-5.…
zai-org/GLM-5.2 - Demo - DeepInfra: GLM-5.2 is Z-AI's latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers…

Jun 18, 2026, 9:55 PM UTC

GLM 5.2 prices just dropped 🔥
$1.40 → $1.20 input
$4.40 → $4.20 output
$0.25 → $0.20 cached
cached tokens down 20% - huge for long-context workloads. same model, less spend. 👇

Jun 18, 2026, 5:58 PM UTC

Your own AI agent, always on, from $13/mo. 📷 Deep Infra Hosted Agents: OpenClaw (web dashboard) or Hermes (SSH/terminal). One-click setup & updates. Pre-wired for fast inference from second one. Auto-backups + restore. Stop it and pay $0 while idle. Spin one up →…

Jun 16, 2026, 11:26 PM UTC

📉 Price cut on @NVIDIAAI Nemotron 3 Ultra. $0.50 in / $2.20 out / $0.10 cached per 1M — output down 12%, cached down 33%. 550B/55B MoE for agentic reasoning and deep research. Multimodal, function calling, 256K…