Add corrections, implementation notes, pricing changes, or usage caveats for other readers.
Last updated
Apr 24, 2026
Knowledge cutoff
2025-05
Input modalities
Output modalities
Capabilities
1,048,576 tokens
Recent tweets and retweets from Deep Infra
deepinfra.com/zai-org/GLM-5.…
Link
zai-org/GLM-5.2 - Demo - DeepInfra
GLM-5.2 is Z-AI's latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers…
deepinfra.com/zai-org/GLM-5.…
Link
zai-org/GLM-5.2 - Demo - DeepInfra
GLM-5.2 is Z-AI's latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers…
Your own AI agent, always on, from $13/mo. 📷
Deep Infra Hosted Agents: OpenClaw (web dashboard) or Hermes (SSH/terminal). One-click setup & updates. Pre-wired for fast inference from second one. Auto-backups + restore. Stop it and pay $0 while idle.
Spin one up →…
📉 Price cut on @NVIDIAAI Nemotron 3 Ultra.
$0.50 in / $2.20 out / $0.10 cached per 1M — output down 12%, cached down 33%.
550B/55B MoE for agentic reasoning and deep research. Multimodal, function calling, 256K…
Discuss this model
Add corrections, implementation notes, pricing changes, or usage caveats for other readers.