GLM-5.2 by Together AI | AI model information


Model details

GLM-5.2

Together AI | zai-org/GLM-5.2 | glm


Quick Info

Provider: Together AI
Model key: zai-org/GLM-5.2
Release date: Jun 16, 2026

Cost

Input token cost: $1.40
Output token cost: $4.40
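The two prices above can be turned into a rough per-request estimate. The sketch below assumes the listed figures are USD per 1,000,000 tokens; the page does not state the unit, so check the provider's pricing page before relying on it. The function name `estimate_cost_usd` and the constant names are illustrative, not part of any Together AI API.

```python
# Assumption: prices are quoted in USD per 1,000,000 tokens (unit not stated on the page).
INPUT_USD_PER_MTOK = 1.40   # listed input token cost
OUTPUT_USD_PER_MTOK = 4.40  # listed output token cost
TOKENS_PER_MTOK = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request under the per-million-token assumption."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Hypothetical request: 200k prompt tokens, 50k completion tokens.
    print(f"${estimate_cost_usd(200_000, 50_000):.2f}")
```

Under that assumption, a request with 200,000 input tokens and 50,000 output tokens comes to roughly half a dollar, with output tokens costing a little over three times as much per token as input tokens.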

Limits

Output tokens: 164,000 tokens
Context window


Recent tweets and retweets from Together AI

Jun 24, 2026, 2:41 AM UTC

Read the full story: theinformation.com/newslette…
Open Source Growth Boosts Together AI, Hugging Face
Anthropic and OpenAI are booming, but so are providers of open-source AI models and other cheaper alternatives, thanks to businesses using open-source to control…

Jun 24, 2026, 2:41 AM UTC

400T tokens is what production adoption looks like. Teams are moving real workloads to open models because they want frontier quality, better tokenomics, and more control over inference. Together AI gives them the infrastructure to make that shift.

Jun 23, 2026, 8:43 PM UTC

Single-shot generation still surfaces net-new kernels with no public reference: NeMo vocab-parallel log-probs, Hyena context parallelism, SAM 3 mask suppression. One GEMM + All-Gather kernel hit 87.9µs vs 320.6µs for NCCL. PKB is open. Read more and contribute below. Blog:…

Jun 23, 2026, 8:17 PM UTC

An agentic loop (compile, test, profile, revise) helps. Gemini 3 Pro went from 24 to 35/87 correct, then plateaued after ~20 steps. Feedback fixes syntax, not rank coordination, collective ordering, or transfer-mechanism choice. TMA and NVLS stay almost unused.

Jun 23, 2026, 8:17 PM UTC

Frontier models struggle.
→ Best zero-shot: 28/87 correct, 22 beat the PyTorch + NCCL baseline
→ With 3 attempts: 36/87 correct, but fast1@3 tops out at 31%
Weak models fail to compile. Strong reasoners compile cleanly and return wrong answers.