Nemotron 3 Ultra 550B A55B by Together AI

Use Kimi K3 in OpenCode Go. Get $5 off your first month.

$5 OFF

Discuss

Model details

Nemotron 3 Ultra 550B A55B

Together AInvidia/nemotron-3-ultra-550b-a55bnemotron

Open provider page Provider docs

Quick Info

Provider: Together AI
Model key: nvidia/nemotron-3-ultra-550b-a55b
Release date: Jun 4, 2026

Cost

Input token cost: $0.60
Output token cost: $3.60

Limits

Output tokens: 512,300 tokens
Context window

Latest news about Nemotron 3 Ultra 550B A55B

Videos about Nemotron 3 Ultra 550B A55B

Recent tweets and retweets from Together AI

Jul 16, 2026, 11:20 PMUTC

Together AI gives developers a fully managed path to run Inkling without operating the multi-node serving infrastructure behind a model of this scale. Try it now: together.ai/models/inkling Link Inkling API | Together AI 975B parameter (41B active) multimodal MoE model…

Jul 16, 2026, 11:20 PMUTC

A 975B open-weight MoE with 41B active parameters, 1M context window, and native text, image, and audio understanding. Now available through Together Serverless Inference. Video

Jul 16, 2026, 11:20 PMUTC

Introducing Inkling from Thinking Machines Lab on Together AI. A multimodal MoE model built for token-efficient reasoning, with native text, image, and audio input support. Supports controllable inference effort, optimized with Together’s FlashAttention-4-based kernel.

Jul 16, 2026, 7:43 PMUTC

60x lower costs by switching from closed to open models. This is the AI economics story.

Jul 16, 2026, 4:47 PMUTC

Consistently impressed by the horsepower of @togethercompute and the people behind it! Thank you for your partnership!