Highlights:
👉 Native multimodality across text, image, and video
👉 1M context with MiniMax Sparse Attention for long-context workloads
👉 9x prefill and 15x decode speedups vs M2 at 1M context
👉 Thinking mode for complex reasoning, non-thinking mode for…
MiniMax-M3 from @MiniMax_AI is now available on Together AI.
It’s an open-weight native multimodal model with 1M context, MiniMax Sparse Attention, and thinking / non-thinking modes.
Together AI is MiniMax’s preferred cloud partner, with inference optimizations delivering up…
Discuss this model
Add corrections, implementation notes, pricing changes, or usage caveats for other readers.