Knowledge cutoff
2025-01
Input modalities
Output modalities
Capabilities
262,000 tokens
Recent tweets and retweets from Baseten
The new AgentPerf benchmark by @ArtificialAnlys shows that @NVIDIAAI Blackwell delivers best performance for demanding agentic workloads. With NVIDIA, we're continuously investing in making your coding agents run fast, scale seamlessly, and cost…
We're thrilled to be working with the Harvey team to push open models to frontier-level performance for legal AI.
Shout out to @gabepereyra for the great article. LAB was key to our joint work post-training open-weight models for legal agents.
Congrats to the MiniMax team on the open-source launch of M3!
There are very few <500bn-parameter models that can tackle coding, agentic workloads, and multimodal, all with a 1M-token context window, but M3 does it all.
Dig in here: baseten.co/library/minimax-m…
We've heard from customers that they ship model updates >50% more often with rolling deploys than their previous solutions.
No downtime, parallel GPU fleet, or off-hours babysitting. Rolling deploys are autoscaling-aware, and you can pause, inspect, or roll back at any step.