Add corrections, implementation notes, pricing changes, or usage caveats for other readers.
Input modalities
Output modalities
Capabilities
262,000 tokens
Recent tweets and retweets from Fireworks AI
Fireworks was named to @Redpoint's InfraRed 100, which recognizes the companies building the foundation for the next wave of AI.
We're just getting started.
Come build with us: fireworks.ai/careers
We spent the week at #MSBuild talking about one thing: fine-tuning has gone from "maybe not worth it" to your actual competitive moat.
@lqiao sat down with @yina_arenas to break down why and what Fireworks + @Microsoft Foundry makes possible when you stop treating models as…
NVIDIA Nemotron 3 Ultra is on Fireworks, day zero.
Nemotron Ultra is an open model for frontier reasoning and orchestration in long-running autonomous agents.
Think use cases like coding agents, deep research, and complex enterprise workflows.
Read on:…
Many research labs consider inference efficiency only after the fact. Step 3.7 Flash is a 198B sparse MoE VLM designed by @StepFun_ai for inference from the start, with a 196B language backbone and a 1.8B vision encoder.
Built for real-world agent workloads, running at up to 400…