Add corrections, implementation notes, pricing changes, or usage caveats for other readers.
Knowledge cutoff
2025-08-31
Input modalities
Output modalities
Capabilities
128,000 tokens
Recent tweets and retweets from Azure
In a world where every GPU matters, Managed Compute in Foundry Models gives you global reach with zero infrastructure overhead by routing your workloads to available capacity automatically and globally.
msft.it/6014vjJno
An open, end-to-end trust stack for AI agents on any framework: run ASSERT to find where your agent fails policy, use ACS to place the right controls at the right checkpoints, then re-run ASSERT to confirm.
Evaluation → enforcement, closed loop: msft.it/6011vj6g9
Stop stitching vendors together to run open models in production. Fireworks AI on Microsoft Foundry is GA. One endpoint. Enterprise SLAs. Zero setup.
Build your next production workload with Fireworks AI on Foundry: msft.it/6018vbCUI
Video
Stop building RAG pipelines from scratch. Foundry IQ is the agent-native knowledge layer behind your Foundry agents. Every source in one knowledge base - Work IQ, Fabric IQ, and now Web IQ. Serverless grounding, zero friction.
Available today. msft.it/6012vb7La
Video
Discuss this model
Add corrections, implementation notes, pricing changes, or usage caveats for other readers.