Llama 4 Scout 17B by Deep Infra | AI model information

Discuss

Model details

Llama 4 Scout 17B

Deep Inframeta-llama/Llama-4-Scout-17B-16E-Instructllama

Open provider page Provider docs

Quick Info

Provider: Deep Infra
Model key: meta-llama/Llama-4-Scout-17B-16E-Instruct
Release date: Apr 5, 2025

Cost

Input token cost: $0.10
Output token cost: $0.30

Limits

Output tokens: 16,384 tokens
Context window

Latest news about Llama 4 Scout 17B

No articles yet. Fetch the latest news to show it here.

Videos about Llama 4 Scout 17B

Recent tweets and retweets from Deep Infra

Jul 16, 2026, 4:20 PMUTC

1B BF16 deepinfra.com/nvidia/Nemotro… Link nvidia/Nemotron-3-Embed-1B-BF16 - Demo - DeepInfra Nemotron-3-Embed-1B-BF16 is a compact multilingual text embedding model from NVIDIA, pruned and distilled from Ministral-3 to ~1B parameters, that maps text into 2048-dimensional…

Jul 16, 2026, 4:20 PMUTC

1B NVFP4 deepinfra.com/nvidia/Nemotro… Link nvidia/Nemotron-3-Embed-1B-NVFP4 - Demo - DeepInfra Nemotron-3-Embed-1B-NVFP4 is the NVFP4-quantized version of Nemotron-3-Embed-1B-BF16 — a multilingual text embedding model from NVIDIA that maps text into 2048-dimensional dense…

Jul 16, 2026, 4:20 PMUTC

8B Frontier Accuracy deepinfra.com/nvidia/Nemotro… Link nvidia/Nemotron-3-Embed-8B - Demo - DeepInfra Nemotron-3-Embed-8B is a multilingual text embedding model from NVIDIA, based on Ministral-3-8B, that maps text into 4096-dimensional dense vectors for retrieval and…

Jul 16, 2026, 4:20 PMUTC

Retrieval is where AI agents live or die. Miss the right context and the agent reasons from the wrong place. @nvidia Nemotron 3 Embed is now on DeepInfra with Day-0 support, - the leading, embedding model for enterprise search. RAG, and agentic retrieval. Tops the RTEB…

Jul 8, 2026, 8:27 PMUTC

Big milestone for DeepInfra 📷 We've opened our first international #datacenter in Toronto! The new facility will host 1,000+ NVIDIA Blackwell B300 GPUs, expanding our production-scale AI inference capacity while bringing lower-latency infrastructure closer to customers. As…