Llama 4 Scout

Llama 4 Scout is Meta's compact multimodal mixture-of-experts model with 17B active parameters and 16 experts (109B total parameters). According to Meta, it fits on a single H100 GPU with Int4 quantization, offers an industry-leading 10M-token context window, and outperforms Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across a range of benchmarks.
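
The single-GPU claim can be sanity-checked with rough arithmetic. The sketch below estimates memory for the weights alone. It assumes the 80 GB H100 variant, and it ignores the KV cache, activations, and quantization metadata, so treat it as a lower bound rather than a deployment guide. The function and constant names are illustrative, not from Meta. All 109B parameters count toward memory because every expert must be loaded, even though only 17B are active per token.

```python
def weight_memory_gb(total_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return total_params * bits_per_param / 8 / 1e9


TOTAL_PARAMS = 109e9   # total parameters, as stated in the description
H100_MEMORY_GB = 80    # assumption: the 80 GB H100 variant

int4_gb = weight_memory_gb(TOTAL_PARAMS, 4)    # Int4: half a byte per parameter
bf16_gb = weight_memory_gb(TOTAL_PARAMS, 16)   # 16-bit: two bytes per parameter

print(f"Int4 weights:   {int4_gb:.1f} GB (fits in 80 GB: {int4_gb < H100_MEMORY_GB})")
print(f"16-bit weights: {bf16_gb:.1f} GB (fits in 80 GB: {bf16_gb < H100_MEMORY_GB})")
```

Under these assumptions the Int4 weights leave some headroom on one card, while 16-bit weights would need several GPUs. Real memory use grows with context length, so very long contexts need far more than the weights alone.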

Capabilities: Conversation, Reasoning, Code Generation, Analysis
Provider: Meta
Release Date: April 5, 2025
Size: Medium
Parameters: 17B active (109B total)

Benchmark Performance

Performance metrics on industry standard AI benchmarks that measure capabilities across reasoning, knowledge, and specialized tasks.

Context Length

10M tokens
Source: meta.com