Llama 4 Scout

meta-llama/llama-4-scout

BackTry Model

Llama 4 Scout

meta-llama/llama-4-scout

BackTry Model

Llama 4 Scout, a 17 billion active parameter model with 16 experts, is the best multimodal model in the world in its class and is more powerful than all previous generation Llama models, while fitting in a single H100 GPU. Additionally, Llama 4 Scout offers an industry-leading context window of 10M and delivers better results than Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across a broad range of widely reported benchmarks.

Added Sep 5, 2025

Model weights

Context Window

328.0K

Max Output

65.5K

Input Price (Auto)

$0.085/1M

Output Price (Auto)

$0.46/1M

Cache Read (Auto)

$0.043/1M

Capabilities

Benchmarks

Performance metrics and benchmarks

Artificial Analysis

LMArena

Vectara

Design Arena

Sourced from Artificial Analysis.

Intelligence Index

10.0

Providers

Auto routing is available for this model. Explicit provider selection is not available.

Loading provider options…

Llama 4 Scout

Llama 4 Scout

Benchmarks

Providers

Reasoning

Coding

Math

Knowledge