LocalOps LogoLocalOps
Back to Calculator

Llama 4 Scout

Hot

Meta's first open-weight natively multimodal MoE model with 16 experts and an industry-leading 10M token context window

Specifications

Source
ArchitectureVISION
Parameters109B
Familyllama
VRAM (Q4)54.5G
MoE: 17B active.
Fits on a single H100 GPU with int4 quantization
metamoemultimodallong-contexttrending

Build your Local Rig

Ready to run locally? Shop top-tier GPUs on Amazon for the best performance.

Instant Cloud GPUs

Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.

Deploy Now

Share this Model

Send these specs directly to your community.

Post