LocalOps LogoLocalOps
Back to Calculator

Llama 4 Scout

Consumer flagship MoE, 16 experts, 10M context

Specifications

Source
ArchitectureTEXT
Parameters109B
Familyllama4
VRAM (Q4)54.5G
MoE: 17B active.
Runs on single H100, massive context window
chatmetamultimodalpopular

Build your Local Rig

Ready to run locally? Shop top-tier GPUs on Amazon for the best performance.

Instant Cloud GPUs

Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.

Deploy Now

Quantization Estimates

FormatVRAM NeedTier
FP16218.0 GBFull Precision
Q8_0109.0 GBHigh
Q6_K92.6 GBExcellent
Q5_K_M76.3 GBGreat
Q4_K_M54.5 GBSweet Spot
Q2_K32.7 GBEmergency

Share this Model

Send these specs directly to your community.

Post