Llama 4 Scout
Meta's first open-weight, natively multimodal MoE model, with 16 experts and an industry-leading 10M-token context window
Specifications
Source:
Architecture: VISION
Parameters: 109B
Family: llama
VRAM (Q4): 54.5 GB
MoE: 17B active parameters per token.
Fits on a single H100 GPU with int4 quantization
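The Q4 VRAM figure follows from simple arithmetic on the parameter count, and a rough sketch of that estimate is shown here. It assumes exactly 4 bits per weight, decimal gigabytes, and an 80 GB H100; the function name and constants are illustrative, not part of any library. It counts weights only, so KV cache, activations, and quantization metadata (scales, zero points) would add to the total, and the KV cache in particular grows with context length.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


TOTAL_PARAMS = 109e9    # total parameters, from the spec list
ACTIVE_PARAMS = 17e9    # parameters active per token (MoE)
H100_VRAM_GB = 80       # assumption: the 80 GB H100 variant

# All experts must be resident in memory, so the total count is what matters
# for VRAM, even though only the active subset is used for each token.
q4_gb = weight_memory_gb(TOTAL_PARAMS, 4)
fp16_gb = weight_memory_gb(TOTAL_PARAMS, 16)
headroom_gb = H100_VRAM_GB - q4_gb

print(f"int4 weights: {q4_gb:.1f} GB")
print(f"fp16 weights: {fp16_gb:.1f} GB")
print(f"headroom on one H100 at int4: {headroom_gb:.1f} GB")
```

Under these assumptions the int4 weights come to 54.5 GB, matching the listed figure, while 16-bit weights would need 218 GB and would not fit on a single 80 GB card.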
Tags: meta, moe, multimodal, long-context, trending