Back to CalculatorView API Docs Deploy on RunPodDeploy Now
Llama 4 Behemoth
Flagship 2T foundation model, 16 experts
Specifications
SourceArchitectureTEXT
Parameters2000B
Familyllama4
VRAM (Q4)API
MoE: 288B active.
State-of-the-art dense reasoning, enterprise only
flagshipmetaproprietary
Run in the Cloud
This model requires enterprise-grade VRAM. Rent GPUs on RunPod and start generating.
Instant Cloud GPUs
Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.
Share this Model
Send these specs directly to your community.