LocalOps

Llama 3 8B Instruct

Highly capable open model for chat and instruction following

Specifications

Architecture: TEXT
Parameters: 8.03B
Family: llama
VRAM (Q4): 4.0 GB
Tags: chat, meta, popular, trending

Quantization Estimates

Format    VRAM Need   Tier
FP16      16.1 GB     Full Precision
Q8_0      8.0 GB      High
Q6_K      6.8 GB      Excellent
Q5_K_M    5.6 GB      Great
Q4_K_M    4.0 GB      Sweet Spot
Q2_K      2.4 GB      Emergency
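
The estimates above look consistent with a simple weights-only calculation: parameter count times bits per weight, divided by 8, in decimal GB. The sketch below reproduces the table under that assumption. The bits-per-weight values are back-solved from the listed figures and are not taken from the page or from any quantization library; the names `ASSUMED_BITS_PER_WEIGHT`, `estimate_vram_gb`, and `formats_that_fit` are hypothetical.

```python
# ASSUMPTION: effective bits per weight, back-solved so that the formula
# reproduces the table above. These are not official figures.
ASSUMED_BITS_PER_WEIGHT = {
    "FP16": 16.0,
    "Q8_0": 8.0,
    "Q6_K": 6.75,
    "Q5_K_M": 5.6,
    "Q4_K_M": 4.0,
    "Q2_K": 2.4,
}

PARAMS = 8.03e9  # parameter count listed under Specifications


def estimate_vram_gb(params: float, bits_per_weight: float) -> float:
    """Weights-only estimate in decimal GB, rounded to one decimal place.

    Ignores KV cache, activations, and runtime overhead.
    """
    return round(params * bits_per_weight / 8 / 1e9, 1)


def formats_that_fit(vram_gb: float, params: float = PARAMS) -> list[str]:
    """Return the formats whose weights-only estimate fits a VRAM budget."""
    return [
        name
        for name, bits in ASSUMED_BITS_PER_WEIGHT.items()
        if estimate_vram_gb(params, bits) <= vram_gb
    ]


if __name__ == "__main__":
    for name, bits in ASSUMED_BITS_PER_WEIGHT.items():
        print(f"{name:<8}{estimate_vram_gb(PARAMS, bits):>5.1f} GB")
```

Treat these numbers as a lower bound: actual memory use also depends on context length (KV cache) and runtime overhead, so leave headroom when picking a format for a given GPU.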