Text Model
Can I Run Llama 3.1 8B Locally?
Best small model for most tasks
System Configuration
Hardware configuration used for this compatibility check
VRAM: 12 GB
Bandwidth: 504 GB/s
TDP: 285 W
System RAM: 32 GB
Type: dedicated
Compatibility Result
Based on your selected hardware
Runs with Offload
VRAM Usage: 20.3 GB / 12 GB
Est. Speed: ~32.8 T/s
Context (KV): 14.47 GB
Disk Space: 4.8 GB
41% of layers will be offloaded to system RAM. This will significantly reduce generation speed.
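The offload percentage above follows directly from the VRAM figures: whatever portion of the model's total memory footprint does not fit in VRAM must spill into system RAM. A minimal sketch of that calculation (the function name and the assumption that layers are offloaded in proportion to memory are illustrative, not the tool's actual method):

```python
def offload_fraction(required_gb: float, vram_gb: float) -> float:
    """Estimate the fraction of the model that must live in system RAM.

    Assumes layers are roughly uniform in size, so the offloaded
    memory fraction approximates the offloaded layer fraction.
    """
    if required_gb <= vram_gb:
        return 0.0  # everything fits in VRAM, no offload needed
    return (required_gb - vram_gb) / required_gb

# Values from the checker above: 20.3 GB required, 12 GB VRAM.
frac = offload_fraction(20.3, 12.0)
print(f"{frac:.0%} of layers offloaded to system RAM")  # → 41%
```

The slowdown warning follows from the same split: offloaded layers are read over the much slower CPU memory path each token, so even a modest offload fraction can dominate generation time.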
Similar Models
Llama 4 Behemoth (2000B): Flagship 2T foundation model, 16 experts (flagship, meta)
Llama 4 Maverick (400B): High-efficiency MoE, 128 experts, 1M context (chat, meta)
Llama 4 Scout (109B): Consumer flagship MoE, 16 experts, 10M context (chat, meta)