Back to Calculator
Gemma 4 E2B
HotGoogle's most compact Gemma 4 model with effective 2B parameters, multimodal (text/image/audio), built for on-device deployment
Model Specifications
ArchitectureVISION
Parameters2B
Familygemma
VRAM (Q4)1.0GB
Uses Per-Layer Embeddings (PLE) for parameter efficiency. 128K context. Apache 2.0
Share this Model
Send this model's specs directly to your community.
Similar Models
Related Guides
How much VRAM do you really need?
A complete breakdown of quantization levels and VRAM overhead for running local models.
Best GPUs for Machine Learning in 2026
Comparing NVIDIA and AMD options for the best speed-to-dollar ratio.
GGUF vs EXL2 vs AWQ
Understanding local AI formats and which one to pick for your specific hardware.