Back to CalculatorDeploy Now
GLM Z1 32B
HotZhipu AI's efficient reasoning model at 32B parameters with strong math and code performance
Specifications
SourceArchitectureTEXT
Parameters32B
Familyglm
VRAM (Q4)16.0G
zhipureasoningefficienttrending
Build your Local Rig
Ready to run locally? Shop top-tier GPUs on Amazon for the best performance.
Instant Cloud GPUs
Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.
Quantization Estimates
| Format | VRAM Need | Tier |
|---|---|---|
| FP16 | 64.0 GB | Full Precision |
| Q8_0 | 32.0 GB | High |
| Q6_K | 27.2 GB | Excellent |
| Q5_K_M | 22.4 GB | Great |
| Q4_K_M | 16.0 GB | Sweet Spot |
| Q2_K | 9.6 GB | Emergency |
Share this Model
Send these specs directly to your community.