LocalOps LogoLocalOps
Back to Calculator

GLM-5

Hot

Z.ai's frontier MoE model (745B total, 44B active), trained entirely on Huawei Ascend chips. SOTA open-source SWE-bench performance (77.8%), focuses on agentic engineering and coding. Uses DeepSeek Sparse Attention.

Specifications

Source
ArchitectureTEXT
Parameters745B
Familyglm
VRAM (Q4)372.5G
MoE: 44B active.
flagshipzhipumoetrending

Run in the Cloud

This model requires enterprise-grade VRAM. Rent GPUs on RunPod and start generating.

Deploy on RunPod

Instant Cloud GPUs

Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.

Deploy Now

Quantization Estimates

FormatVRAM NeedTier
FP161490.0 GBFull Precision
Q8_0745.0 GBHigh
Q6_K633.3 GBExcellent
Q5_K_M521.5 GBGreat
Q4_K_M372.5 GBSweet Spot
Q2_K223.5 GBEmergency

Share this Model

Send these specs directly to your community.

Post