LocalOps

GLM-5.1

Z.ai's next-generation flagship for agentic engineering. 744B-parameter MoE with 40B active parameters. MIT licensed. #1 open-weight model on SWE-Bench Pro as of April 2026. Trained on Huawei Ascend chips.

Model Specifications

Architecture: TEXT
Parameters: 744B
Family: glm
VRAM (Q4): 372.0 GB
Mixture of Experts: 40B active inference parameters
Tags: #coding #agents #reasoning

Estimated Quantization Sizes

Format            Precision   Est. VRAM    Recommendation
FP16 / BF16       16-bit      1488.0 GB    Uncompressed Base
Q8_0 (High)       8-bit       744.0 GB     Near Lossless
Q6_K              6-bit       558.0 GB     Excellent Balance
Q4_K_M (Popular)  4-bit       372.0 GB     Standard Use
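
The estimates above are consistent with a simple weights-only rule: parameter count times nominal bits per weight, divided by 8 bits per byte. The sketch below reproduces them under that assumption. The function name `estimate_vram_gb` and the `FORMATS` mapping are illustrative, not part of any LocalOps API. Real quantized files usually store extra bits for scales, and inference also needs memory for the KV cache and activations, so actual usage will likely be higher.

```python
def estimate_vram_gb(params_billion: float, bits_per_weight: float) -> float:
    """Weights-only estimate in GB: parameters (billions) x bits per weight / 8."""
    return params_billion * bits_per_weight / 8


# Nominal bit widths as listed in the table (assumption: no per-block overhead).
FORMATS = {"FP16 / BF16": 16, "Q8_0": 8, "Q6_K": 6, "Q4_K_M": 4}

if __name__ == "__main__":
    total_params_billion = 744  # total parameters, not the 40B active subset
    for name, bits in FORMATS.items():
        size = estimate_vram_gb(total_params_billion, bits)
        print(f"{name:<12} {size:7.1f} GB")
```

The estimate uses the full 744B parameter count because every expert in a MoE model has to be resident in memory, even though only 40B parameters are active per token.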
