LocalOps LogoLocalOps
Back to Calculator

Magistral Small

Hot

Mistral AI's first open-weight reasoning model — 24B parameters under Apache 2.0. Chain-of-thought reasoning in 20+ languages, 70.7% on AIME2024. Fits on a single RTX 4090 or 32GB MacBook.

Specifications

Source
ArchitectureTEXT
Parameters24B
Familymagistral
VRAM (Q4)12.0G
128K context window but performance may degrade past 40K tokens. Excellent for domain-specific reasoning tasks.
mistralreasoningapache2multilingualmathtrending

Build your Local Rig

Ready to run locally? Shop top-tier GPUs on Amazon for the best performance.

Instant Cloud GPUs

Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.

Deploy Now

Quantization Estimates

FormatVRAM NeedTier
FP1648.0 GBFull Precision
Q8_024.0 GBHigh
Q6_K20.4 GBExcellent
Q5_K_M16.8 GBGreat
Q4_K_M12.0 GBSweet Spot
Q2_K7.2 GBEmergency

Share this Model

Send these specs directly to your community.

Post