Back to Calculator Deploy on RunPodDeploy Now
Jamba 1.5 Large
Hybrid Transformer-Mamba, 256k context
Specifications
SourceArchitectureTEXT
Parameters398B
Familyjamba
VRAM (Q4)199.0G
MoE: 94B active.
Linear attention scaling for massive docs
hybridai21long-contextmoe
Run in the Cloud
This model requires enterprise-grade VRAM. Rent GPUs on RunPod and start generating.
Instant Cloud GPUs
Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.
Quantization Estimates
| Format | VRAM Need | Tier |
|---|---|---|
| FP16 | 796.0 GB | Full Precision |
| Q8_0 | 398.0 GB | High |
| Q6_K | 338.3 GB | Excellent |
| Q5_K_M | 278.6 GB | Great |
| Q4_K_M | 199.0 GB | Sweet Spot |
| Q2_K | 119.4 GB | Emergency |
Share this Model
Send these specs directly to your community.