Text Model
Can I Run MOSS Locally?
First open Chinese conversational LLM
System Configuration
Configure your hardware to check compatibility
VRAM12GB
Bandwidth504 GB/s
TDP285W
System RAM32GB
Typededicated
Compatibility Result
Based on your selected hardware
Runs with Offload
VRAM Usage38.4GB / 12GB
Est. Speed~9.8 T/s
Context (KV)
27.84 GB
Disk Space
9.6 GB
69% of layers will be offloaded to system RAM. This will significantly reduce generation speed.
Similar Models
OLMo 2 13B
13.3BFully open research model
researchai2
SDXS-512
0.1BTiny real-time model (0.1s latency)
artmobile
Stable Cascade
5.1B3-stage compression architecture (C+B)
artresearch