Back to CalculatorDeploy Now
Phi-4 Multimodal
Vision + speech multimodal small model
Specifications
SourceArchitectureVISION
Parameters5.6B
Familyphi
VRAM (Q4)2.8G
multimodalmicrosoftvision
Build your Local Rig
Ready to run locally? Shop top-tier GPUs on Amazon for the best performance.
Instant Cloud GPUs
Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.
Share this Model
Send these specs directly to your community.