
Phi-4 Multimodal

Vision + speech multimodal small model

Specifications

Architecture: VISION
Parameters: 5.6B
Family: phi
VRAM (Q4): 2.8 GB
Tags: multimodal, microsoft, vision
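The page does not say how the Q4 VRAM figure is derived. One plausible reading, offered here as an assumption, is a weights-only estimate: parameter count times 4 bits per weight, divided by 8 bits per byte. The sketch below reproduces that arithmetic; the function name `estimate_weight_memory_gb` is hypothetical and introduced only for illustration.

```python
def estimate_weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Weights-only estimate in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at exactly `bits_per_weight` bits.
    This ignores KV cache, activations, and any quantization metadata.
    """
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 1e9


# Parameter count taken from the spec listing (5.6B).
PHI4_MULTIMODAL_PARAMS = 5.6e9

q4_gb = estimate_weight_memory_gb(PHI4_MULTIMODAL_PARAMS, bits_per_weight=4)
fp16_gb = estimate_weight_memory_gb(PHI4_MULTIMODAL_PARAMS, bits_per_weight=16)

print(f"Q4 weights:   {q4_gb:.1f} GB")
print(f"FP16 weights: {fp16_gb:.1f} GB")
```

Under this assumption the Q4 result matches the listed 2.8 GB. Actual memory use at run time will likely be higher, since context length, image and audio inputs, and runtime overhead all add to the weights-only figure.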
