Qwen3.5-Omni Plus

Alibaba's flagship omni-modal model. It processes text, images, audio, and video natively, using a Thinker-Talker MoE architecture with real-time streaming speech output, a 256K context window, and speech recognition in 113 languages.

Specifications

Architecture: AUDIO
Parameters: 30B
Family: qwen3.5
VRAM (Q4): 15.0 GB
MoE: 3B active
The Plus (30B-A3B) and Flash variants are API-only via DashScope as of March 31, 2026. Public weights have not yet been confirmed.
Tags: alibaba, qwen, multimodal, audio, video, moe, realtime, omni, trending
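
The VRAM (Q4) figure appears consistent with a weights-only estimate: 30B parameters at 4 bits each comes to about 15 GB. The sketch below reproduces that arithmetic. The function name and the decimal-gigabyte convention are assumptions made for illustration, not something the page specifies, and the estimate ignores KV cache, activations, and runtime overhead. In an MoE model all expert weights normally stay resident, so the 3B active count affects compute per token rather than the weight footprint.

```python
def estimate_weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Rough weights-only memory estimate in decimal gigabytes (1 GB = 1e9 bytes).

    Hypothetical helper for illustration: it does not account for KV cache,
    activations, quantization metadata, or framework overhead.
    """
    total_params = params_billion * 1e9
    total_bytes = total_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / 1e9


if __name__ == "__main__":
    # 30B total parameters at 4-bit quantization (Q4), as listed in the specs.
    print(f"Q4 weights: {estimate_weight_memory_gb(30.0, 4):.1f} GB")
    # The same parameter count at 16-bit precision, for comparison.
    print(f"FP16 weights: {estimate_weight_memory_gb(30.0, 16):.1f} GB")
```

Real deployments typically need headroom beyond this number, especially with long contexts, so treat the result as a lower bound.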