LocalOps
Qwen3 Embedding 4B

Mid-size Qwen3 embedding model with 4B parameters. It offers strong multilingual retrieval with lower VRAM requirements than the 8B variant. Apache 2.0 license, 32K context.

Model Specifications

Architecture: EMBEDDING
Parameters: 4B
Family: qwen3-embed
VRAM (Q4): 2.0 GB
Good balance of quality and resource use for mid-tier hardware. Compatible with all standard inference backends.
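
The Q4 VRAM figure above is consistent with a simple weights-only estimate: parameter count times bits per weight, divided by 8 to get bytes. The sketch below shows that arithmetic. It assumes exactly 4 billion parameters and exactly 4 bits per weight, and the function name weight_memory_gb is made up for illustration. Real quantized files also store scaling metadata, and inference needs extra memory for activations, so actual usage may be somewhat higher.

```python
def weight_memory_gb(params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    bytes_total = params * bits_per_weight / 8  # 8 bits per byte
    return bytes_total / 1e9


# Assumption: "4B" is treated as exactly 4e9 parameters.
q4_gb = weight_memory_gb(4e9, 4)     # 4-bit quantized weights
fp16_gb = weight_memory_gb(4e9, 16)  # unquantized half-precision weights, for comparison

print(f"Q4: {q4_gb:.1f} GB, FP16: {fp16_gb:.1f} GB")
```

Under these assumptions the Q4 estimate is 2.0 GB, matching the listed figure, while the same weights at 16 bits would need about four times as much.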
#alibaba #qwen #embedding #retrieval #rag #multilingual #apache2
