LocalOps LogoLocalOps
Back to Calculator

Qwen3 Reranker 8B

Alibaba's top cross-encoder reranker at 8B parameters — state-of-the-art on multilingual text retrieval benchmarks. Instruction-aware for task-specific ranking. Apache 2.0.

Specifications

Source
ArchitectureEMBEDDING
Parameters8B
Familyqwen3-embed
VRAM (Q4)4.0G
Pair with Qwen3-Embedding-8B for a full dense retrieval + reranking pipeline. Use TEI or vLLM for serving.
alibabaqwenrerankerretrievalragmultilingualapache2

Build your Local Rig

Ready to run locally? Shop top-tier GPUs on Amazon for the best performance.

Instant Cloud GPUs

Running out of VRAM? Rent a high-end H100 or RTX 4090 on RunPod and deploy in seconds.

Deploy Now

Share this Model

Send these specs directly to your community.

Post