
Voxtral Mini 4B Realtime

Mistral's multilingual real-time speech transcription model with configurable latency (240 ms to 2.4 s) and support for 13 languages.

Specifications

Architecture: AUDIO
Parameters: 4B
Family: voxtral
VRAM (Q4): 4 GB
4B-parameter ASR model that matches offline transcription accuracy at under 500 ms delay. Released under the Apache 2.0 license.
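
As a rough sanity check on the listed VRAM (Q4) figure, the weights-only footprint can be estimated from the parameter count and the bits stored per weight. This is a minimal sketch, not how LocalOps computes its number: the function name and the flat 4 bits per weight are assumptions for illustration. The listed figure is larger than this weights-only estimate, presumably because it also leaves room for activations, caches, and runtime overhead.

```python
def estimate_weight_memory_gb(num_params: float, bits_per_weight: float = 4.0) -> float:
    """Weights-only memory in GB (10^9 bytes).

    Hypothetical helper: ignores activations, caches, and runtime overhead,
    and assumes every weight is stored at exactly `bits_per_weight` bits.
    """
    return num_params * bits_per_weight / 8 / 1e9


# 4B parameters at an assumed 4 bits per weight
weights_gb = estimate_weight_memory_gb(4e9)
print(f"{weights_gb:.1f} GB")  # prints "2.0 GB"
```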
Tags: mistral, stt, speech-to-text, realtime, trending
