Back to Calculator
Voxtral Mini 4B Realtime
HotMistral's multilingual real-time speech transcription model with configurable latency (240ms-2.4s), supports 13 languages
Model Specifications
ArchitectureAUDIO
Parameters4B
Familyvoxtral
VRAM (Q4)4GB
4B ASR model, <500ms delay matching offline transcription accuracy. Apache 2.0 license
Share this Model
Send this model's specs directly to your community.
Similar Models
Related Guides
How much VRAM do you really need?
A complete breakdown of quantization levels and VRAM overhead for running local models.
Best GPUs for Machine Learning in 2026
Comparing NVIDIA and AMD options for the best speed-to-dollar ratio.
GGUF vs EXL2 vs AWQ
Understanding local AI formats and which one to pick for your specific hardware.