LocalOps

System Configuration

Configure your hardware to check model compatibility

VRAM12GB
Bandwidth504 GB/s
TDP285W
System RAM32GB
Typededicated
Showing 492 of 492 models227 compatible •163 with offload •102 incompatible

Isaac GR00T N1

Roboticsgr00t

NVIDIA humanoid foundation model

1GB / 12GB0.0GB disk
Ready to Run
View

HumanPlus

Roboticshumanplus

Shadows human motion to robot

1GB / 12GB0.0GB disk
Ready to Run
View

SwinIR

Imageswinir

Transformer image restoration

4GB / 12GB4GB disk
Ready to Run
View

Real-ESRGAN

Imageesrgan

Image super-resolution

4GB / 12GB4GB disk
Ready to Run
View

Piper TTS

Audiopiper

Ultra-fast local TTS

1GB / 12GB1GB disk
Ready to Run
View

YOLOv10

Visionyolo

Latest YOLO model

6GB / 12GB6GB disk
Ready to Run
View

BGE Small EN

Embeddingbge

Fast English embeddings

1GB / 12GB1GB disk
Ready to Run
View

GFP-GAN

Imagegfpgan

Face restoration

6GB / 12GB6GB disk
Ready to Run
View

Whisper Tiny

Audiowhisper

Minimal ASR model

1GB / 12GB1GB disk
Ready to Run
View

YOLOv8 L

Visionyolo

Large YOLO variant

6GB / 12GB6GB disk
Ready to Run
View

YOLOv11 X

Imageyolo

Latest YOLO object detection flagship

4GB / 12GB4GB disk
Ready to Run
View

CodeFormer

Imagecodeformer

Robust face restoration

6GB / 12GB6GB disk
Ready to Run
View

YOLOv8 X

Visionyolo

Object detection flagship

8GB / 12GB8GB disk
Ready to Run
View

BRIA RMBG 2.0

Imagebria

Next-gen background removal, video support

4GB / 12GB4GB disk
Ready to Run
View

Kokoro 82M

Audiokokoro

Highest quality lightweight TTS

2GB / 12GB2GB disk
Ready to Run
View

Wav2Vec2 Base

Audiowav2vec

Compact ASR

2GB / 12GB2GB disk
Ready to Run
View

SDXS-512

Imagesd

Tiny real-time model (0.1s latency)

2GB / 12GB2GB disk
Ready to Run
View

Octo Base

Roboticsocto

Open source robot manipulation policy

1.1GB / 12GB0.1GB disk
Ready to Run
View

MaskGCT

Audiomaskgct

Zero-shot TTS/Voice conversion

1.1GB / 12GB0.1GB disk
Ready to Run
View

VITS

Audiovits

Fast variational inference TTS

2GB / 12GB2GB disk
Ready to Run
View

PaddleOCR V4

Visionpaddleocr

Multilingual OCR

2GB / 12GB2GB disk
Ready to Run
View

DWPose

Visiondwpose

Human pose estimation

4GB / 12GB4GB disk
Ready to Run
View

IP-Adapter Plus

Imageip-adapter

Image prompt adapter

6GB / 12GB6GB disk
Ready to Run
View

IP-Adapter FaceID

Imageip-adapter

Face ID preservation

6GB / 12GB6GB disk
Ready to Run
View

BRIA RMBG

Imagebria

Commercial background removal

4GB / 12GB4GB disk
Ready to Run
View

E5 Base V2

Embeddinge5

Balanced embeddings

2GB / 12GB2GB disk
Ready to Run
View

LayoutLMv3

Visionlayoutlm

Document understanding

4GB / 12GB4GB disk
Ready to Run
View

SmolLM2 135M

Textsmollm

Smallest practical language model

1.8GB / 12GB0.1GB disk~200.0 T/s
Ready to Run
View

Nomic Embed Text v1.5

Embeddingnomic

Long context embeddings (8192)

2GB / 12GB2GB disk
Ready to Run
View

SpeechT5 TTS

Audiospeecht5

Unified speech/text model

2GB / 12GB2GB disk
Ready to Run
View

ModernBERT Base

Embeddingbert

Fast and accurate

1GB / 12GB1GB disk
Ready to Run
View

StyleTTS 2

Audiostyletts

Fast expressive TTS

4GB / 12GB4GB disk
Ready to Run
View

RemBG

Imagerembg

Background removal

4GB / 12GB4GB disk
Ready to Run
View

Moonshine Base

Audiomoonshine

Fast ASR optimized for resource constrained

2GB / 12GB2GB disk
Ready to Run
View

LGM

3Dlgm

High-res Gaussian splatting

10GB / 12GB10GB disk
Ready to Run
View

Allegro

Videoallegro

Rhymes AI open video model

1.1GB / 12GB0.1GB disk
Ready to Run
View

Donut Base

Visiondonut

OCR-free document understanding

4GB / 12GB4GB disk
Ready to Run
View

EasyOCR

Visioneasyocr

Easy multilingual OCR

4GB / 12GB4GB disk
Ready to Run
View

OpenPose

Visionopenpose

Multi-person pose detection

4GB / 12GB4GB disk
Ready to Run
View

BiRefNet

Imagebirefnet

Bilateral reference high-resolution segmentation

4GB / 12GB4GB disk
Ready to Run
View

SAM 2

Visionsam

Video segmentation

8GB / 12GB8GB disk
Ready to Run
View

SAM 2.1 Large

Imagesam

Segment Anything in images and video, improved accuracy

8GB / 12GB8GB disk
Ready to Run
View

Whisper Small

Audiowhisper

Fast ASR

2GB / 12GB2GB disk
Ready to Run
View

Marvis TTS 250M

Audiomarvis

Real-time streaming voice cloning

4GB / 12GB4GB disk
Ready to Run
View

Gemma 3 270M

Textgemma3

Ultra-compact on-device model

2.2GB / 12GB0.2GB disk~200.0 T/s
Ready to Run
View

F5-TTS

Audiof5

Zero-shot voice cloning

6GB / 12GB6GB disk
Ready to Run
View

CosyVoice

Audiocosyvoice

Alibaba's multilingual TTS

6GB / 12GB6GB disk
Ready to Run
View

MeloTTS

Audiomelo

Multilingual fast TTS

4GB / 12GB4GB disk
Ready to Run
View

MusicGen Small

Audiomusicgen

Fast music generation

6GB / 12GB6GB disk
Ready to Run
View

Wav2Vec2 Large

Audiowav2vec

Self-supervised ASR

4GB / 12GB4GB disk
Ready to Run
View

mxbai-embed-large

Embeddingmxbai

Mixed bread AI embeddings

2GB / 12GB2GB disk
Ready to Run
View

BGE Large EN

Embeddingbge

English embeddings

3GB / 12GB3GB disk
Ready to Run
View

GTE Large

Embeddinggte

General text embeddings

3GB / 12GB3GB disk
Ready to Run
View

E5 Large V2

Embeddinge5

Contrastive embeddings

3GB / 12GB3GB disk
Ready to Run
View

Depth Anything V2 Large

Visiondepth-anything

Monocular depth estimation

6GB / 12GB6GB disk
Ready to Run
View

Depth Anything V2 Large

Imagedepth-anything

High-quality monocular depth estimation

4GB / 12GB4GB disk
Ready to Run
View

ZoeDepth

Visionzoedepth

Metric depth estimation

6GB / 12GB6GB disk
Ready to Run
View

SmolLM2 360M

Textsmollm

Nanoscale efficient model

2.4GB / 12GB0.2GB disk~200.0 T/s
Ready to Run
View

ModernBERT Large

Embeddingbert

8k context, modern architecture

3GB / 12GB3GB disk
Ready to Run
View

ModernBERT Embed Large

Textmodernbert

Modernized BERT for efficient retrieval, 8K context

2.5GB / 12GB0.2GB disk~200.0 T/s
Ready to Run
View

AnimateDiff

Videoanimatediff

Turn any SD model into video

10GB / 12GB10GB disk
Ready to Run
View

Stella v5 400M

Embeddingstella

SOTA commercial-friendly embedding

1.2GB / 12GB0.2GB disk
Ready to Run
View

Orpheus 400M

Audioorpheus

Efficient TTS

4GB / 12GB4GB disk
Ready to Run
View

SigLIP SO400M

Visionsiglip

Google improved CLIP

4GB / 12GB4GB disk
Ready to Run
View

Stella EN 400M

Embeddingstella

Efficient embeddings

3GB / 12GB3GB disk
Ready to Run
View

Chatterbox Turbo

Audiochatterbox

Low-latency high-performance TTS

4GB / 12GB4GB disk
Ready to Run
View

CLIP ViT-L/14

Visionclip

Vision-language alignment

4GB / 12GB4GB disk
Ready to Run
View

Qwen 2.5 0.5B

Textqwen

Smallest Qwen variant

2.6GB / 12GB0.3GB disk~200.0 T/s
Ready to Run
View

CosyVoice 2 (0.5B)

Audiocosyvoice

Streaming speech synthesis foundation

4GB / 12GB4GB disk
Ready to Run
View

CosyVoice 2 Instruct

Audiocosyvoice

Fine-grained emotional control

4GB / 12GB4GB disk
Ready to Run
View

XTTS v2

Audioxtts

High quality voice cloning

6GB / 12GB6GB disk
Ready to Run
View

InstantMesh

3Dinstantmesh

Fast image to 3D mesh

12GB / 12GB12GB disk
Ready to Run
View

TripoSR

3Dtriposr

Single image to 3D

8GB / 12GB8GB disk
Ready to Run
View

MeshAnything V2

3Dmesh-anything

Artist-created mesh alignment

1.3GB / 12GB0.3GB disk
Ready to Run
View

Chatterbox

Audiochatterbox

Natural voice cloning TTS

6GB / 12GB6GB disk
Ready to Run
View

SmolVLM 500M

Textsmolvlm

Ultra-compact vision-language model

2.7GB / 12GB0.3GB disk~200.0 T/s
Ready to Run
View

Spark TTS 0.5B

Audiospark-tts

Controllable TTS with voice cloning, emotion and speed control

4GB / 12GB4GB disk
Ready to Run
View

StableFast3D V2

3Dsf3d

Rapid single-image 3D mesh generation

6GB / 12GB6GB disk
Ready to Run
View

PuLID FLUX

Imagepulid

Pure identity face insertion for FLUX

12GB / 12GB12GB disk
Ready to Run
View

BGE Reranker Large

Embeddingbge

Reranking model

4GB / 12GB4GB disk
Ready to Run
View

Multilingual E5 Large

Embeddinge5

100+ language embeddings

4GB / 12GB4GB disk
Ready to Run
View

TrOCR Large

Visiontrocr

Transformer OCR

6GB / 12GB6GB disk
Ready to Run
View

BGE-M3

Embeddingbge

Multi-lingual multi-granularity

4GB / 12GB4GB disk
Ready to Run
View

Arctic Embed L v2

Textarctic

Multilingual embedding model, 8K context

2.8GB / 12GB0.3GB disk~200.0 T/s
Ready to Run
View

Jina Embeddings v3

Embeddingjina

Task-specific embeddings

4GB / 12GB4GB disk
Ready to Run
View

Qwen 3 0.6B

Textqwen

Micro model for embedded systems

2.9GB / 12GB0.4GB disk~200.0 T/s
Ready to Run
View

Qwen3-TTS Base (0.6B)

Audioqwen-tts

Ultra-low latency streaming TTS (<97ms)

2GB / 12GB2GB disk
Ready to Run
View

PixArt-Σ

Imagepixart

Efficient DiT architecture

8GB / 12GB8GB disk
Ready to Run
View

PixArt-α

Imagepixart

Original PixArt model

8GB / 12GB8GB disk
Ready to Run
View

Parakeet v2 0.6B

Audioparakeet

Ultra-fast ASR, 60min in 1sec, word timestamps

4GB / 12GB4GB disk
Ready to Run
View

SAM

Visionsam

Segment Anything Model

8GB / 12GB8GB disk
Ready to Run
View

ChatTTS

Audiochattts

Conversational TTS with laughter/pauses

4GB / 12GB4GB disk
Ready to Run
View

Fish Speech 1.4

Audiofish

Highly expressive TTS

1.4GB / 12GB0.4GB disk
Ready to Run
View

ControlNet Canny

Imagecontrolnet

Edge-guided generation

8GB / 12GB8GB disk
Ready to Run
View

ControlNet Depth

Imagecontrolnet

Depth-guided generation

8GB / 12GB8GB disk
Ready to Run
View

ControlNet OpenPose

Imagecontrolnet

Pose-guided generation

8GB / 12GB8GB disk
Ready to Run
View

Segmind Vega

Imagesegmind

Distilled SDXL - 70% faster

6GB / 12GB6GB disk
Ready to Run
View

Distil-Whisper Large

Audiowhisper

Distilled for speed

4GB / 12GB4GB disk
Ready to Run
View

Florence 2 Large

Visionflorence

Microsoft vision foundation

6GB / 12GB6GB disk
Ready to Run
View

Whisper Medium

Audiowhisper

Balanced ASR model

5GB / 12GB5GB disk
Ready to Run
View

Whisper Large v3 Turbo

Audiowhisper

Fast high-quality ASR

6GB / 12GB6GB disk
Ready to Run
View

Tortoise TTS

Audiotortoise

High quality but slower TTS

6GB / 12GB6GB disk
Ready to Run
View

VALL-E

Audiovalle

Neural codec language model TTS

8GB / 12GB8GB disk
Ready to Run
View

IC-Light V2

Imageic-light

Relighting images with controllable illumination

6GB / 12GB6GB disk
Ready to Run
View

Whisper V3 Turbo FT

Audiowhisper

Fine-tuned turbo whisper for specialized domains

4GB / 12GB4GB disk
Ready to Run
View

Stable Diffusion 1.5

Imagesd

Community favorite, massive ecosystem

4GB / 12GB4GB disk
Ready to Run
View

SD 1.5 Inpainting

Imagesd

Standard inpainting model

4GB / 12GB4GB disk
Ready to Run
View

Instruct-Pix2Pix

Imagesd

Edit images via text instructions

4GB / 12GB4GB disk
Ready to Run
View

Riffusion

Audioriffusion

Stable Diffusion for music

8GB / 12GB8GB disk
Ready to Run
View

Stable Diffusion 2.1 (768)

Imagesd

Native 768px generation

6GB / 12GB6GB disk
Ready to Run
View

Stable Diffusion 2.1 Base

Imagesd

Native 512px generation

5GB / 12GB5GB disk
Ready to Run
View

SD 2.0 Depth

Imagesd

Structure preservation via depth map

6GB / 12GB6GB disk
Ready to Run
View

Parler TTS

Audioparler

Describe voice with text

6GB / 12GB6GB disk
Ready to Run
View

DeepFloyd IF L

Imagedeepfloyd

Mid-tier cascaded model

12GB / 12GB12GB disk
Ready to Run
View

Stable Fast 3D

3Dstable3d

Single image to 3D in 0.5s

8GB / 12GB8GB disk
Ready to Run
View

Stable Point Aware 3D

3Dstable3d

View-consistent 3D generation

10GB / 12GB10GB disk
Ready to Run
View

ECMWF AIFS

Scienceaifs

Operational AI weather forecasting

1.6GB / 12GB0.6GB disk
Ready to Run
View

NOAA AIGFS

Sciencegraphcast

AI Global Forecast System

1.6GB / 12GB0.6GB disk
Ready to Run
View

TRELLIS

3Dtrellis

Structured 3D asset generation

1.6GB / 12GB0.6GB disk
Ready to Run
View

OpenELM 1B

Textopenelm

Lightweight Apple LLM

3.5GB / 12GB0.6GB disk~200.0 T/s
Ready to Run
View

Bark

Audiobark

Multi-lingual with sound effects

8GB / 12GB8GB disk
Ready to Run
View

Orpheus 1B

Audioorpheus

Balanced TTS model

8GB / 12GB8GB disk
Ready to Run
View

Würstchen

Imagewuerstchen

Efficient latent diffusion

8GB / 12GB8GB disk
Ready to Run
View

Point-E

3Dpoint-e

Point cloud generation

10GB / 12GB10GB disk
Ready to Run
View

Zero-1-to-3

3Dzero123

Single image to 3D views

12GB / 12GB12GB disk
Ready to Run
View

Gemma 3 1B

Textgemma3

Lightweight text-only, 32K context

3.5GB / 12GB0.6GB disk~200.0 T/s
Ready to Run
View

Falcon 3 1B

Textfalcon3

Ultra-light deployment

3.5GB / 12GB0.6GB disk~200.0 T/s
Ready to Run
View

StarCoder2 1B

Textstarcoder

Compact code completion

3.5GB / 12GB0.6GB disk~200.0 T/s
Ready to Run
View

OuteTTS 1.0 1B

Audiooutetts

Open TTS with pure LLM approach, voice cloning

4GB / 12GB4GB disk
Ready to Run
View

TRELLIS Large

3Dtrellis

Scalable 3D generation with structured latents, large variant

10GB / 12GB10GB disk
Ready to Run
View

Stable Audio Open

Audiostable-audio

47s stereo audio generation (44.1kHz)

8GB / 12GB8GB disk
Ready to Run
View

Fish Speech 1.5

Audiofish-speech

1M+ hours multilingual TTS

8GB / 12GB8GB disk
Ready to Run
View

NeMo Parakeet

Audionemo

NVIDIA ASR model

8GB / 12GB8GB disk
Ready to Run
View

DINOv2 Giant

Visiondino

Self-supervised vision

10GB / 12GB10GB disk
Ready to Run
View

MetaVoice 1B

Audiometavoice

Emotion and prosody control

8GB / 12GB8GB disk
Ready to Run
View

AudioLDM 2

Audioaudioldm

Text-to-audio generation

10GB / 12GB10GB disk
Ready to Run
View

Llama 3.2 1B

Textllama

Ultra-light edge deployment

3.9GB / 12GB0.7GB disk~200.0 T/s
Ready to Run
View

Wan 2.1 1.3B

Videowan

Efficient consumer video gen (480p native)

8GB / 12GB8GB disk
Ready to Run
View

Aurora v2

Scienceaurora

Microsoft's atmospheric foundation model

1.8GB / 12GB0.8GB disk
Ready to Run
View

SSD-1B

Imagesegmind

50% smaller SDXL, 60% faster

8GB / 12GB8GB disk
Ready to Run
View

Shap-E

3Dshap-e

OpenAI 3D generation

12GB / 12GB12GB disk
Ready to Run
View

DeepSeek R1 Distill 1.5B

Textdeepseek

Tiny reasoning for edge

5.2GB / 12GB0.9GB disk~200.0 T/s
Ready to Run
View

Hunyuan-DiT

Imagehunyuan

Tencent DiT foundation model

8GB / 12GB8GB disk
Ready to Run
View

Stella EN 1.5B

Embeddingstella

State-of-the-art embeddings

8GB / 12GB8GB disk
Ready to Run
View

Instructor XL

Embeddinginstructor

Instruction-based embeddings

8GB / 12GB8GB disk
Ready to Run
View

MusicGen Medium

Audiomusicgen

Balanced music generation

12GB / 12GB12GB disk
Ready to Run
View

MusicGen Melody

Audiomusicgen

Melody-conditioned generation

12GB / 12GB12GB disk
Ready to Run
View

AudioLDM 2 Large

Audioaudioldm

High-quality audio gen

12GB / 12GB12GB disk
Ready to Run
View

Qwen 2.5 Coder 1.5B

Textqwen

Ultra-compact coding model

5.2GB / 12GB0.9GB disk~200.0 T/s
Ready to Run
View

GTE Qwen2 1.5B

Textgte

High-quality text embeddings, 8K context

5.2GB / 12GB0.9GB disk~200.0 T/s
Ready to Run
View

Qwen 2.5 1.5B

Textqwen

Edge deployment ready

5.3GB / 12GB0.9GB disk~200.0 T/s
Ready to Run
View

Whisper Large v3

Audiowhisper

Best open ASR model

10GB / 12GB10GB disk
Ready to Run
View

Moondream1

Visionmoondream

Original Moondream

4GB / 12GB4GB disk
Ready to Run
View

Dia TTS 1.6B

Audiodia

Hyper-realistic dialogue TTS, emotions, voice cloning

10GB / 12GB10GB disk
Ready to Run
View

Qwen 3 1.7B

Textqwen

On-device assistant specialist

5.6GB / 12GB1.0GB disk~200.0 T/s
Ready to Run
View

Qwen3-TTS VoiceDesign (1.7B)

Audioqwen-tts

Zero-shot voice design from text descriptions

4GB / 12GB4GB disk
Ready to Run
View

Qwen3-TTS CustomVoice (1.7B)

Audioqwen-tts

Few-shot voice cloning with style control

4GB / 12GB4GB disk
Ready to Run
View

SmolLM 1.7B

Textsmollm

Hugging Face small model

5.6GB / 12GB1.0GB disk~200.0 T/s
Ready to Run
View

SmolLM2 1.7B

Textsmollm

Capable tiny model for constrained devices

5.6GB / 12GB1.0GB disk~200.0 T/s
Ready to Run
View

Moondream2

Visionmoondream

Tiny vision-language model

6GB / 12GB6GB disk
Ready to Run
View

SD 3 Medium

Imagesd

First open weight MMDiT model

10GB / 12GB10GB disk
Ready to Run
View

Lumina-Next-SFT

Imagelumina

Efficient high-res generation

8GB / 12GB8GB disk
Ready to Run
View

Granite 3.0 2B

Textgranite

Compact robust model

6.1GB / 12GB1.2GB disk~200.0 T/s
Ready to Run
View

Granite 3.3 2B

Textgranite

Compact enterprise model

6.1GB / 12GB1.2GB disk~200.0 T/s
Ready to Run
View

InternVL3 2B

Textinternvl

Compact vision model

6.1GB / 12GB1.2GB disk~200.0 T/s
Ready to Run
View

SmolVLM 2B

Textsmolvlm

Tiny but capable vision-language model

6.1GB / 12GB1.2GB disk~200.0 T/s
Ready to Run
View

Lumina Image 2.0

Imagelumina

Unified multimodal generation framework

8GB / 12GB8GB disk
Ready to Run
View

LTX-Video 0.9.7

Videoltx

Fast real-time video gen, 30fps

8GB / 12GB8GB disk
Ready to Run
View

Qwen2-VL 2B

Visionqwen

Compact vision model

8GB / 12GB8GB disk
Ready to Run
View

SD 3.5 Medium

Imagesd

Efficient balanced model for consumer GPUs

8GB / 12GB8GB disk
Ready to Run
View

Canary Qwen 2.5B

Audiocanary

Multilingual ASR with Qwen backbone

6GB / 12GB6GB disk
Ready to Run
View

Gemma 2 2B

Textgemma

Lightweight deployment

8GB / 12GB1.6GB disk~200.0 T/s
Ready to Run
View

Mamba-2 2.7B

Textmamba

Pure State Space Model (SSM)

8.1GB / 12GB1.6GB disk~200.0 T/s
Ready to Run
View

Ministral 3 3B

Textmistral

Lightweight mobile model with vision

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

RedNote OCR

Visionrednote

Character recognition specialist

2.8GB / 12GB1.8GB disk
Ready to Run
View

Granite 3.0 3B

Textgranite

Small enterprise model

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

OpenELM 3B

Textopenelm

Apple open language model

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

StableLM 3B

Textstablelm

Efficient chat model

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

Stable Code 3B

Textstablelm

Code generation specialist

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

StarCoder2 3B

Textstarcoder

Small code specialist

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

RedPajama INCITE 3B

Textredpajama

Smaller open model

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

Orpheus 3B

Audioorpheus

Llama-based TTS flagship

12GB / 12GB12GB disk
Ready to Run
View

Falcon 3 3B

Textfalcon3

Compact edge model

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

SmolLM3 3B

Textsmollm

Multilingual, dual-mode thinking, 128K context

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

Voxtral Mini 3B

Audiovoxtral

Fast speech-to-text, 13 languages

4GB / 12GB4GB disk
Ready to Run
View

Qwen 2.5 VL 3B

Textqwen

Compact vision-language model

8.6GB / 12GB1.8GB disk~182.0 T/s
Ready to Run
View

Open-Sora 2.0

Videoopen-sora

Open-source Sora replica, 720p, many modes

10GB / 12GB10GB disk
Ready to Run
View

Hunyuan3D 2.0

3Dhunyuan3d

High-res textured 3D asset generation from images/text

12GB / 12GB12GB disk
Ready to Run
View

Qwen 2.5 3B

Textqwen

Lightweight and fast

8.7GB / 12GB1.9GB disk~176.7 T/s
Ready to Run
View

Qwen 2.5 Coder 3B

Textqwen

Lightweight coder

8.7GB / 12GB1.9GB disk~176.7 T/s
Ready to Run
View

Llama-3.2 3B Abliterated

Textllama

Safety guardrails removed

8.9GB / 12GB1.9GB disk~170.6 T/s
Ready to Run
View

Llama 3.2 3B

Textllama

Mobile-optimized small model

8.9GB / 12GB1.9GB disk~170.1 T/s
Ready to Run
View

Yue Music V2

Audioyue

AI music composition with lyrics and genre control

8GB / 12GB8GB disk
Ready to Run
View

Phi-4 Mini

Textphi

Math-optimized compact model, mobile-ready

10.8GB / 12GB2.3GB disk~143.7 T/s
Ready to Run
View

OmniGen V1

Imageomnigen

Unified image generation without extra modules

10GB / 12GB10GB disk
Ready to Run
View

Phi-3.5 Mini (3.8B)

Textphi

High IQ for its size

10.8GB / 12GB2.3GB disk~142.9 T/s
Ready to Run
View

FLUX.2 [klein] 4B

Imageflux

Ultra-fast edge/laptop model

6GB / 12GB6GB disk
Ready to Run
View

H2O Danube 3 4B

Textdanube

Mobile-first efficient model

11.1GB / 12GB2.4GB disk~136.5 T/s
Ready to Run
View

InternVL2 4B

Visioninternvl

Compact vision model

10GB / 12GB10GB disk
Ready to Run
View

Gemma 3 4B

Textgemma3

Compact multimodal, 128K context

11.1GB / 12GB2.4GB disk~136.5 T/s
Ready to Run
View

Nemotron Mini 4B

Textnemotron

Compact edge model optimized for NVIDIA GPUs

11.1GB / 12GB2.4GB disk~136.5 T/s
Ready to Run
View

Qwen 3 4B

Textqwen

High performance mobile model

11.3GB / 12GB2.5GB disk~133.2 T/s
Ready to Run
View

Kolors

Imagekolors

Chinese bilingual image generation

12GB / 12GB12GB disk
Ready to Run
View

SDXL Base 1.0

Imagesd

The gold standard for fine-tuning

12GB / 12GB12GB disk
Ready to Run
View

SDXL Turbo

Imagesd

Real-time single-step generation

12GB / 12GB12GB disk
Ready to Run
View

SDXL Lightning

Imagesd

2-step and 4-step distilled UNet

12GB / 12GB12GB disk
Ready to Run
View

SDXL Inpainting

Imagesd

Dedicated inpainting specialist

12GB / 12GB12GB disk
Ready to Run
View

CosXL

Imagesd

Instruction-editing fine-tune

12GB / 12GB12GB disk
Ready to Run
View

Playground v2.5

Imageplayground

Aesthetic-focused generation

12GB / 12GB12GB disk
Ready to Run
View

AuraFlow v0.3

Imageauraflow

Rectified Flow open source generator

12GB / 12GB12GB disk
Ready to Run
View

Animagine XL 3.1

Imageanimagine

Anime specialist SDXL

12GB / 12GB12GB disk
Ready to Run
View

Animagine XL 3.0

Imageanimagine

Anime image generation

12GB / 12GB12GB disk
Ready to Run
View

Dreamshaper 8

Imagedreamshaper

Versatile SDXL variant

12GB / 12GB12GB disk
Ready to Run
View

SDXL Lightning 4-Step

Imagesd

Ultra-fast 4-step generation

10GB / 12GB10GB disk
Ready to Run
View

SDXL Lightning 2-Step

Imagesd

Fastest 2-step variant

10GB / 12GB10GB disk
Ready to Run
View

BioMistral 7B

Sciencemistral

Medical adaptation of Mistral

5.2GB / 12GB4.2GB disk
Ready to Run
View

Pyramid Flow 7B

Videopyramid

Efficient pyramidal flow matching

5.2GB / 12GB4.2GB disk
Ready to Run
View

NV-Embed-v2

Embeddingnv-embed

Top MTEB leaderboard performer

5.2GB / 12GB4.2GB disk
Ready to Run
View

LLaVA 1.5 7B

Visionllava

Efficient vision-language

12GB / 12GB12GB disk
Ready to Run
View

Janus Pro 7B

Imagejanus

Unified understanding and generation model

10GB / 12GB10GB disk
Ready to Run
View

MiniCPM-V 2.6

Visionminicpm

Strong OCR and multimodal features

12GB / 12GB12GB disk
Ready to Run
View

RFM-1

Roboticsrfm

Covariant physics world model

5.8GB / 12GB4.8GB disk
Ready to Run
View

Llama-3.1 Omni 8B

Audiollama

Low latency speech interaction

5.8GB / 12GB4.8GB disk
Ready to Run
View

Xiaomi VLM

Visionxiaomi

Vision language model

5.8GB / 12GB4.8GB disk
Ready to Run
View

Voxtral Small 8B

Audiovoxtral

High-accuracy multilingual transcription

8GB / 12GB8GB disk
Ready to Run
View

FramePack F1

Videoframepack

Fits long video gen in 6GB VRAM via progressive packing

6GB / 12GB6GB disk
Ready to Run
View

FLUX.2 [klein] 9B

Imageflux

Efficient consumer GPU model

12GB / 12GB12GB disk
Ready to Run
View

AlphaFold 3

Sciencealphafold

Predicts protein/DNA/RNA structures

16GB / 12GB16GB disk25% offload
Runs Slow
View

Med-Gemini 2

Sciencegemini

Multimodal medical flagship

16GB / 12GB16GB disk25% offload
Runs Slow
View

Open-Sora 1.2

Videoopensora

Sora reproduction

16GB / 12GB16GB disk25% offload
Runs Slow
View

MARS5 TTS

Audiomars5

Prosody-focused voice cloning

20GB / 12GB20GB disk40% offload
Runs Slow
View

SVD XT 1.1

Videosvd

Optimized Image-to-Video (25 frames)

16GB / 12GB16GB disk25% offload
Runs Slow
View

SVD

Videosvd

Base Image-to-Video (14 frames)

16GB / 12GB16GB disk25% offload
Runs Slow
View

Stable Video Diffusion XT

Videosvd

Image-to-video with extended frames

16GB / 12GB16GB disk25% offload
Runs Slow
View

Wonder3D

3Dwonder3d

Single image to 3D mesh

14GB / 12GB14GB disk14% offload
Runs Slow
View

ModelScope Text2Video

Videomodelscope

Alibaba video generation

16GB / 12GB16GB disk25% offload
Runs Slow
View

ZeroScope V2

Videozeroscope

Watermark-free video gen

16GB / 12GB16GB disk25% offload
Runs Slow
View

LTX-Video

Videoltx

Lightricks video gen

16GB / 12GB16GB disk25% offload
Runs Slow
View

CogVideoX 2B

Videocogvideo

Efficient video gen

16GB / 12GB16GB disk25% offload
Runs Slow
View

VideoCrafter1

Videovideocrafter

Original VideoCrafter

16GB / 12GB16GB disk25% offload
Runs Slow
View

SeamlessM4T

Audioseamless

Multilingual translation model

16GB / 12GB16GB disk25% offload
Runs Slow
View

YuE Music

Audioyue

Chinese music generation

16GB / 12GB16GB disk25% offload
Runs Slow
View

VideoCrafter2

Videovideocrafter

Text and image to video

20GB / 12GB20GB disk40% offload
Runs Slow
View

LaVie

Videolavie

High-quality video synthesis

20GB / 12GB20GB disk40% offload
Runs Slow
View

Craftsman

3Dcraftsman

Text/image to 3D generation

16GB / 12GB16GB disk25% offload
Runs Slow
View

MusicGen Large

Audiomusicgen

Text-to-music generation

16GB / 12GB16GB disk25% offload
Runs Slow
View

DeepFloyd IF XL

Imagedeepfloyd

Cascaded diffusion model

20GB / 12GB20GB disk40% offload
Runs Slow
View

Wan 2.2 TI2V 5B

Videowan

Unified T2V/I2V efficiency model

16GB / 12GB16GB disk25% offload
Runs Slow
View

CogVideoX 5B

Videocogvideo

Text-to-video generation

24GB / 12GB24GB disk50% offload
Runs Slow
View

Gemma 3n E2B

Textgemma3n

Ultra-light multimodal, 2B effective memory footprint

12.6GB / 12GB3.0GB disk~80.4 T/s5% offload
Runs Slow
View

CogVideoX 1.5 5B

Videocogvideo

Improved video gen with 10s 720p output

14GB / 12GB14GB disk14% offload
Runs Slow
View

CogVideoX 1.5 5B I2V

Videocogvideo

Image-to-video with 10s output

14GB / 12GB14GB disk14% offload
Runs Slow
View

Stable Cascade

Imagecascade

3-stage compression architecture (C+B)

16GB / 12GB16GB disk25% offload
Runs Slow
View

Phi-4 Multimodal

Textphi

Vision + speech multimodal small model

14.6GB / 12GB3.4GB disk~62.8 T/s18% offload
Runs Slow
View

ChatGLM3 6B

Textglm

Bilingual chat model from Tsinghua

15.2GB / 12GB3.6GB disk~56.6 T/s21% offload
Runs Slow
View

ChatGLM2 6B

Textglm

Second generation GLM chat

15.2GB / 12GB3.6GB disk~56.6 T/s21% offload
Runs Slow
View

Yi 1.5 6B

Textyi

Bilingual general-purpose

15.2GB / 12GB3.6GB disk~56.6 T/s21% offload
Runs Slow
View

CogView4

Imagecogview

High-resolution text-to-image, Chinese + English, up to 2048x2048

13GB / 12GB13GB disk8% offload
Runs Slow
View

Pythia 6.9B

Textpythia

Mid-size Pythia model

17.6GB / 12GB4.1GB disk~43.2 T/s32% offload
Runs Slow
View

Qwen2.5 Audio Instruct

Audioqwen

Voice chat and audio analysis

14GB / 12GB14GB disk14% offload
Runs Slow
View

Qwen2 Audio 7B

Audioqwen

Audio understanding foundation

14GB / 12GB14GB disk14% offload
Runs Slow
View

xLAM 7B FC

Textxlam

Optimized for efficient function calling

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Gorilla OpenFunctions v2

Textgorilla

DeepSeek-based API calling specialist

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

RWKV-6 World 7B

Textrwkv

RNN with Transformer-level performance

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

SeaLLM v3 7B

Textseallm

Southeast Asia languages specialist

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

DeepSeek Prover V1.5

Textdeepseek

Theorem proving specialist

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

InternLM 2.5 7B

Textinternlm

Efficient bilingual model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Baichuan2 7B

Textbaichuan

Efficient Chinese LLM

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

OLMo 2 7B

Textolmo

Efficient fully open model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Command R 7B

Textcommand

Efficient RAG specialist

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Falcon 7B

Textfalcon

Compact Falcon model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

StarCoder2 7B

Textstarcoder

Efficient code model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

CodeGen2.5 7B

Textcodegen

Salesforce code model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

XGen 7B

Textxgen

Long sequence model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

RedPajama INCITE 7B

Textredpajama

Open reproduction model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Hermes 2 Pro Mistral 7B

Texthermes

Function calling specialist

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

LLaVA 1.6 Mistral 7B

Visionllava

Mistral-based vision model

16GB / 12GB16GB disk25% offload
Runs Slow
View

O1 Mini Distill

Texto1

Distilled reasoning model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Falcon 3 7B

Textfalcon3

General purpose with multimodal support

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

DeepSeek R1 7B

Textdeepseek

Distilled reasoning model, Qwen-based

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

OLMo 3 7B

Textolmo

Fully open-source with training data and logs

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

MiMo 7B

Textmimo

Compact reasoning model from Xiaomi

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

Qwen 2.5 Math 7B

Textqwen

Specialized mathematical reasoning

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

MAP-Neo 7B

Textmap-neo

Fully open-source bilingual (EN/ZH) model

17.7GB / 12GB4.2GB disk~42.4 T/s32% offload
Runs Slow
View

E5-Mistral 7B

Embeddinge5

LLM-based embeddings

16GB / 12GB16GB disk25% offload
Runs Slow
View

SFR Embedding Mistral

Embeddingsfr

Mistral-based embeddings

16GB / 12GB16GB disk25% offload
Runs Slow
View

Molmo 7B

Visionmolmo

Highly efficient vision

16GB / 12GB16GB disk25% offload
Runs Slow
View

Mistral 7B v0.3

Textmistral

Classic efficient model

18.1GB / 12GB4.3GB disk~40.2 T/s34% offload
Runs Slow
View

Mathstral 7B

Textmistral

Math and STEM specialist

18.1GB / 12GB4.3GB disk~40.2 T/s34% offload
Runs Slow
View

Phi-3 Small (7B)

Textphi

Efficient general purpose

18.3GB / 12GB4.4GB disk~39.0 T/s34% offload
Runs Slow
View

Eagle 7B

Textrwkv

RWKV-5 based efficient attention-free

19.5GB / 12GB4.5GB disk~36.3 T/s38% offload
Runs Slow
View

Qwen 2.5 7B

Textqwen

Efficient general purpose

19.6GB / 12GB4.6GB disk~35.7 T/s39% offload
Runs Slow
View

Qwen 2.5 Coder 7B

Textqwen

Efficient code assistant

19.6GB / 12GB4.6GB disk~35.7 T/s39% offload
Runs Slow
View

GTE-Qwen2 7B

Embeddinggte

Alibaba LLM embeddings

16GB / 12GB16GB disk25% offload
Runs Slow
View

Qwen2-VL 7B

Visionqwen

Efficient Qwen vision

16GB / 12GB16GB disk25% offload
Runs Slow
View

EXAONE 3.0 7.8B

Textexaone

LG AI's bilingual English/Korean

19.9GB / 12GB4.7GB disk~34.4 T/s40% offload
Runs Slow
View

Ministral 3 8B

Textmistral

Balanced edge model with vision

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

DeepSeek R1 Distill 8B

Textdeepseek

Small but capable reasoner

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

Aya Expanse 8B

Textcommand

Compact multilingual

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

Granite 3.0 Guardian

Textgranite

IBM risk detection & safety

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

Dolphin 2.9 Llama 3

Textdolphin

Popular uncensored fine-tune

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

Skywork Reward

Textskywork

Reward model for RLHF

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

Granite Code 8B

Textgranite

Code specialist from IBM

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

InternVL2 8B

Visioninternvl

Efficient multimodal

16GB / 12GB16GB disk25% offload
Runs Slow
View

Fuyu 8B

Visionfuyu

Adept multimodal model

16GB / 12GB16GB disk25% offload
Runs Slow
View

Idefics2 8B

Visionidefics

HuggingFace vision-language

16GB / 12GB16GB disk25% offload
Runs Slow
View

Gemma 3n E4B

Textgemma3n

On-device multimodal (text/image/audio/video), 3B effective memory

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

Granite 3.3 8B

Textgranite

Enterprise with speech and vision

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

InternLM3 8B

Textinternlm

Advanced reasoning and long-context, deep thinking

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

MiniCPM-o 2.6

Textminicpm

Omni-modal: text, image, video, audio, live streaming

20.2GB / 12GB4.8GB disk~33.1 T/s41% offload
Runs Slow
View

SD 3.5 Large ControlNet

Imagesd3

Controlled generation with canny/depth/blur

16GB / 12GB16GB disk25% offload
Runs Slow
View

Llama 3.1 8B

Textllama

Best small model for most tasks

20.3GB / 12GB4.8GB disk~32.8 T/s41% offload
Runs Slow
View

Qwen 3 8B

Textqwen

Universal edge model, MCP native

20.4GB / 12GB4.9GB disk~32.4 T/s41% offload
Runs Slow
View

Granite 3.0 8B

Textgranite

IBM enterprise-grade robust model

20.4GB / 12GB4.9GB disk~32.4 T/s41% offload
Runs Slow
View

Ministral 8B

Textmistral

Edge-focused powerful Mistral

20.4GB / 12GB4.9GB disk~32.4 T/s41% offload
Runs Slow
View

SD 3.5 Large

Imagesd

Flagship MMDiT architecture, superior prompt adherence

16GB / 12GB16GB disk25% offload
Runs Slow
View

SD 3.5 Large Turbo

Imagesd

Distilled 4-step generation version of Large

16GB / 12GB16GB disk25% offload
Runs Slow
View

HunyuanVideo 1.5

Videohunyuan

State-of-art open video gen, 720p, t2v + i2v

14GB / 12GB14GB disk14% offload
Runs Slow
View

CodeGemma 7B

Textgemma

Code-focused Gemma

21GB / 12GB5.1GB disk~30.0 T/s43% offload
Runs Slow
View

Yi 1.5 9B

Textyi

Efficient bilingual

22.5GB / 12GB5.3GB disk~27.4 T/s47% offload
Runs Slow
View

Yi Coder 9B

Textyi

Code specialist

22.5GB / 12GB5.3GB disk~27.4 T/s47% offload
Runs Slow
View

RecurrentGemma 9B

Textgemma

Griffin-based RNN-Transformer

22.8GB / 12GB5.4GB disk~26.5 T/s47% offload
Runs Slow
View

CodeGeex4 9B

Textcodegeex

Multilingual code generation

22.8GB / 12GB5.4GB disk~26.5 T/s47% offload
Runs Slow
View

GLM-4 Voice

Textglm

End-to-end speech chatbot, emotion control

22.8GB / 12GB5.4GB disk~26.5 T/s47% offload
Runs Slow
View

Gemma 2 9B

Textgemma

Efficient mid-size

23.1GB / 12GB5.5GB disk~25.6 T/s48% offload
Runs Slow
View

GLM-4 9B

Textglm

Tsinghua bilingual model

23.2GB / 12GB5.6GB disk~25.3 T/s48% offload
Runs Slow
View

Mochi 1 Preview

Videomochi

Genmo's video model

24GB / 12GB24GB disk50% offload
Runs Slow
View

Ideogram V3

Imageideogram

Text-in-image specialist

20GB / 12GB20GB disk40% offload
Runs Slow
View

Ideogram V3 Turbo

Imageideogram

Fast text rendering

16GB / 12GB16GB disk25% offload
Runs Slow
View

Pyramid Flow

Videopyramid

Autoregressive video diffusion

24GB / 12GB24GB disk50% offload
Runs Slow
View

Falcon 3 10B

Textfalcon3

Enhanced science, math, and coding

24.3GB / 12GB6.0GB disk~22.6 T/s51% offload
Runs Slow
View

Llama 3.2 11B Vision

Textllama

Compact multimodal model

26.2GB / 12GB6.4GB disk~20.0 T/s54% offload
Runs Slow
View

SOLAR 10.7B

Textsolar

Depth-upscaled model

26.4GB / 12GB6.4GB disk~19.8 T/s55% offload
Runs Slow
View

Solar Mini

Textsolar

Compact depth-upscaled

26.4GB / 12GB6.4GB disk~19.8 T/s55% offload
Runs Slow
View

Falcon 11B

Textfalcon

Efficient Falcon variant

26.8GB / 12GB6.6GB disk~19.0 T/s55% offload
Runs Slow
View

Kandinsky 3

Imagekandinsky

Multilingual text-to-image

16GB / 12GB16GB disk25% offload
Runs Slow
View

Mistral Nemo 12B

Textmistral

Compact and capable

29.3GB / 12GB7.2GB disk~16.2 T/s59% offload
Runs Slow
View

FLUX.1 [kontext]

Imageflux

Specialized in-context editing & consistency

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX1.1 [pro] Ultra

Imageflux

4MP Raw/Ultra modes, API only

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX1.1 [pro]

Imageflux

6x faster than 1.0, superior prompt adherence

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX.1 [pro]

Imageflux

Original flagship API model

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX.1 [dev]

Imageflux

SOTA open weights image generator

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX.1 [schnell]

Imageflux

Fastest 4-step distilled FLUX

16GB / 12GB16GB disk25% offload
Runs Slow
View

FLUX.1 Fill [dev]

Imageflux

Inpainting/outpainting specialist

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX.1 Canny [dev]

Imageflux

Structure guidance via Canny edges

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX.1 Depth [dev]

Imageflux

Structure guidance via Depth maps

24GB / 12GB24GB disk50% offload
Runs Slow
View

FLUX.1 Redux [dev]

Imageflux

Image mixing and variation adapter

24GB / 12GB24GB disk50% offload
Runs Slow
View

NVIDIA Cosmos 1 XL

Videocosmos

Physical world foundation model

24GB / 12GB24GB disk50% offload
Runs Slow
View

Pythia 12B

Textpythia

Research model suite

29.3GB / 12GB7.2GB disk~16.2 T/s59% offload
Runs Slow
View

OASST Pythia 12B

Textoasst

Open Assistant model

29.3GB / 12GB7.2GB disk~16.2 T/s59% offload
Runs Slow
View

Gemma 3 12B

Textgemma3

Balanced multimodal model, 128K context

29.3GB / 12GB7.2GB disk~16.2 T/s59% offload
Runs Slow
View

Jamba Mini

Textjamba

Mamba architecture, 256K context

29.3GB / 12GB7.2GB disk~16.2 T/s59% offload
Runs Slow
View

FLUX.1 Tools

Imageflux

Suite of editing tools (fill, depth, canny, redux)

14GB / 12GB14GB disk14% offload
Runs Slow
View

Pixtral 12B

Visionmistral

Mistral multimodal native

24GB / 12GB24GB disk50% offload
Runs Slow
View

NexusRaven V2 13B

Textnexusraven

Zero-shot tool use specialist

31.9GB / 12GB7.8GB disk~14.0 T/s62% offload
Runs Slow
View

Fugaku-LLM 13B

Textfugaku

Japanese scientific model

31.9GB / 12GB7.8GB disk~14.0 T/s62% offload
Runs Slow
View

Seed LLM

Textseed

ByteDance research model

31.9GB / 12GB7.8GB disk~14.0 T/s62% offload
Runs Slow
View

Skywork 13B

Textskywork

Open bilingual model

31.9GB / 12GB7.8GB disk~14.0 T/s62% offload
Runs Slow
View

Baichuan2 13B

Textbaichuan

Open Chinese foundation model

31.9GB / 12GB7.8GB disk~14.0 T/s62% offload
Runs Slow
View

OLMo 3 13B

Textolmo

Mid-size truly open model

31.9GB / 12GB7.8GB disk~14.0 T/s62% offload
Runs Slow
View

LLaVA 1.5 13B

Visionllava

Mid-size vision-language

16GB / 12GB16GB disk25% offload
Runs Slow
View

OLMo 2 13B

Textolmo

Fully open research model

32.3GB / 12GB8.0GB disk~13.5 T/s63% offload
Runs Slow
View

Ministral 3 14B

Textmistral

Dense edge flagship with vision

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

DeepSeek R1 Distill 14B

Textdeepseek

Compact reasoning model

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

Phi-4 (14B)

Textphi

Latest Phi with exceptional reasoning

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

Phi-3 Medium (14B)

Textphi

Balanced Phi model

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

Wan 2.1 I2V 14B (480P)

Videowan

Stable low-res image animation

24GB / 12GB24GB disk50% offload
Runs Slow
View

Xiaomi 14B

Textxiaomi

Xiaomi edge flagship

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

Phi-4 Reasoning Plus

Textphi

Chain-of-thought reasoning model

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

InternVL3 14B

Textinternvl

Efficient multimodal understanding

34.4GB / 12GB8.4GB disk~12.2 T/s65% offload
Runs Slow
View

SkyReels V2 I2V

Videoskyreels

Infinite-length video with camera control

18GB / 12GB18GB disk33% offload
Runs Slow
View

Cosmos 1 Video 14B

Videocosmos

Physical world simulation video model

20GB / 12GB20GB disk40% offload
Runs Slow
View

Qwen 2.5 14B

Textqwen

Strong mid-size model

35.5GB / 12GB8.8GB disk~11.3 T/s66% offload
Runs Slow
View

Qwen 2.5 Coder 14B

Textqwen

Strong coding in smaller package

35.5GB / 12GB8.8GB disk~11.3 T/s66% offload
Runs Slow
View

Qwen 3 14B

Textqwen

Perfect mid-range daily driver

35.5GB / 12GB8.8GB disk~11.3 T/s66% offload
Runs Slow
View

StarCoder2 15B

Textstarcoder

Code specialist

35.9GB / 12GB9.0GB disk~11.0 T/s67% offload
Runs Slow
View

StarCoder Base

Textstarcoder

Foundation code model

37.7GB / 12GB9.3GB disk~10.3 T/s68% offload
Runs Slow
View

FLUX.2 [max]

Imageflux

Flagship professional model (2026 SOTA)

32GB / 12GB32GB disk63% offload
Runs Slow
View

FLUX.2 [dev]

Imageflux

Open-weight research flagship

24GB / 12GB24GB disk50% offload
Runs Slow
View

MOSS

Textmoss

First open Chinese conversational LLM

38.4GB / 12GB9.6GB disk~9.8 T/s69% offload
Runs Slow
View

CogVLM 17B

Visioncogvlm

Original CogVLM

20GB / 12GB20GB disk40% offload
Runs Slow
View

HiDream I1 Full

Imagehidream

High-quality text-to-image with 4 LLM backbone

20GB / 12GB20GB disk40% offload
Runs Slow
View

HiDream I1 Fast

Imagehidream

16-step fast generation variant

18GB / 12GB18GB disk33% offload
Runs Slow
View

CogVLM2 19B

Visioncogvlm

Powerful vision-language

24GB / 12GB24GB disk50% offload
Runs Slow
View

LTX-Video 2

Videoltx

Integrated audio-video gen, native 4K 50fps

32GB / 12GB32GB disk63% offload
Runs Slow
View

Recraft V3

Imagerecraft

High-quality realistic generations

24GB / 12GB24GB disk50% offload
Runs Slow
View

Recraft V3 SVG

Imagerecraft

Vector generation specialist

24GB / 12GB24GB disk50% offload
Runs Slow
View

InternVL2 26B

Visioninternvl

Multimodal flagship

32GB / 12GB32GB disk63% offload
Runs Slow
View

Qwen 3 Omni

Audioqwen

End-to-end voice/text/vision interaction

19.5GB / 12GB18.0GB disk38% offload
Runs Slow
View

RT-2-X

Roboticsrt

Google VLA (Vision-Language-Action)

35.8GB / 12GB33.0GB disk66% offload
Runs Slow
View

HunyuanVideo

Videohunyuan

Tencent SOTA open video generation

48GB / 12GB48GB disk
Needs Upgrade
Need 36GB more

SkyReels V1

Videoskyreels

Human-centric cinematic video

48GB / 12GB48GB disk
Needs Upgrade
Need 36GB more

Wan 2.1 14B

Videowan

Cinema-quality generation (Supports 720p)

40GB / 12GB40GB disk
Needs Upgrade
Need 28GB more

Wan 2.1 I2V 14B (720P)

Videowan

High-res image animation flagship

40GB / 12GB40GB disk
Needs Upgrade
Need 28GB more

Wan 2.2 T2V A14B

VideoMoEwan

MoE-powered high fidelity (2x14B Experts)

40GB / 12GB40GB disk
Needs Upgrade
Need 28GB more

Wan 2.2 I2V A14B

VideoMoEwan

MoE-powered image animation

40GB / 12GB40GB disk
Needs Upgrade
Need 28GB more

InternLM 2.5 20B

Textinternlm

Strong Chinese/English

47.3GB / 12GB11.9GB disk
Needs Upgrade
Need 35GB more

Kimi K2

Textkimi

Multimodal with 128K context from Moonshot

47.6GB / 12GB12.0GB disk
Needs Upgrade
Need 36GB more

Kimi K1.5

Textkimi

Long context specialist

47.6GB / 12GB12.0GB disk
Needs Upgrade
Need 36GB more

InternLM2 20B Chat

Textinternlm

Powerful Chinese/English LLM

47.6GB / 12GB12.0GB disk
Needs Upgrade
Need 36GB more

Codestral 22B

Textmistral

Mistral's code specialist

52.8GB / 12GB13.2GB disk
Needs Upgrade
Need 41GB more

Solar Pro

Textsolar

Enterprise Solar model

52.8GB / 12GB13.2GB disk
Needs Upgrade
Need 41GB more

Codestral 25.01

Textmistral

Updated code generation flagship

52.8GB / 12GB13.2GB disk
Needs Upgrade
Need 41GB more

Mistral Small (24B)

Textmistral

Efficient enterprise model

57.9GB / 12GB14.4GB disk
Needs Upgrade
Need 46GB more

Gemma 3 27B

Textgemma3

Flagship multimodal, 128K context, 140+ languages

64.6GB / 12GB16.2GB disk
Needs Upgrade
Need 53GB more

Gemma 2 27B

Textgemma

Google's best open model

64.9GB / 12GB16.3GB disk
Needs Upgrade
Need 53GB more

Qwen 3 30B (MoE)

TextMoEqwen

Punching way above its weight class

71.3GB / 12GB18.0GB disk
Needs Upgrade
Need 59GB more

DeepSeek R1 Distill 32B

Textdeepseek

Efficient reasoning distillation

76.5GB / 12GB19.2GB disk
Needs Upgrade
Need 65GB more

Aya Expanse 32B

Textcommand

Multilingual specialist

76.5GB / 12GB19.2GB disk
Needs Upgrade
Need 65GB more

OLMo 3 32B

Textolmo

Fully open-source with training data

76.5GB / 12GB19.2GB disk
Needs Upgrade
Need 65GB more

Marco-o1

Textmarco

Open reasoning model

76.5GB / 12GB19.2GB disk
Needs Upgrade
Need 65GB more

GLM-4.7 Thinking

Textglm

Advanced reasoning with thinking mode

76.5GB / 12GB19.2GB disk
Needs Upgrade
Need 65GB more

Qwen 2.5 VL 32B

Textqwen

Advanced vision-language understanding

76.5GB / 12GB19.2GB disk
Needs Upgrade
Need 65GB more

Qwen 2.5 32B

Textqwen

The "Goldilocks" model - great balance

77.2GB / 12GB19.5GB disk
Needs Upgrade
Need 65GB more

Qwen 2.5 Coder 32B

Textqwen

State-of-the-art code generation

77.2GB / 12GB19.5GB disk
Needs Upgrade
Need 65GB more

Qwen 3 32B

Textqwen

Dense SOTA for its size category

77.2GB / 12GB19.5GB disk
Needs Upgrade
Need 65GB more

Qwen 3 Coder 32B

Textqwen

Self-correcting code specialist

77.2GB / 12GB19.5GB disk
Needs Upgrade
Need 65GB more

QwQ 32B Preview

Textqwen

Qwen reasoning model (o1-like)

77.2GB / 12GB19.5GB disk
Needs Upgrade
Need 65GB more

DeepSeek Coder 33B

Textdeepseek

Strong dense code model

79.1GB / 12GB19.8GB disk
Needs Upgrade
Need 67GB more

Agent Coder 33B

Textdeepseek

Self-correcting coding agent

79.1GB / 12GB19.8GB disk
Needs Upgrade
Need 67GB more

WhiteRabbitNeo 33B

Textwhiterabbit

Cybersecurity offensive/defensive spec

79.1GB / 12GB19.8GB disk
Needs Upgrade
Need 67GB more

Nous Capybara 34B

Textnous

Conversational expert

81.7GB / 12GB20.4GB disk
Needs Upgrade
Need 70GB more

Yi 1.5 34B

Textyi

Strong bilingual model

82.3GB / 12GB20.6GB disk
Needs Upgrade
Need 70GB more

LLaVA 1.6 34B

Visionllava

State-of-the-art vision-language

40GB / 12GB40GB disk
Needs Upgrade
Need 28GB more

Command R (35B)

Textcommand

Retrieval-optimized

83.2GB / 12GB21.0GB disk
Needs Upgrade
Need 71GB more

Aya 23 35B

Textaya

Cohere's massive multilingual model

83.2GB / 12GB21.0GB disk
Needs Upgrade
Need 71GB more

Falcon 40B

Textfalcon

Mid-size Falcon model

95.1GB / 12GB24.0GB disk
Needs Upgrade
Need 83GB more

Phi-3.5 MoE (42B)

TextMoEphi

Efficient mixture of experts

100.1GB / 12GB25.1GB disk
Needs Upgrade
Need 88GB more

Grok-3 Mini

Textgrok

Efficient reasoning model with real-time tools

106.9GB / 12GB27.0GB disk
Needs Upgrade
Need 95GB more

Mixtral 8x7B (MoE)

TextMoEmistral

Popular efficient MoE

111.7GB / 12GB28.0GB disk
Needs Upgrade
Need 100GB more

Yuan 2.0 51B

Textyuan

Mid-size Yuan variant

121.4GB / 12GB30.6GB disk
Needs Upgrade
Need 109GB more

Jamba v0.1

TextMoEjamba

Mamba-Transformer Hybrid

124GB / 12GB31.2GB disk
Needs Upgrade
Need 112GB more

Jamba 1.5 Mini

TextMoEjamba

Efficient hybrid architecture

124GB / 12GB31.2GB disk
Needs Upgrade
Need 112GB more

DeepSeek R1 Distill 70B

Textdeepseek

Distilled reasoning model

166.3GB / 12GB42.0GB disk
Needs Upgrade
Need 154GB more

Nemotron-4 70B

Textnemotron

NVIDIA RLHF aligned model

166.3GB / 12GB42.0GB disk
Needs Upgrade
Need 154GB more

Hermes 3 70B

Texthermes

Uncensored agentic Llama 3.1 tune

166.3GB / 12GB42.0GB disk
Needs Upgrade
Need 154GB more

Functionary V3 Medium

Textfunctionary

MeetKai's agentic control model

166.3GB / 12GB42.0GB disk
Needs Upgrade
Need 154GB more

Llama 3.3 70B

Textllama

Refined Llama 3 with superior following

168.3GB / 12GB42.3GB disk
Needs Upgrade
Need 156GB more

Llama 3.1 70B

Textllama

Enterprise-grade intelligence

168.3GB / 12GB42.3GB disk
Needs Upgrade
Need 156GB more

Qwen 3 VL 72B

Visionqwen

Visual reasoning powerhouse

46.8GB / 12GB43.2GB disk
Needs Upgrade
Need 35GB more

Molmo 72B

Visionmolmo

AllenAI open state-of-the-art

80GB / 12GB80GB disk
Needs Upgrade
Need 68GB more

Qwen 2.5 Math 72B

Textqwen

Math-specific reasoning model

171.5GB / 12GB43.2GB disk
Needs Upgrade
Need 160GB more

NuminaMath 72B

Textnumina

Winner of AI Math Olympiad

171.5GB / 12GB43.2GB disk
Needs Upgrade
Need 160GB more

Kimi K2.5

Textkimi

Top-tier coding and reasoning, open weights

171.5GB / 12GB43.2GB disk
Needs Upgrade
Need 160GB more

Kimi-Dev 72B

Textkimi

Specialized software development model

171.5GB / 12GB43.2GB disk
Needs Upgrade
Need 160GB more

Qwen 2.5 72B

Textqwen

Top-tier reasoning and coding

173.7GB / 12GB43.6GB disk
Needs Upgrade
Need 162GB more

Qwen2-VL 72B

Visionqwen

Alibaba multimodal flagship

80GB / 12GB80GB disk
Needs Upgrade
Need 68GB more

InternVL3 78B

Textinternvl

Frontier multimodal understanding

186GB / 12GB46.8GB disk
Needs Upgrade
Need 174GB more

Llama 3.2 90B Vision

Textllama

Multimodal with image understanding

210.6GB / 12GB53.1GB disk
Needs Upgrade
Need 199GB more

ESM3 Open

Scienceesm

Simulate & generate biology

63.7GB / 12GB58.8GB disk
Needs Upgrade
Need 52GB more

Ernie X1

Texternie

First Baidu reasoning model

237.6GB / 12GB60.0GB disk
Needs Upgrade
Need 226GB more

SenseTime XL

Textsensetime

Enterprise multimodal model

237.6GB / 12GB60.0GB disk
Needs Upgrade
Need 226GB more

Yuan 2.0

Textyuan

Large scale Chinese model

242.8GB / 12GB61.2GB disk
Needs Upgrade
Need 231GB more

Command R+ (104B)

Textcommand

Enterprise RAG and tool use

248GB / 12GB62.4GB disk
Needs Upgrade
Need 236GB more

GLM-4.5 Air

TextMoEglm

Efficient MoE variant

252.1GB / 12GB63.6GB disk
Needs Upgrade
Need 240GB more

Llama 4 Scout

TextMoEllama4

Consumer flagship MoE, 16 experts, 10M context

259.9GB / 12GB65.4GB disk
Needs Upgrade
Need 248GB more

Command A

Textcommand

Agentic enterprise model, 256K context

264GB / 12GB66.6GB disk
Needs Upgrade
Need 252GB more

TeleChat2 115B

Texttelechat

China Telecom massive model

273.3GB / 12GB69.0GB disk
Needs Upgrade
Need 261GB more

GPT-OSS 120B

Textgpt-oss

OpenAI first open-source model, fits single 80GB GPU

278.5GB / 12GB70.2GB disk
Needs Upgrade
Need 267GB more

Mistral Large 2 (123B)

Textmistral

Flagship Mistral model

292.9GB / 12GB73.8GB disk
Needs Upgrade
Need 281GB more

Pixtral Large 124B

Textpixtral

Flagship vision model with 128K context

295.5GB / 12GB74.4GB disk
Needs Upgrade
Need 284GB more

DBRX Instruct

TextMoEdbrx

Databrick's powerful MoE

314.1GB / 12GB79.2GB disk
Needs Upgrade
Need 302GB more

DBRX Base

TextMoEdbrx

Foundation MoE from Databricks

314.1GB / 12GB79.2GB disk
Needs Upgrade
Need 302GB more

Mixtral 8x22B (MoE)

TextMoEmistral

Large scale MoE

335.3GB / 12GB84.6GB disk
Needs Upgrade
Need 323GB more

xLAM 8x22B

TextMoExlam

Salesforce "Large Action Model" flagship

335.3GB / 12GB84.6GB disk
Needs Upgrade
Need 323GB more

Mixtral 8x22B DPO

TextMoEmistral

DPO-tuned Mixtral

335.3GB / 12GB84.6GB disk
Needs Upgrade
Need 323GB more

Falcon 180B

Textfalcon

Large scale open model

427.7GB / 12GB108.0GB disk
Needs Upgrade
Need 416GB more

Ernie Bot 4

Texternie

Previous generation flagship

427.7GB / 12GB108.0GB disk
Needs Upgrade
Need 416GB more

MiniMax M2

Textminimax

Capable chat model with strong performance

475.3GB / 12GB120.0GB disk
Needs Upgrade
Need 463GB more

Ernie 4.5

Texternie

Latest multimodal foundational model

475.3GB / 12GB120.0GB disk
Needs Upgrade
Need 463GB more

Qwen 3 235B (MoE)

TextMoEqwen

Open weights flagship, highly efficient experts

558.4GB / 12GB141.0GB disk
Needs Upgrade
Need 546GB more

DeepSeek Coder V2 236B (MoE)

TextMoEdeepseek

Expert code generation MoE

561GB / 12GB141.6GB disk
Needs Upgrade
Need 549GB more

MiMo-V2 Flash

TextMoEmimo

Ultra-fast reasoning MoE, 256K context

735.2GB / 12GB185.4GB disk
Needs Upgrade
Need 723GB more

Grok-1

TextMoEgrok

xAI massive open model

747GB / 12GB188.4GB disk
Needs Upgrade
Need 735GB more

Nemotron-4 340B

Textnemotron

Synthetic data generation flagship

808GB / 12GB204.0GB disk
Needs Upgrade
Need 796GB more

GLM-4.6

TextMoEglm

Latest Zhipu flagship MoE model

843.6GB / 12GB213.0GB disk
Needs Upgrade
Need 832GB more

GLM-4.5

TextMoEglm

Advanced open-source MoE from Zhipu

843.6GB / 12GB213.0GB disk
Needs Upgrade
Need 832GB more

Hunyuan Large

TextMoEhunyuan

Tencent flagship MoE model

925.3GB / 12GB233.4GB disk
Needs Upgrade
Need 913GB more

Jamba 1.5 Large

TextMoEjamba

Hybrid Transformer-Mamba, 256k context

946.4GB / 12GB238.8GB disk
Needs Upgrade
Need 934GB more

Llama 4 Maverick

TextMoEllama4

High-efficiency MoE, 128 experts, 1M context

950.5GB / 12GB240.0GB disk
Needs Upgrade
Need 939GB more

Llama 3.1 405B

Textllama

Frontier-class open model. Requires datacenter hardware.

962.4GB / 12GB243.0GB disk
Needs Upgrade
Need 950GB more

MiniMax Text-01

TextMoEminimax

1M context window with hybrid attention

1083.8GB / 12GB273.6GB disk
Needs Upgrade
Need 1072GB more

MiniMax M1

TextMoEminimax

Lightning reasoning model, hybrid thinking

1083.8GB / 12GB273.6GB disk
Needs Upgrade
Need 1072GB more

DeepSeek V3 (MoE)

TextMoEdeepseek

Massive MoE - exceptional performance

1594.7GB / 12GB402.6GB disk
Needs Upgrade
Need 1583GB more

DeepSeek R1 671B (MoE)

TextMoEdeepseek

Reasoning specialist with o1-level performance

1594.7GB / 12GB402.6GB disk
Needs Upgrade
Need 1583GB more

DeepSeek R1 Zero

TextMoEdeepseek

Pure RL reasoning without supervised fine-tuning

1594.7GB / 12GB402.6GB disk
Needs Upgrade
Need 1583GB more

Mistral Large 3

TextMoEmistral

Granular MoE flagship, 256K context

1604GB / 12GB405.0GB disk
Needs Upgrade
Need 1592GB more

Mistral Large 3 NVFP4

TextMoEmistral

FP4 quantized version for NVIDIA NIM

1604GB / 12GB405.0GB disk
Needs Upgrade
Need 1592GB more

DeepSeek V3.1

TextMoEdeepseek

Upgraded V3 with improved reasoning

1627.8GB / 12GB411.0GB disk
Needs Upgrade
Need 1616GB more

Qwen 3 Max (Thinking)

TextMoEqwen

Flagship reasoning model with "System 2" thinking mode

2851.6GB / 12GB720.0GB disk
Needs Upgrade
Need 2840GB more

Llama 4 Behemoth

TextMoEllama4

Flagship 2T foundation model, 16 experts

4752.7GB / 12GB1200.0GB disk
Needs Upgrade
Need 4741GB more

Polaris 3.0

Sciencepolaris

Hippocratic AI's medical constellation

2730GB / 12GB2520.0GB disk
Needs Upgrade
Need 2718GB more

VRAM Bottleneck Detected

Many models are running with RAM offloading. An upgrade to 16GB+ VRAM would significantly improve performance.