SD.Next supported models
https://vladmandic.github.io/sdnext-docs/Models/
On 2025-07-15 sorted by size
| Publisher |
Model |
Version |
Size |
Diffusion Architecture |
Model Params |
Text Encoder(s) |
TE Params |
Auto Encoder |
| Wan-AI |
WAN |
2.1 14B |
78.52GB |
MMDiT |
|
UMT5-XXL |
14.288B |
16ch VAE |
| HiDream-AI |
HiDream |
I2 Fast/Dev/Full |
42.71 GB + 15.69 |
MMDiT |
17.10B |
CLiP ViT-L + ViT+G + T5-XXL + LLama-3.1-8B |
0.12B + 0.69B + 2.95B + 4.54B |
16ch VAE |
| nVidia |
Cosmos-Predict2 T2I |
14B |
37.36GB |
MMDiT |
14.26B |
T5-XXL |
4.86 |
WAN-VAE |
| Black Forest Labs |
Flux |
1 Dev/Schnell |
32.93GB |
MMDiT |
11.9B |
CLiP ViT-L + T5-XXL |
0.12B + 4.76B |
16ch VAE |
| Black Forest Labs |
Flux |
1 Kontext-Dev |
32.93GB |
MMDiT |
11.9B |
CLiP ViT-L + T5-XXL |
0.12B + 4.76B |
16ch VAE |
| FAL |
AuraFlow |
0.3 |
31.90GB |
MMDiT |
6.8B |
UMT5 |
12.1B |
VAE |
| VectorSpaceLab |
OmniGen |
v2 |
30.50GB |
Transformer |
3.97B |
Qwen-VL-2.5 |
3.75B |
VAE |
| Thudm |
CogView |
4 |
30.39GB |
DiT |
6.37B |
GLM-4 |
9.40B |
VAE |
| Kandinsky |
Kandinsky |
3 |
27.72GB |
Unet |
3.05B |
T5-XXXL |
8.72B |
VQ |
| Wan-AI |
WAN |
2.1 1.3B |
27.72GB |
MMDiT |
1.42xB |
UMT5-XXL |
5.68B |
16ch VAE |
| StabilityAI |
Stable Diffusion |
3.5 Large |
26.98GB |
MMDiT |
8.05B |
CLiP ViT-L + ViT+G + T5-XXL |
0.12B + 0.69B + 4.76B |
16ch VAE |
| lodestones |
Chroma |
36 |
26.84GB |
MMDiT |
8.9B |
CLiP ViT-L + T5-XXL |
0.12B + 4.76B |
16ch VAE |
| Ostris |
Flex |
1 Alpha |
25.65GB |
MMDiT |
4.0B |
CLiP ViT-L + T5-XXL |
0.12B + 2.95B |
16ch VAE |
| Thudm |
CogView |
3 Plus |
24.96GB |
DiT |
2.85B |
T5-XXL |
4.76B |
VAE |
| PixArt |
Alpha |
XL 2 |
21.3GB |
DiT |
0.61B |
T5-XXL |
4.76B |
VAE |
| PixArt |
Sigma |
XL 2 |
21.3GB |
DiT |
0.61B |
T5-XXL |
4.76B |
VAE |
| AlphaVLLM |
Lumina |
2 |
20.75GB |
DiT |
2.61B |
Gemma-2 |
2.61B |
16ch VAE |
| FreePix |
F-Lite |
|
19.81GB |
MMDiT |
9.8B |
T5-XXL |
2.95B |
16ch VAE |
| Kwai |
Kolors |
N/A |
17.40GB |
UNnet |
2.58B |
ChatGLM |
6.24B |
VAE |
| StabilityAI |
Stable Diffusion |
3.5 Medium |
15.89GB |
MMDiT |
2.25B |
CLiP ViT-L + ViT+G + T5-XXL |
0.12B + 0.69B + 4.76B |
16ch VAE |
| NVLabs |
Sana |
1.5 4.8B |
15.58GB |
MMDiT |
4.72B |
Gemma2 |
2.61B |
DC-AE |
| DeepFloyd |
IF |
L |
15.48GB |
Multi-stage UNet |
0.61B + 0.93B |
T5-XXL |
4.76B |
Pixel |
| VectorSpaceLab |
OmniGen |
v1 |
15.47GB |
Transformer |
3.76B |
Phi-3 |
0 |
VAE |
| StabilityAI |
Stable Diffusion |
3.0 Medium |
15.14GB |
MMDiT |
2.0B |
CLiP ViT-L + ViT+G + T5-XXL |
0.12B + 0.69B + 4.76B |
16ch VAE |
| Tencent |
HunyuanDiT |
1.2 |
14.09GB |
DiT |
1.5B |
BERT + T5-XL |
3.52B + 1.67B |
VAE |
| PlaygroundAI |
Playground |
2.x |
13.35GB |
UNet |
2.56B |
CLiP ViT-L + ViT+G |
0.12B + 0.69B |
VAE |
| nVidia |
Cosmos-Predict2 T2I |
2B |
13.32GB |
MMDiT |
1.96B |
T5-XXL |
4.86 |
WAN-VAE |
| DeepFloyd |
IF |
M |
12.79GB |
Multi-stage UNet |
0.37B + 0.46B |
T5-XXL |
4.76B |
Pixel |
| NVLabs |
Sana |
1.0 1600M |
12.63GB |
MMDiT |
1.60B |
Gemma2 |
2.61B |
DC-AE |
| Warp AI |
Wuerstchen |
N/A |
12.16GB |
Multi-stage UNet |
1.0B + 1.05B |
CLiP ViT-L + ViT+G |
0.12B + 0.69B |
42x VQE |
| StabilityAI |
Stable Cascade |
Medium |
11.82GB |
Multi-stage UNet |
1.56B + 3.6B |
CLiP ViT-G |
0.69B |
42x VQE |
| NVLabs |
Sana |
1.5 1.6B |
9.49GB |
MMDiT |
1.60B |
Gemma2 |
2.61B |
DC-AE |
| Segmind |
SSD-1B |
N/A |
8.72GB |
UNet |
1.33B |
CLiP ViT-L + ViT+G |
0.12B + 0.69B |
VAE |
| AlphaVLLM |
Lumina |
Next SFT |
8.67GB |
DiT |
1.7B |
Gemma |
2.5B |
VAE |
| NVLabs |
Sana |
1.0 600M |
7.51GB |
MMDiT |
0.59B |
Gemma2 |
2.61B |
DC-AE |
| Salesforce |
BLIP-Diffusion |
N/A |
7.23GB |
UNet |
0.86B |
CLiP ViT-L + BLiP-2 |
0.12B + 0.49B |
VAE |
| StabilityAI |
Stable Diffusion |
XL |
6.94GB |
UNet |
2.56B |
CLiP ViT-L + ViT+G |
0.12B + 0.69B |
VAE |
| Koala |
Koala |
700M |
6.58GB |
UNet |
0.78B |
CLiP ViT-L + ViT+G |
0.12B + 0.69B |
VAE |
| Segmind |
Vega |
N/A |
6.43GB |
UNet |
0.75B |
CLiP ViT-L + ViT+G |
0.12B + 0.69B |
VAE |
| Thu-ML |
UniDiffuser |
v1 |
5.37GB |
U-ViT |
0.95B |
CLiP ViT-L + CLiP ViT-B |
0.12B + 0.16B |
VAE |
| Kandinsky |
Kandinsky |
2.2 |
5.15GB |
Unet |
1.25B |
CLiP ViT-G |
0.69B |
VQ |
| StabilityAI |
Stable Cascade |
Lite |
4.97GB |
Multi-stage UNet |
0.7B + 1.0B |
CLiP ViT-G |
0.69B |
42x VQE |
| PlaygroundAI |
Playground |
1 |
4.95GB |
UNet |
0.86B |
CLiP ViT-L |
0.12B |
VAE |
| MeissonFlow |
Meissonic |
N/A |
3.64GB |
DiT |
1.18B |
CLiP ViT-H |
0.35B |
VQ |
| Open-MUSE |
aMUSEd |
256 |
3.41GB |
ViT |
0.60B |
CLiP ViT-L |
0.12B |
VQ |
| StabilityAI |
Stable Diffusion |
2.1 |
2.58GB |
UNet |
0.86B |
CLiP ViT-H |
0.34B |
VAE |
| StabilityAI |
Stable Diffusion |
1.5 |
2.28GB |
UNet |
0.86B |
CLiP ViT-L |
0.12B |
VAE |
| IDKiro |
SDXS |
N/A |
2.05GB |
UNet |
0.32B |
CLiP ViT-L |
0.12B |
VAE |
| Segmind |
Tiny |
N/A |
1.03GB |
UNet |
0.32B |
CLiP ViT-L |
0.12B |
VAE |
Searching for highest resolution available on SD.Next ipex (dev 2025-07-13) on ultra 9 185H + 128GB RAM
Test Prompt: VibrantlySharp style, 3pic_vist4, animeniji, in a soft-focus painterly concept art style Upper body profile view, autumn dryad elf adorned in translucent red foliage and bark-textured silk, wind animating her hair and dress into motion, background: ancient twilight forest, distant amber sunlight diffused through fog and drifting leaves.
|
Model
|
Low steps
|
High steps
|
Wan2.1-T2V-14B
256x256 - 1m 41.11s @ 16 STEPS
512x512 - 5m 48.96s @ 20STEPS
512x512 - 11m 42.14s @ 40STEPS
512x768 - hangs
⭐⭐⭐⭐
|

Parameters: Pipeline: WanPipeline| Steps: 20| Size: 512x512| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 251c919| Operations: txt2img| Model: Wan2.1-T2V-14B-Diffusers
Time: 5m 48.96s | total 511.83 pipeline 348.65 preview 140.76 offload 9.90 vae 8.66 gc 2.12 te 1.99 post 0.29 | GPU 39054 MB 30% | RAM 29.72 GB 24%
|

Parameters: Pipeline: WanPipeline| Steps: 40| Size: 512x512| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 251c919| Operations: txt2img| Model: Wan2.1-T2V-14B-Diffusers
Time: 11m 42.14s | total 1079.70 pipeline 701.83 preview 345.45 vae 12.42 offload 11.25 te 6.87 gc 2.12 post 0.28 | GPU 38830 MB 30% | RAM 29.44 GB 23%
|
Wan2.1-T2V-1.3B
256x256 - 23.16s @ 16 STEPS
512x512 - 54.41s @ 20STEPS
768x768 - 3m 43.11s @ 40STEPS
1024x1024 - 7m 35.17s @ 40STEPS
1280x1280 - 12m 42.76s @ 40STEPS
(resolution to be divisible by 16)
⭐
|

Parameters: Pipeline: WanPipeline| Steps: 20| Size: 512x512| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 3c66c35| Operations: txt2img| Model: Wan2.1-T2V-1.3B-Diffusers
Time: 54.41s | total 96.96 pipeline 54.15 preview 36.26 gc 2.09 te 1.98 offload 1.88 vae 0.80 | GPU 15416 MB 12% | RAM 3.16 GB 3%
|

Parameters: Pipeline: WanPipeline| Steps: 40| Size: 1280x1280| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 3c66c35| Operations: txt2img| Model: Wan2.1-T2V-1.3B-Diffusers
Time: 12m 42.76s | total 1282.37 pipeline 762.43 preview 511.77 vae 2.40 gc 2.11 te 1.98 offload 1.88 post 0.30 | GPU 21686 MB 17% | RAM 3.14 GB 3%
|
FLUX.1-schnell 🪪
768x768 - ok
768x1024 - 1m 24.86s @ 4 STEPS
768x1024 - 7m 45.92s @ 24 STEPS
1024x1024 - hangs
⭐⭐⭐⭐
|

Parameters: Pipeline: FluxPipeline| Steps: 4| Size: 768x1024| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 251c919| Operations: txt2img| Model: FLUX.1-schnell
Time: 1m 24.86s | total 127.77 pipeline 74.92 preview 36.64 decode 9.63 offload 6.53 post 0.29 gc 0.27 | GPU 33784 MB 26% | RAM 25.14 GB 20%
|

Parameters: Pipeline: FluxPipeline| Steps: 24| Size: 768x1024| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 251c919| Operations: txt2img| Model: FLUX.1-schnell
Time: 7m 45.92s | total 903.61 pipeline 438.91 preview 400.19 move 14.96 prompt 14.96 te 14.91 decode 11.74 offload 7.80 post 0.30 gc 0.28 | GPU 33614 MB 26% | RAM 25.17 GB 20%
|
FLUX.1-dev 🪪
64x64 - 58.97s @ 16 STEPS
128x128 - 1m 0.43s @ 16 STEPS
256x256 - 1m 7.51s @ 16 STEPS
512x512 - 2m 0.41s @ 16 STEPS
768x768 - 3m 41.69s @ 16 STEPS
768x1024 - 4m 54.68s @ 16 STEPS
768x1024 - 14m 56.94s@ 40 STEPS
1024x1024 - hangs
⭐⭐⭐⭐
|
768x1024 - 4m 54.68s @ 16 STEPS
Parameters: Pipeline: FluxPipeline| Steps: 16| Size: 768x1024| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 251c919| Operations: txt2img| Model: FLUX.1-dev
Time: 4m 54.68s | total 444.67 pipeline 282.38 preview 139.55 decode 10.05 offload 6.89 move 1.92 prompt 1.91 te 1.88 post 0.33 gc 0.32 | GPU 33804 MB 26% | RAM 25.12 GB 20%
|
768x1024 - 14m 56.94s@ 40 STEPS

Parameters: Pipeline: FluxPipeline| Steps: 50| Size: 768x1024| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 251c919| Operations: txt2img| Model: FLUX.1-dev
Time: 14m 56.94s | total 1776.47 pipeline 875.87 preview 852.86 decode 10.81 move 9.96 prompt 9.95 te 9.91 offload 6.88 post 0.29 gc 0.27 | GPU 33632 MB 26% | RAM 25.15 GB 20%
|
HiDream I1 Full
(Meta-Llama-3.1-8B-Instruct 🪪)
512x512 generates 1024x1024
768x768 generates 1024x1024
1024x1024 -10m 7m 37.49s29.57s @ 812 STEPS
768x7681024x1024 -16m @ 20 STEPS
1024x1024 -42.62s @ 20 STEPS
1280x12801024x1024 -16m 42.62s @ 4050 STEPS
1280x1280 generates 1024x1024
2048x2048 generates 1024x1024
⭐⭐⭐⭐⭐
|

Parameters: Pipeline: HiDreamImagePipeline| Steps: 12| Size: 1024x1024| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 3c66c35| Operations: txt2img| Model: HiDream-I1-Full
Time: 10m 29.57s | total 685.95 pipeline 593.18 move 21.17 prompt 21.13 te 18.92 decode 14.95 offload 11.79 preview 4.42 gc 0.61 post 0.27 | GPU 4508 MB 4% | RAM 58.59 GB 47%
|

Parameters: Pipeline: HiDreamImagePipeline| Steps: 50| Size: 1024x1024| Seed: 1460879190| CFG scale: 6| App: SD.Next| Version: 3c66c35| Operations: txt2img| Model: HiDream-I1-Full
Time: 41m 42.55s | total 2576.47 pipeline 2466.40 preview 22.89 move 20.63 prompt 20.52 te 18.32 decode 15.24 offload 12.13 gc 0.55 post 0.27 | GPU 4572 MB 4% | RAM 58.59 GB 47%
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|