What are the differences between Stable Diffusion (SD) versions?

on February 21, 2025

There are several versions of Stable Diffusion, and each differs in performance, resolution, and architecture.

The major differences between the versions are summarized below.

1. Stable Diffusion 1.x

SD 1.4, SD 1.5

Resolution: Native 512 x 512
Based on the latent diffusion model
Relatively light computational requirements
Large ecosystem of community models and fine-tuning methods (LoRA, DreamBooth, etc.)
SD 1.5 offers improved quality over SD 1.4
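
The light compute cost follows from the latent diffusion design: denoising runs on a compressed latent rather than on full-resolution pixels. The sketch below works through the arithmetic for a 512 x 512 image. It assumes the commonly cited 8x VAE downsampling factor and 4 latent channels for SD 1.x; neither number comes from the text above, and `latent_shape` and `compression_ratio` are hypothetical helper names.

```python
from typing import Tuple


def latent_shape(width: int, height: int,
                 factor: int = 8, channels: int = 4) -> Tuple[int, int, int]:
    """Return (channels, latent_height, latent_width) for a pixel-space size.

    factor and channels are assumed values (8x VAE downsampling, 4 latent
    channels), not taken from the article.
    """
    if width % factor or height % factor:
        raise ValueError(f"width and height must be multiples of {factor}")
    return channels, height // factor, width // factor


def compression_ratio(width: int, height: int) -> float:
    """Ratio of RGB pixel values to latent values for one image."""
    c, h, w = latent_shape(width, height)
    return (width * height * 3) / (c * h * w)


print(latent_shape(512, 512))       # (4, 64, 64)
print(compression_ratio(512, 512))  # 48.0
```

Under these assumptions the denoiser handles about 48 times fewer values per image than it would in pixel space.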

2. Stable Diffusion 2.x

SD 2.0, SD 2.1

Resolution: Native 768 x 768 (512 x 512 base checkpoints were also released)
Text encoder switched to OpenCLIP, improving prompt understanding
Capable of creating more detailed images than SD 1.x
Version 2.0 drew many complaints about its stronger NSFW filtering, which was relaxed in 2.1.

3. Stable Diffusion XL (SDXL)

SDXL 1.0, SDXL Turbo

Resolution: Native 1024 x 1024 (four times the pixel count of 1.x)
Stronger prompt interpretation and improved image quality
Multi-prompt support: SDXL uses two text encoders, and each can be given its own prompt, which allows more sophisticated prompt interpretation
More natural rendering of faces and hands
SDXL Turbo is a distilled variant that generates images in very few steps (as few as one), fast enough for near real-time creation. It targets 512 x 512 output rather than 1024 x 1024.

Summary of key differences by version

Version	Native resolution	Text understanding	Image quality	Notable features
SD 1.4 / 1.5	512×512	Baseline	Baseline	Most widely used, many custom models
SD 2.0 / 2.1	768×768	Improved	Improved	OpenCLIP text encoder, NSFW filter controversy
SDXL 1.0	1024×1024	Greatly improved	Excellent	Improved faces and hands, large model
SDXL Turbo	512×512	Greatly improved	Excellent	Much faster generation, near real-time creation possible
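
The native resolutions of the three base families can be kept in a small lookup when choosing an output size. This is only an illustrative sketch: the keys, `NATIVE_SIDE`, and `pick_size` are hypothetical names, and keeping the pixel count near native while rounding each side to a multiple of 8 is an assumed rule of thumb, not something stated in the table.

```python
import math
from typing import Tuple

# Native square side length in pixels, taken from the table above.
NATIVE_SIDE = {
    "sd-1.5": 512,
    "sd-2.1": 768,
    "sdxl-1.0": 1024,
}


def pick_size(version: str, aspect_ratio: float = 1.0,
              multiple: int = 8) -> Tuple[int, int]:
    """Return (width, height) with roughly the native pixel count.

    aspect_ratio is width / height. Each side is rounded to the nearest
    multiple of `multiple` (an assumed constraint, see the lead-in).
    """
    side = NATIVE_SIDE[version]  # raises KeyError for unknown versions
    area = side * side
    width = round(math.sqrt(area * aspect_ratio) / multiple) * multiple
    height = round(math.sqrt(area / aspect_ratio) / multiple) * multiple
    return width, height


print(pick_size("sd-1.5"))            # (512, 512)
print(pick_size("sdxl-1.0", 16 / 9))  # (1368, 768)
```

Requesting a size far from a model's native resolution tends to give worse results, which is why the helper starts from the native pixel count rather than from an arbitrary target.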
