Novel Techniques and Improvements
Stability AI has taken advantage of several novel techniques to improve quality and performance in Stable Diffusion 3.5. One notable addition is the integration of Query-Key Normalization into the transformer blocks. This technique facilitates easier fine-tuning and further development of the models by end-users. This innovation enables users to adapt the models to their specific needs and applications.
Stability AI has also enhanced its Multimodal Diffusion Transformer MMDiT-X architecture, specifically for the medium model. MMDiT-X is able to help improve image quality and enhance multi-resolution generation capabilities. This architecture allows the model to generate images with a range of resolutions, from low to high, while maintaining a high level of quality.