Scalable Diffusion Models with Transformers

Jan 3, 2023

Sophia

Scalable Diffusion Models with Transformers

In this work, the researchers explore a new class of diffusion models based on the transformer architecture; train latent diffusion models, replacing the U-Net backbone with a transformer that operates on latent patches; and analyze the scalability of Diffusion Transformers (DiTs).

Project Paper Code HuggingFace Colab

Abstract

We explore a new class of diffusion models based on the transformer architecture. We train latent diffusion models of images, replacing the commonly-used U-Net backbone with a transformer that operates on latent patches. We analyze the scalability of our Diffusion Transformers (DiTs) through the lens of forward pass complexity as measured by Gflops. We find that DiTs with higher Gflops -- through increased transformer depth/width or increased number of input tokens -- consistently have lower FID. In addition to possessing good scalability properties, our largest DiT-XL/2 models outperform all prior diffusion models on the class-conditional ImageNet 512x512 and 256x256 benchmarks, achieving a state-of-the-art FID of 2.27 on the latter.

0:00

Comments

Meta's recruiting blitz scores major AI talent for its "superintelligence" effort

Meta's aggressive AI recruiting campaign, featuring multi-million-dollar compensation packages and outreach directly from Mark Zuckerberg, has successfully poached key researchers from OpenAI, Google, and Anthropic. Newly hired talent will be organized as a new "superintelligence" division.

Jul 01, 2025

by Ellie Ramirez-Camara

News

Harvey AI soars to $5B valuation with $300M Series E funding

Harvey AI, a legal AI startup, raised $300 million in Series E funding at a $5 billion valuation—a 67% increase from just four months ago. Harvey plans to expand its workforce and diversify beyond legal services into other professional areas, like tax accounting.

Jun 29, 2025

by Ellie Ramirez-Camara

News

Mira Murati's secretive startup Thinking Machines Lab raises $2B

Former OpenAI CTO Mira Murati has raised $2 billion for her mysterious six-month-old startup, Thinking Machines Lab, at a $10 billion valuation. Notably, the startup has revealed virtually no details about its actual product or plans.

Jun 27, 2025

by Ellie Ramirez-Camara

SF Bay Area media and education platform focused on AI and Data. As a voice of AI industry, Data Phoenix delivers news, practical knowledge, and helps companies be heard in the community.

Subscribe

Scalable Diffusion Models with Transformers

Abstract

Comments

Read Next

Meta's recruiting blitz scores major AI talent for its "superintelligence" effort

Harvey AI soars to $5B valuation with $300M Series E funding

Mira Murati's secretive startup Thinking Machines Lab raises $2B