Papers

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

StyleGAN-T addresses the specific requirements of large-scale text-to-image synthesis to significantly improve over previous GANs and outperform distilled diffusion models in terms of sample quality and speed. Learn more about the model!

Sophia

· Jan 30, 2023

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Paper Code

Abstract

Text-to-image synthesis has recently seen significant progress thanks to large pretrained language models, large-scale training data, and the introduction of scalable model families such as diffusion and autoregressive models. However, the best-performing models require iterative evaluation to generate a single sample. In contrast, generative adversarial networks (GANs) only need a single forward pass. They are thus much faster, but they currently remain far behind the state-of-the-art in large-scale text-to-image synthesis. This paper aims to identify the necessary steps to regain competitiveness. Our proposed model, StyleGAN-T, addresses the specific requirements of large-scale text-to-image synthesis, such as large capacity, stable training on diverse datasets, strong text alignment, and controllable variation vs. text alignment tradeoff. StyleGAN-T significantly improves over previous GANs and outperforms distilled diffusion models - the previous state-of-the-art in fast text-to-image synthesis - in terms of sample quality and speed.

Video

Comments

Arcee AI releases Trinity-Large-Thinking, a very capable open-weights reasoning model

Arcee AI, a tiny US startup that recently developed a completely open contender to Meta's Llama 4 Maverick, recently released Trinity-Large-Thinking, an open weights reasoning model claimed to be "the strongest open model ever released outside of China."

Apr 07, 2026

by Ellie Ramirez-Camara

News

Microsoft has launched three new models heavily marketed towards business use cases

Microsoft AI announced the availability of three foundational models for transcription and voice and image generation. The launch highlights these models' competitive pricing and practical business value as Microsoft shifts its AI strategy towards business and productivity solutions.

Apr 06, 2026

by Ellie Ramirez-Camara

News

Mistral secures $830M in debt financing to build its first data center in France

Mistral raised $830 million in debt financing from a consortium of seven banks to purchase 13,800 Nvidia GPUs for its first data center near Paris, positioning itself as Europe's sovereign AI alternative to US proprietary models.

Apr 03, 2026

by Ellie Ramirez-Camara

News

Mercor reports it fell victim to a cyberattack linked to the recently compromised LiteLLM

Mercor recently confirmed it was victim to a security incident related to the compromise of the popular open-source project LiteLLM. Extortion group Lapsus$ has claimed responsibility for the attack, allegedly posting a sample of the stolen data on its leak site.

Mar 31, 2026

by Ellie Ramirez-Camara

Legal tech darling Harvey confirms new funding round at a $11B valuation

Legal tech startup Harvey has recently confirmed that it successfully raised $200 million at an $11 billion valuation. The round was co-led by returning investors GIC and Sequoia, with participation from Andreessen Horowitz, Coatue, Conviction Partners, and Elad Gil, among others.

Mar 30, 2026

by Ellie Ramirez-Camara

Subscribe

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Abstract

Video

Comments

Read Next

Arcee AI releases Trinity-Large-Thinking, a very capable open-weights reasoning model

Microsoft has launched three new models heavily marketed towards business use cases

Mistral secures $830M in debt financing to build its first data center in France

Mercor reports it fell victim to a cyberattack linked to the recently compromised LiteLLM

Legal tech darling Harvey confirms new funding round at a $11B valuation