Inside Muse: Google’s New Text-to-Image Super Model
The new generative AI model shows significant efficiency improvements over models like Stable Diffusion, Imagen and Parti.
Text-to-Image (TTI) models have been at the center of the generative AI revolution, with models such as DALL-E, Stable Diffusion and Midjourney capturing the headlines. This explosion in high-quality TTI models has been fundamentally powered by diffusion or autoregressive methods that can effectively compute similarities between text and images. The nascent nature of these architectures makes them relatively prohibitive from a computational standpoint, and there is still a lot of work that can be done to improve their…