Inside Meta AI’s Make-A-Video: The New Super Model that can Generate Videos from Textual Inputs
The new model builds on the principles of text-to-image methods to produce visually astonishing videos.
I recently started an AI-focused educational newsletter that already has over 125,000 subscribers. TheSequence is a no-BS (meaning no hype, no news, etc.) ML-oriented newsletter that takes five minutes to read. The goal is to keep you up to date with machine learning projects, research papers, and concepts. Please give it a try by subscribing below:
Text-to-video (T2V) is considered the next frontier for generative artificial intelligence (AI) models. While the text-to-image (T2I) space is experiencing a revolution with models such as DALL-E, Stable Diffusion, and Midjourney, T2V remains a monumental challenge. Recently, researchers from Meta AI unveiled Make-A-Video, a T2V model able to create realistic short video clips from textual inputs.