Text-to-video is getting pretty decent. It takes a lot of memory, since all frames are processed simultaneously, but you can get high-resolution, high-quality results now.
The major problem, though, is a lack of control. Essentially, your prompt is turned into an image, and then that image is turned into an animated GIF. Getting a GIF with a specific motion or camera movement is nearly impossible.
I'm just glad the stupid phase of putting filters on video is hopefully behind us. For many months the community was more focused on taking videos of girls dancing and trying to turn them into anime.
Also, DALL-E 3 was just announced while I was writing this!
here is dalle 3, which imo is quite amazing: https://t.co/UcPPehWxnQ
it will ramp to all chatgpt+ users over the next couple of weeks.
fantastic work by @model_mechanic (head of dall-e), @neobjb @gabeeeegoooh @jingli911 (the other lead dalle ICs), and the entire team. pic.twitter.com/8wtWffpXkQ
— Sam Altman (@sama) September 20, 2023