Video generation from text: what is the best AI video generator?

13 October, 2021



[Image: AI-generated landscape]

 

A video generator that can manage the voice-over, the motion picture and the music would be great for those who don’t want to spend hours editing video.

 

In this article we focus on the most difficult part: generating the motion picture. An AI (artificial intelligence) first reads a text, then generates a video inspired by that text.

 

An AI video generator, as an alternative to the long process of traditional video creation, can cost much less and can convert text into videos in no time.

 

To choose a video generator, you first need to know which type of video generator fits your needs.

 

Video generators can be divided into 2 types: 1/ video generation with a virtual human presenter, and 2/ synthetic media generation.

 

 

=> Type 1 - AI video generator with a virtual human presenter: 2 websites offer this service, synthesia.io and rephrase.ai.

 

- synthesia.io

 

Website headline: Goodbye cameras, microphones and actors! Create professional AI videos from text in 50+ languages. Synthesia saves you money, time and nerves.

 

YouTube channel

 

 

- rephrase.ai

 

Website headline: Create Studio-quality AI videos as simply as typing. With Rephrase.ai, you can easily create stunning business videos. Say goodbye to film crews and expensive equipment.

 

YouTube channel

 

 

=> Type 2 - Synthetic media generation

 

Video generation from text is a new field of study, and the problem is far from solved.

 

Static image generation is the first step toward 100% AI-generated videos.

 

For example, Rollideo plans to add a Text-to-Image generator; this feature is under development. Meanwhile, Rollideo users can manually choose a picture as the background of a video sequence.

 

In the field of Text-to-Image generation, the most famous company is OpenAI.

 

OpenAI Blog page

 

Website headline: We’ve trained a neural network called DALL·E that creates images from text captions for a wide range of concepts expressible in natural language.

 

For example, if you enter the text “an armchair in the shape of an avocado”, it will generate various images of that concept.

 

How can you test a public demo of DALL·E Text-to-Image?

 

Unfortunately, DALL·E has not yet been fully released to the public.

 

If you have a technical background, you can check the GitHub repository.

 

To stay informed about the latest developments in Text-to-Image techniques, you can follow these resources:

 

- The subreddit about AI-generated and manipulated content.

 

- The subreddit dedicated to art produced via machine learning algorithms.