OpenAI Reveals Sora, Its New Text-to-Video AI Model that Creates Minute-Long Films

A lot more than just a creative toy!

By Muhammad QasimUpdated on April 5, 2024

Muhammad Qasim is a certified Google IT Support Professional.

Filmmaking may change forever the way we know it. Just 14 months after the hugely successful launch of ChatGPT, OpenAI has announced that they will be launching an AI model called Sora, capable of generating high-quality, realistic-looking videos up to one minute long, solely from text prompts. However, that’s only the tip of the iceberg.

Sora Turns Text into Minute-Long Videos, But it’s Not Publicly Available Yet

We’ve seen AI video models in the past like Google Lumiere and Runway, but Sora appears to top them all. For one, it can emulate real-world scenarios with utmost precision, thanks to its deep understanding of real-world physics and existence. But what truly sets it apart is the duration of videos it can generate.

Most text-to-video models can only simulate videos of up to a few seconds, and even then, most models tend to produce grainy videos that can’t be deemed anything close to realistic. Although the technology has been improving steadily, it hasn’t quite reached its full potential yet.

In its introductory blog post, OpenAI showcased a number of video samples with complex scenarios, generated from text prompts, and the results look to be oddly close to reality. The videos are full of detail, and the model does a great job of understanding how complex elements exist in the real world. Sora appears to have proper knowledge of how things work in the actual world, and how things are actually fimed with understanding of cinematic techniques like camera angles and pacing.

Snipped from a video generated via Sora | OpenAI

According to the company, Sora is built upon information from the existing DALL-E 3 and GPT models. Unlike DALL-E, which primarily focuses on creative outputs, Sora is much more powerful and has real-world applications. The model in itself is a “data-driven physics engine“, which can not only create visuals but also simulate physical interactions and real-world scenarios.

Sora Can Literally ‘Interact’ with the Virtual Worlds

Sora doesn’t just generate individual images or videos, but can create and interact with entire virtual worlds, be they realistic or fantastical. Not only will this have applications in the scientific world, but the technology can also be used to help in game development as well as virtual reality—the timing of release may just be perfect.

However, OpenAI states that the model isn’t ready yet and may struggle with accurately simulating the physics of a complex scene and understanding specific instances of cause and effect. The video samples, too, appear to have been specifically and carefully selected to showcase the best of Sora and its capabilities.

OpenAI has said that Sora is not yet available to the public and is currently undergoing a series of safety tests. It is accessible only to a team of designated red testers who are actively searching for potential ways the service could be abused.

This is all we know for now, but rest assured that we will keep you updated as new information becomes available.

Sora Turns Text into Minute-Long Videos, But it’s Not Publicly Available Yet

Sora Can Literally ‘Interact’ with the Virtual Worlds

Related Articles

Fortnite players are getting $126 million in refunds for Epic’s ‘dark’ in-game tricks

Samsung’s new phones are coming at Galaxy Unpacked on July 9th

The Dia browser arrives on macOS, and it can read your tabs using AI

Samsung Foundry Struggles With 3nm Yields at 50% as TSMC Climbs Past 90%

Honor Mocks Samsung’s “Consistent” Look In New Honor 400 Ad

Intel Sells 51% Stake in Altera for $8.75B to Help Revive Its Chip Business