OpenAI’s latest model is all about video

OpenAI has dipped its toes, or should I say its whole body, into the world of video generation. Following in the footsteps of startups like RunwayML, the company announced Sora, a text-to-video AI model that’s capable of producing some stunning, almost concerning, results.

It was announced last week, out of the blue, and it quickly took social media by storm. OpenAI CEO Sam Altman generated a number of videos based on people’s suggested prompts, including dogs recording a podcast, a drone race on Mars, and a variety of sea creatures riding bikes.

Sora works like the rest of OpenAI’s offerings: enter a prompt as simple or as detailed as you like, and it will generate a 1080p video up to a minute long in whatever style you want, populated with things, people, animals, and different environments. You can also craft your blockbuster movie by dropping in a still image for Sora to animate, or an existing video for it to extend.

According to OpenAI, Sora was trained on around 10,000 hours of “high quality video” and is built upon a transformer architecture, which apparently gives the model superior scaling performance. It also uses the same “recaptioning technique from DALL·E 3, which involves generating highly descriptive captions for the visual training data.”

Safety was a big concern for the team as well, so Sora isn’t open to the public yet. Instead, the company is working with “red-teamers” (experts in things like misinformation, hate content, and bias) who will test the model thoroughly before any release to the wider public.


Reddit 🤝 AI

Reddit is betting on AI to supercharge its IPO. According to Bloomberg, Reddit has signed a contract with an unnamed company to train an AI model on the site’s plethora of user-generated content.

Why does this matter?

Well, Reddit first revealed the deal, which is rumored to be worth a whopping $60 million, earlier this year to potential investors in its anticipated initial public offering (IPO). The IPO is speculated to happen as early as next month.

It also comes not long after the company tightened access to its data with rate limits and some fairly pricey API plans, which put even long-standing third-party apps like Apollo out of business.

Could this be a mechanism to shore up support for the company’s IPO? Let’s wait and see.


AGI, when?

Artificial general intelligence (AGI) has become one of the biggest buzzwords of the last few years. It’s essentially a type of AI that can understand, learn, and apply knowledge across a wide array of tasks at a human level.

Ever since AI broke into the mainstream with the likes of OpenAI, people have speculated about when AGI will arrive and what the consequences (good or bad) will be.

Well, it could be earlier than expected, according to Amazon. The company’s AGI team has stated that its LLM is exhibiting language abilities that it wasn’t trained on.

According to a not-yet-peer-reviewed academic paper, the AI is displaying “state-of-the-art naturalness” in conversational text. It seems the model was able to come up with all sorts of sentences that, according to an expert linguist, reflect a human level of understanding.


