Phenaki: Google's AI model for generating videos from text

Phenaki: Google's AI model for generating videos from text

PhenakiIt is a software developed by Google Research Team that canText generation videoofAI Models, which can synthesize realistic video images based on a series of text prompts entered by the user.

This technology is innovative and leading in the field of video generation as it is able to address multiple challenges such as high computational cost, variable video length, and lack of high-quality text-based video data.

Phenaki has two main components: an encoder-decoder model that compresses videos into discrete embeddings or tokens while being able to handle videos of varying lengths;

The other is a transformer model that converts text embeddings into video tokens and then decodes them into actual videos.

Phenaki also leverages a large amount of image-text pair data and a small amount of video-text pair data for joint training, thereby achieving generalization beyond video datasets.

Phenaki is currently able to generate videos of arbitrary length from open-domain time-variant text or stories, and outperforms current frame-by-frame baselines used in the literature in terms of both spatial-temporal quality and the number of tokens per video.

Features

  • Generating videos from time-varying text: Phenaki can generate video clips in chronological order based on a series of text prompts entered by the user. These text prompts can be of any theme, style, and plot, as long as they can describe a clear and coherent scene.
  • Generate realistic and diverse videos: Phenaki can generate videos with high resolution, high frame rate, high dynamic range and high color accuracy, while maintaining the clarity, stability and continuity of the picture. Phenaki can also generate diverse and creative videos, such as presenting scenes that do not exist or are difficult to achieve in reality, or mixing and transforming different styles and elements.
  • Support interactive and iterative generation: Phenaki supports users to interact with the model and iterative generation, that is, users can modify, add or delete text prompts at any time, and then observe how the model adjusts the video output. In this way, users can create and edit according to their own preferences and needs, achieving a higher degree of personalization and customization.

Product Price

Currently, Phenaki has not been officially released as a commercial product, so no specific pricing information has been announced. However, according to the information released by the Google Research Team on its website, Phenaki has currently opened some sample videos for users to watch online, and plans to provide more video samples and interactive demonstrations in the future.

In addition, the Google research team also stated that they are exploring the application of Phenaki in different fields and scenarios, such as education, entertainment, advertising, games, etc., as well as combining it with other video processing technologies, such as super-resolution, style transfer, video editing, etc.

Official website address:https://phenaki.video/

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
productvideo

Artflow.ai: AI virtual human video generation platform based on artificial intelligence

2024-1-26 9:36:17

productvideo

HeadshotPro: AI-powered professional avatar generator tool

2024-1-26 9:39:58

Search