CatVTON: a lightweight AI virtual try-on model with only 899.06M parameters and high-resolution support

CatVTON is a small AI virtual try-on model aimed at anyone interested in fashion. Its main selling point is a lightweight network: 899.06M parameters in total, of which only 49.57M are trainable. Inference uses less than 8G of video memory and supports a resolution of 1024x768, which makes the model practical to run on a personal computer.

Project page: https://github.com/Zheng-Chong/CatVTON

The model's features can be summarized as follows:

1) Lightweight network (899.06M parameters in total)

2) Parameter-efficient training (49.57M trainable parameters)

3) Simplified inference (less than 8G VRAM at 1024x768 resolution)
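To put the first two figures in perspective, the short Python sketch below computes what share of the network is actually trained. It uses only the two numbers quoted above; the function name `trainable_fraction` is made up for this illustration and is not part of the CatVTON code base.

```python
# Figures quoted in the article, in millions of parameters.
TOTAL_PARAMS_M = 899.06
TRAINABLE_PARAMS_M = 49.57


def trainable_fraction(trainable_m: float, total_m: float) -> float:
    """Return the share of parameters that are updated during training."""
    if total_m <= 0:
        raise ValueError("total_m must be positive")
    return trainable_m / total_m


if __name__ == "__main__":
    share = trainable_fraction(TRAINABLE_PARAMS_M, TOTAL_PARAMS_M)
    # Format as a percentage with two decimals.
    print(f"Trainable share: {share:.2%}")  # Trainable share: 5.51%
```

In other words, only about one parameter in eighteen is updated during training, which is what the "parameter-efficient" label refers to.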

The CatVTON development team recently released the latest code and deployment instructions on GitHub, including how to quickly deploy CatVTON on ComfyUI. With just a few simple steps, you can try out the latest virtual try-on technology at home.

Installation steps:

First, set up the environment by following the installation guide. Then download the ComfyUI-CatVTON archive and unzip it into the custom_nodes folder of your ComfyUI project. Once that is done, start ComfyUI and you can begin trying on outfits.
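The unzip step can also be scripted. The following is a minimal Python sketch using only the standard library; the function name `install_custom_node` and the example paths are assumptions made for illustration, so adjust them to wherever your ComfyUI checkout and the downloaded archive actually live.

```python
import zipfile
from pathlib import Path


def install_custom_node(archive: Path, comfyui_root: Path) -> Path:
    """Extract a custom-node zip archive into <comfyui_root>/custom_nodes.

    Returns the custom_nodes directory the archive was extracted into.
    """
    target = comfyui_root / "custom_nodes"
    if not target.is_dir():
        # Refuse to guess: a ComfyUI project is expected to have this folder.
        raise FileNotFoundError(f"{target} not found; is this a ComfyUI project?")
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target)
    return target


# Example call with hypothetical paths:
# install_custom_node(Path("ComfyUI-CatVTON.zip"), Path("ComfyUI"))
```

After the archive is extracted, restart ComfyUI so that it picks up the new node folder.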

If you prefer the Gradio app, a single command starts it, and the required checkpoints are downloaded automatically from HuggingFace. CatVTON also supports inference on the DressCode and VITON-HD datasets, and the corresponding inference commands are simple: enter them on the command line and you can see the results within a few minutes.

In addition, CatVTON supports multiple precision options, so users can get a good experience under different hardware conditions. The model is built on Stable Diffusion v1.5 inpainting and is combined with SCHP and DensePose, which automatically generate the masks needed for the try-on.
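To illustrate why the precision setting matters on limited hardware, here is a rough Python estimate of how much memory the model weights alone occupy at different precisions. The article does not name the supported options, so `fp32`, `fp16` and `bf16` are assumed here as typical choices, and the helper `weight_memory_gib` is hypothetical. The estimate covers weights only; activations and other buffers used during inference come on top of it.

```python
# Bytes needed to store one parameter at each precision.
# The list of precisions is an assumption, not taken from the CatVTON docs.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2}

TOTAL_PARAMS_M = 899.06  # total parameter count quoted in the article (millions)


def weight_memory_gib(params_millions: float, precision: str) -> float:
    """Estimate the memory taken by the weights alone, in GiB."""
    total_bytes = params_millions * 1e6 * BYTES_PER_PARAM[precision]
    return total_bytes / 2**30


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        gib = weight_memory_gib(TOTAL_PARAMS_M, precision)
        print(f"{precision}: about {gib:.2f} GiB for weights")
```

Under these assumptions, halving the precision halves the weight footprint, which leaves more of the available video memory for the image being processed.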
