Stability AI releases Stable Audio Open, an open source audio generation model that can generate up to 47 seconds of stereo audio

Stability AI recently launched a new open source audio generation model named Stable Audio Open. What sets this model apart is that it can generate up to 47 seconds of stereo audio from a text prompt, at a sampling rate of 44.1 kHz.
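To put those figures in perspective, the arithmetic below works out how much raw audio a maximum-length clip contains. The duration, sampling rate, and channel count come from the article; the 16-bit PCM sample width is only an assumption made for illustration, and the function name `clip_size` is hypothetical.

```python
# Figures quoted in the article.
DURATION_S = 47          # maximum clip length in seconds
SAMPLE_RATE_HZ = 44_100  # 44.1 kHz
CHANNELS = 2             # stereo

# Assumption for illustration only: uncompressed 16-bit PCM (2 bytes per sample).
BYTES_PER_SAMPLE = 2


def clip_size(duration_s, sample_rate_hz, channels, bytes_per_sample):
    """Return (frames, samples, bytes) for an uncompressed PCM clip."""
    frames = duration_s * sample_rate_hz  # one frame = one sample per channel
    samples = frames * channels           # total individual sample values
    return frames, samples, samples * bytes_per_sample


frames, samples, pcm_bytes = clip_size(
    DURATION_S, SAMPLE_RATE_HZ, CHANNELS, BYTES_PER_SAMPLE
)
print(frames)     # 2072700
print(samples)    # 4145400
print(pcm_bytes)  # 8290800
```

Under the 16-bit assumption, a full 47-second stereo clip is roughly 7.9 MiB of raw audio.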

Unlike many currently popular audio generation models, Stable Audio Open has open weights, which means that anyone can inspect, modify, and extend the model. This openness supports scientific research and gives developers more options. More importantly, the model was trained only on audio files licensed under Creative Commons, which keeps the training data legally sound, avoids potential copyright issues, and reflects close attention to the ethical use of data.

In terms of technical architecture, Stable Audio Open is designed for high-fidelity text-to-audio generation and produces high-quality stereo output, giving listeners clear and realistic sound. During training, the model is exposed to a wide variety of audio samples, which helps it learn a richer range of soundscapes and makes the generated audio more realistic and diverse.

In addition, to check that the new model is comparable to the industry's top models, the development team ran a comprehensive performance evaluation. The key metric was FDopenl3, a Fréchet distance computed on OpenL3 audio embeddings (lower is better). On this metric, the researchers found that the model performs well at generating high-quality audio and is comparable to other strong models in the industry. This comparison supports the practicality of Stable Audio Open.
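For readers unfamiliar with Fréchet-distance metrics such as FDopenl3, the sketch below shows the general idea: fit a Gaussian to each set of embeddings and measure the distance between the two Gaussians. It is not the team's evaluation code. The function name `frechet_distance` is hypothetical, and the random arrays are stand-ins for real OpenL3 embeddings, which this example does not compute.

```python
import numpy as np


def frechet_distance(emb_a: np.ndarray, emb_b: np.ndarray) -> float:
    """Fréchet distance between Gaussians fitted to two embedding sets.

    Each input has shape (n_samples, dim). Lower means more similar.
    """
    mu_a, mu_b = emb_a.mean(axis=0), emb_b.mean(axis=0)
    cov_a = np.atleast_2d(np.cov(emb_a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(emb_b, rowvar=False))

    # Tr((cov_a cov_b)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of cov_a @ cov_b. For covariance matrices these are real
    # and non-negative, so clip away tiny negative values from rounding.
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(
        diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * trace_sqrt
    )


# Stand-in data: random vectors in place of real audio embeddings.
rng = np.random.default_rng(0)
reference = rng.normal(size=(200, 4))
generated = reference + 0.5  # same spread, shifted mean

print(frechet_distance(reference, reference))  # essentially zero
print(frechet_distance(reference, generated))  # larger: the means differ
```

In a real evaluation, the two inputs would be embeddings of reference audio and of generated audio extracted with the same embedding model.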

With its focus on openness and high-quality audio synthesis, Stable Audio Open gives researchers, artists, and developers an important new tool.
