Google launches Gemini 1.5 Pro public preview, now supports audio processing

As previously announced at the Google Next conference, Google is making Gemini 1.5 Pro available to the public for the first time through its AI application platform, Vertex AI.

Google has now launched a public preview of Gemini 1.5 Pro and, in doing so, has given it "ears" for working with audio content: users can upload an audio file directly for analysis, or upload a recording of an earnings call or a video for the model to summarize.

This version of the Gemini family, positioned as a "middleweight" model, is said to outperform Google's own larger model, Gemini Ultra. Google says Gemini 1.5 Pro can understand complex instructions without the model needing to be fine-tuned.

For now, Gemini 1.5 Pro is available only through Vertex AI, while Gemini Ultra is available to all Pro users through the Gemini chatbot. However, although Gemini Ultra has more features and can understand long instructions, it does not process requests as quickly as Gemini 1.5 Pro.

Gemini 1.5 Pro is not the only major Google model to get an update. Imagen 2, the text-to-image model that helps Gemini generate images, will also gain image repair (inpainting) and image expansion (outpainting) capabilities, allowing users to add or remove elements in an image.

Google also provides a digital watermarking feature called "SynthID" for all images generated by Imagen models. Simply put, SynthID adds a mark to an image that is invisible to the user but can be checked by a detection tool to confirm the image's origin.

It's worth noting that many of Imagen's new features, such as image repair and expansion, have already appeared in other text-to-image models, such as Stability AI's Stable Diffusion and Getty's Generative AI by iStock, not to mention similar features in the latest Samsung Galaxy phones.

Google said it will also offer, in public preview, the ability to combine AI responses with Google Search results so that answers can draw on the latest information.
