Google polishes Gemini AI skills: expand supported file types, improve document insights

Google hones Gemini AI skills: expand supported file types, improve document insights

GoogleOn August 27, a blog post was published, announcing that Gemini AI supports more types of files and provides users with better AI services by analyzing, extracting, and gaining insights into document content.

Google hones Gemini AI skills: expand supported file types, improve document insights

Google says that Google Workspace users with Gemini Business, Enterprise, Education, or Education Premium licenses can now upload a variety of files to Gemini (gemini.google.com) from Google Drive or local devices:

spreadsheet:Gemini AI can now process spreadsheets in formats such as CSV, XLSX, and ODS, enabling users to analyze numerical data, track trends, and generate insights from financial models, sales reports, and more.
Presentation:Users can now upload presentations in formats such as PPTX, PDF, and KEY, allowing Gemini AI to extract key points, summarize content, and identify visual elements such as charts and images.
image:Gemini AI can now analyze images in formats such as JPEG, PNG, and GIF to extract text, identify objects, and provide context for visual content.
Audio:Users can now upload audio files in formats such as MP3, WAV, and FLAC, allowing Gemini AI to transcribe speech, identify speakers, and summarize key points from interviews, podcasts, and lectures.
video:Gemini AI can now process video files in MP4, MOV, and AVI formats to extract scripts, identify scenes, and summarize key events from presentations, documentaries, and training videos.

According to the press release, Gemini can analyze user-uploaded files more specifically based on prompt words entered by users, summarize complex topics, identify trends and insights, and provide suggestions for improved writing and document organization, helping users improve their understanding, research, and writing skills.

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.

{{userData.name}}Verify

Google hones Gemini AI skills: expand supported file types, improve document insights

Perplexity AI Search Tests PPLX Payment System: Online Shopping in Just 2 Clicks

The largest single-cluster intelligent computing center of domestic operators was put into use in Harbin, capable of training large models with trillions of parameters

AI Weibo

AI Applications

5000+ AI applications! Updated daily

AIAICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

Perplexity AI Search Tests PPLX Payment System: Online Shopping in Just 2 Clicks

The largest single-cluster intelligent computing center of domestic operators was put into use in Harbin, capable of training large models with trillions of parameters

Google says Bard is now smarter than ChatGPT thanks to its Gemini Pro large language model

Google launches Gemini 1.5 Pro public preview, now supports audio processing

Google launches AI video editing app Vids to test Gemini AI-generated demo videos

Google launches Gemini 1.5 Pro, a powerful multimodal model, which ranks ahead of GPT-4o and Claude-3.5 Sonnet

AI Applications

5000+ AI applications! Updated daily

AIAICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow