-
After completing its fourth round of financing within a year, Z.ai receives another 200 million yuan from the Beijing Artificial Intelligence Industry Investment Fund
April 18 news: on April 16, the Beijing Artificial Intelligence Industry Investment Fund announced that, building on last year's investment, it will invest an additional 200 million yuan in Z.ai to support research and development of Z.ai's open-source models and the building of its open-source community ecosystem. According to the fund, Z.ai is the first large-model AI company it has invested in since its establishment, and is also its fastest-growing portfolio company at present. Z.ai has built up comprehensive model capabilities across text, reasoning, voice, image, video, code, and more. It also has the ability to develop open-source models and an open-source community ecosystem. In addition, its commercialization layout is mature, with more than one million scale ......
-
OpenAI releases its strongest reasoning models, o3 and o4-mini; "photo location search" becomes the latest popular use
April 18 news: more and more users are using ChatGPT to work out the exact location where a photo was taken, a new and worrying phenomenon that is spreading rapidly across the web, according to a TechCrunch report today. This week, OpenAI launched two new models, o3 and o4-mini, with image reasoning capabilities that can analyze details in uploaded photos, and even crop, rotate, and zoom in on blurry or distorted images for deeper recognition. With this analytical capability, combined with the model's web search function ......
-
ByteDance Seed open-sources UI-TARS-1.5: a multimodal agent built on a vision-language model
April 18 news: 1AI learned from the Doubao large model team that UI-TARS-1.5 was officially released and open-sourced yesterday. It is an open-source multimodal agent built on a vision-language model, capable of efficiently performing a wide range of tasks in virtual environments. The relevant links are as follows: GitHub: https://github.com/bytedance/UI-TARS Website: https://seed-tars.com/ Arxiv: https://arxiv.org/ab ......
-
Google also wants to "send AI to campus": U.S. college students can subscribe to the Google One AI Premium program for free for a limited time.
April 18 news: according to a report today by The Verge, Google has become the latest AI service provider to join the competition for the college market. Starting immediately, U.S. college students can subscribe to One AI Premium for free until June 30, 2026, without paying the usual monthly fee of $20 (note: about 146 yuan at the current exchange rate). Google spokesperson Alex Joseph said that students who want to apply will need to sign up by June 30, 2025, and pass a valid ......
-
Industry first: Alibaba's Tongyi Wanxiang open-sources its "first-and-last-frame video generation model"
Alibaba's Tongyi Wanxiang announced on April 17 that its "first-and-last-frame video generation model" is now open source. The model has 14B parameters and is said to be the industry's first open-source first-and-last-frame video model at the ten-billion-parameter scale. Given user-specified start and end images, it can generate a 720p HD video that connects the first and last frames, an upgrade intended to meet users' needs for more controllable and customized video generation. Users can try the model for free on the Tongyi Wanxiang website, or download it from GitHub, Hugging Face, or the ModelScope community for local deployment and secondary development. Technology ......
-
Shanghai Artificial Intelligence Laboratory open-sources multimodal large model "Shusheng Wanxiang 3.0": able to process text and multimodal inputs simultaneously
According to the official WeChat account of the Shanghai Artificial Intelligence Laboratory, on April 16 the laboratory (Shanghai AI Lab) upgraded and open-sourced its general-purpose multimodal large model Shusheng Wanxiang 3.0 (InternVL3). According to the official introduction, InternVL3 uses innovative multimodal pre-training and post-training methods to comprehensively improve its basic multimodal capabilities. The full range of versions, from 1 billion to 78 billion parameters, ranks first among open-source models on expert-level benchmarks and comprehensive multimodal performance tests, while also significantly improving graphical user interface (GUI) agent ......
-
ByteDance Releases Doubao 1.5 Deep Thinking Model with "Thinking in Pictures" Capability
April 17 news: at today's Hangzhou stop of the Volcano Engine AI Innovation Tour, Tan Dai, president of ByteDance's Volcano Engine, released the latest Doubao 1.5 Deep Thinking model. According to reports, the model performs well both in professional fields such as math, programming, and scientific reasoning, and in general tasks such as creative writing. Its score on the AIME 2024 mathematical reasoning test ties OpenAI o3-mini-high, and its scores on programming competition and scientific reasoning tests are close to o1. The model also demonstrates excellent generalization ability on general-purpose tasks such as creative writing and humanities quizzes ......
-
Microsoft's latest report teaches you 'fraud prevention': how to avoid AI-generated fake jobs and scam sites
April 17 news: according to a report by Neowin, Microsoft on the 16th released the latest edition of its "Cybersecurity Signals Report", which details how to deal with new types of threats, scams, and fraud in today's cybersecurity landscape, and explains how AI has made developing malware "easier than ever". Microsoft notes that threat actors are increasing their efforts to deceive potential victims through deepfakes, voice cloning, fake employee profiles, and fake e-commerce sites and product images. 1AI summarizes the content as follows: AI reduces cyber......