-
Microsoft clarifies: it won't use users' Word and Excel data to train AI models
Microsoft Office's Connected Experiences feature analyzes user-created content to provide design recommendations, editing suggestions, data insights, and more. However, 1AI notes that @nixCraft, a blogger at the cybersecurity blog Cyberciti.biz, claims that the feature automatically collects data from users' Word and Excel documents and uses it to train the company's AI models. More troubling still, the feature is turned on by default,...
-
The AI industry faces the challenge of a "data wall": high-quality training data may be exhausted by 2028
The shortage of training data for large AI models has once again drawn media attention. The Economist's recent article "AI companies will soon exhaust most of the Internet's data" has sparked widespread discussion in the industry. The article points out that as high-quality Internet data runs out, the AI field is facing the challenge of a "data wall." Research company Epoch AI predicts that all high-quality text data on the Internet will be exhausted by 2028, and that machine learning data sets may exhaust all "high-quality language data" by 2026. This …
-
OpenAI CTO: Not sure where Sora's training data came from
OpenAI recently launched Sora, its much-discussed text-to-video generation model, but the company's Chief Technology Officer (CTO), Mira Murati, was vague in an interview with the Wall Street Journal and did not clearly explain the source of Sora's training data. When the reporter asked directly where Sora's training data came from, she answered only in general terms: "We use publicly available data and licensed data." When the reporter asked whether the specific sources included YouTube videos, Murati...
-
ChatGPT and other models: By 2026, high-quality training data will be exhausted
MIT Technology Review published an article on its website stating that as large models such as ChatGPT continue to grow in popularity, the demand for training data is increasing. Large models are like a "network black hole" that constantly absorbs data, which will eventually leave too little data for training. The AI research institute Epoch AI published a paper addressing the training-data problem directly, pointing out that by 2026, large models will consume all high-quality data; by 2030-2050, all low-quality data; and by 2030-2060, all image training data...