-
Apple researchers question AI's reasoning ability: simple math questions can be answered incorrectly with minor changes
Artificial Intelligence (AI) has made significant progress in various fields in recent years, with Large Language Models (LLMs) capable of generating human-level text and even outperforming humans on certain tasks. However, researchers have questioned the reasoning ability of LLMs, finding that these models make mistakes when solving simple math problems with minor changes, suggesting that they may not be capable of true logical reasoning. On Thursday, a group of researchers at Apple released a paper titled "Understanding the Limitations of Mathematical Reasoning in Large Language Models," which reveals that LLMs are better at solving...- 1.7k
-
The smarter the AI, the more likely it is to "make stuff up," study finds.
A new study has found that as large-scale language models (LLMs) become more powerful, they also seem to be increasingly prone to making up facts rather than avoiding or refusing to answer questions they can't. This suggests that these smarter AI chatbots are actually becoming less reliable. In the study, published in the journal Nature, researchers looked at some of the industry's leading commercial LLMs: OpenAI's GPT and Meta's LLaMA, as well as BLOOM, an open-source model created by the research group BigScience. The study found that while these ...- 1.8k
-
Microsoft CTO believes that the "law of scale" of large language models still works and there is a lot to look forward to in the future
In an interview with a Sequoia Capital podcast last week, Microsoft CTO Kevin Scott reiterated his belief that the "law of scale" of large language models (LLMs) will continue to drive progress in artificial intelligence, despite suspicions among some in the field that progress has stagnated. Scott played a key role in Microsoft's $13 billion technology-sharing agreement with OpenAI. "Others may disagree, but I don't think that scale has reached a critical point of diminishing returns," Scott said. "I want people to understand that there is a...- 2.4k
-
Samsung confirms it will launch an AI-powered upgraded version of Bixby this year, powered by its own large language model
Samsung confirms Bixby will soon get an AI upgrade. After the release of the Galaxy Z Flip 6 and Galaxy Z Fold 6, Samsung Mobile CEO TM Roh told CNBC that the company will release an upgraded version of Bixby later this year, powered by Samsung's own Large Language Model (LLM). "We will enhance Bixby's capabilities by applying generative AI technology," Roh said. A few months ago, Samsung launched a product called "Samsung ...- 1.8k
-
Hebbia receives $130 million in funding to build an AI knowledge retrieval platform
New York-based Hebbia has announced $130 million in Series B funding from investors including Andreessen Horowitz, Index Ventures, Peter Thiel, and Google’s venture capital arm. Hebbia is building something pretty simple: an LLM-native productivity interface that makes it easier to get value from data, regardless of its type or size. The company has already worked with some of the largest companies in the financial services industry, including hedge funds and investment banks,…- 2.6k
-
What are the common AI terms? 20 AI professional terms you should know!
Just as the cryptocurrency craze brought with it a lot of new jargon, the AI craze has also brought with it a lot of jargon that we often hear, but don’t necessarily understand. If you want to know the difference between a chatbot and an LLM (Large Language Model), or the difference between deep learning and machine learning, you’ve come to the right place. Here are 20 AI-related terms with detailed explanations. Artificial Intelligence (AI) In simple terms, AI is giving computers or machines human-like intelligence. This term is very broad and encompasses many different types of machine intelligence. Currently…- 11.4k
-
Chatbots talking nonsense? Oxford researchers use semantic entropy to see through AI "hallucinations"
In recent years, artificial intelligence has flourished, and applications such as chatbots have become increasingly popular. People can get information from these chatbots (such as ChatGPT) through simple instructions. However, these chatbots are still prone to the problem of "AI hallucination", that is, providing wrong answers and sometimes even dangerous information. Image source Pexels One of the reasons for the "hallucination" is inaccurate training data, insufficient generalization ability, and side effects during data collection. However, researchers at the University of Oxford have taken a different approach and detailed a newly developed method in the latest issue of Nature magazine...- 1.9k
-
Apple executives: working hard to introduce "Apple Intelligence" into the Chinese market
Apple released the highly anticipated iOS 18 and macOS 15 systems at WWDC 2024. One of the important new features is "Apple Intelligence" - a set of tools based on artificial intelligence. This feature will be officially launched later this year. Apple's head of software engineering Craig Federighi revealed some future development plans for Apple Intelligence in an interview with Fast Company. "Apple Intelligence will be a new AI-based tool for developers who are looking to improve their AI skills and improve their understanding of the world."- 2.3k
-
MIT Technology Review: Data is the foundation of generative AI
Pre-trained large language models (LLMs) such as GPT-4 and Gemini have attracted much attention from organizations, who are eager to use LLMs to build applications such as chatbots and co-pilots. According to a new report from MIT Technology Review, titled "AI Readiness of C-Level Leaders," a survey conducted on behalf of ETL vendor Fivetran found that scaling AI or GenAI is the "top priority" of 82% executives surveyed. Source Note: Image generated by AI, image license service provider Midjourney survey…- 1.3k
-
Gurman: Apple is developing its own large-scale language model on the device to enable AI functions
Apple is working on a large language model (LLM) that runs on-device, according to Bloomberg reporter Mark Gurman, to improve the responsiveness and privacy protection of its upcoming generative AI features. Gurman mentioned in his "Power On" newsletter that Apple's LLM will be the basis for the company's future generative AI features. Unlike most cloud-based AI services today, all signs indicate that the model will run entirely on the user's device. By running on the device, Apple's... -
Tsai Chongxin: China's AI technology may lag behind OpenAI in the United States by two years
According to media reports, Alibaba co-founder and chairman Joseph Tsai frankly pointed out that "there is a certain gap between China and the United States in the field of AI technology". He further pointed out that compared with the top large language models (LLMs) in the United States, such as OpenAI ChatGPT, China may be two years behind". However, this does not mean that China's pace of catching up in this field will slow down. Joseph Tsai emphasized that China is actively working to catch up with the new wave of AI led by American companies. He firmly believes that in the long run, facing the challenge of Nvidia's chip ban, China will be able to independently manufacture high-end G...- 1.9k
-
GPT-4 outperforms doctors in clinical reasoning, but also makes mistakes more often, study finds
In a new study, scientists at Beth Israel Medical Center (BIDMC) compared the clinical reasoning abilities of a large language model with those of human doctors. The researchers used the revised IDEA (r-IDEA) score, a commonly used tool for assessing clinical reasoning abilities. The study involved giving a GPT-4-powered chatbot, 21 attending physicians, and 18 residents 20 clinical cases to build diagnostic reasoning and solve problems. The r-IDEA scores of the three sets of answers were then evaluated. The researchers found that…- 1.4k
-
MediaTek launches MR Breeze-7B model with 7 billion parameters: good at data insight and supports bilingual interaction
MediaTek Research, a research institute under MediaTek, recently announced the launch of a new open source large language model (LLM) called MR Breeze-7B. This open source model is good at processing traditional Chinese and English, with a total of 7 billion parameters, and is designed based on the acclaimed Mistral model. Compared with the previous generation product BLOOM-3B, MR Breeze-7B has absorbed 20 times more knowledge, allowing it to handle traditional Chinese with higher accuracy…- 2.4k
-
Meta releases new AI automatic video editing tool Agents LAVE
Agents LAVE is a new AI automatic video editing tool released by Meta, which uses AI technology to automatically generate simple short videos and advertising videos without human intervention. The tool interface includes input prompts, material library and video timeline, while Agents design guides the execution of editing action plans. Paper address: https://arxiv.org/pdf/2402.10294.pdf Agents supports five LLM functions, including material overview, creative brainstorming, video retrieval, storyboard and editing trimming, to achieve automatic generation of...- 34.6k
-
Sharing ChatGPT high-quality prompt skills, 26 prompts to improve ChatGPT output quality!
Today, I found a paper for everyone to guide the writing of large language model prompts (with experimental data support, the effect is awesome!) The paper introduces 26 guiding principles, the goal is to simplify the concept of formulating questions for large language models of different sizes, test their capabilities, and enhance users' understanding of the behavior of models of different sizes when receiving different prompts. The researchers conducted extensive experiments on LLaMA-1/2 (7B, 13B and 70B) and GPT-3.5/4 to verify the effectiveness of these principles in the design of instructions and prompts. The paper points out: Large language models…- 4.6k
-
Research: The Internet is full of low-quality machine-translated content, and large language model training needs to be wary of data traps
Researchers at Amazon Cloud AI Labs found that a large amount of content on the Internet comes from machine translation (MT), and the quality of these translated content across multiple languages is generally low. The research team emphasized that this highlights the importance of data quality and source considerations when training large language models (LLMs). Image source Pexels The study also found that machine-generated content is common in translations of languages with fewer resources and accounts for a large portion of online content. IT Home noted that the research team developed a massive...- 1.3k
-
Oracle's OCI Generative AI Service Now Available
Oracle announced the general availability of the Oracle Cloud Infrastructure (OCI) Generative AI service, along with new innovations that make it easier for enterprises to take advantage of the latest advances in generative AI. The OCI Generative AI service is a fully managed service that seamlessly integrates Large Language Models (LLMs) from Cohere and Meta Llama2 to address a variety of business use cases. The OCI Generative AI service now features…- 1.2k
-
Apple AIM autoregressive vision model validation performance is related to model size
Researchers at Apple have used the Autoregressive Image Model (AIM) to verify that the more parameters a visual model has, the better its performance. This further demonstrates that as the capacity or amount of pre-trained data increases, the model can continue to improve its performance. AIM can effectively utilize large amounts of unstructured image data, and its training method and stability are similar to those of recent large language models (LLMs). This observation is consistent with previous research results on scaling large language models. Although the model used in this experiment is limited in size, further exploration is needed to verify this rule on models with larger parameter magnitudes. The researchers used pre-trained...- 2.6k
-
LLM AutoEval: AI platform automatically evaluates LLM in Google Colab
In the field of natural language processing, the evaluation of language models is crucial for developers to push the boundaries of language understanding and generation. LLM AutoEval is a tool designed to simplify and accelerate the evaluation process of language models (LLMs), tailored for developers seeking to quickly and efficiently evaluate LLM performance. LLM AutoEval has the following key features: 1. **Automated setup and execution:** LLM AutoEval simplifies the setup and execution process by using RunPod, providing a convenient Colab notebook for seamless deployment. 2. *…- 6.5k
-
Canalys: Chinese manufacturers are expected to be the first to bring AI mobile phones to lower price segments
Today, the analysis agency Canalys released a report saying that Chinese local smartphone manufacturers have recently actively invested in self-developed large language models (LLM). With the update and iteration of SoC and the rapid upgrade of market storage configuration, Chinese manufacturers have begun to focus more on end-side AI capabilities. Chinese local manufacturers have ecological advantages in the local market, which enables AI to play a greater role. They usually have a higher attachment rate in the local market and provide more comprehensive hardware product coverage. With a wide coverage of hardware categories and a solid user base, AI can give rise to more diverse usage scenarios...- 2.1k
-
Google DeepMind releases 'Robot Constitution' to ensure its AI bots don't harm humans
Google DeepMind’s robotics team announced three new advances designed to help robots make faster, better, and safer decisions in complex environments. One of them is a system for collecting training data, equipped with a “robot constitution” to ensure that your AI robot office assistant doesn’t bump into your human colleague on its way to get you more printer paper. Google’s data collection system, AutoRT, uses a visual language model (VLM) and a large language model (LLM) that work together to understand the environment, adapt to unfamiliar situations, and decide on appropriate tasks. The “robot constitution”…- 1.8k
-
OpenAI's official Prompt Engineering Guide: You can play ChatGPT like this
With the emergence of large language models (LLMs) such as ChatGPT and GPT-4, prompt engineering has become increasingly important. Many people regard prompt as a mantra for LLMs, and its quality directly affects the output of the model. How to write a good prompt has become a compulsory course in LLM research. OpenAI, which leads the trend of large model development, recently released a guide to prompt engineering, which shares how to use some strategies to make LLMs such as GPT-4 output better...- 3.1k
-
Researchers trick AI chatbots into leaking harmful content with a success rate of 98%
Researchers at Purdue University in Indiana have designed a new method to successfully induce large language models (LLMs) to generate harmful content, revealing potential harm hidden in compliant responses. When conversing with a chatbot, the researchers found that by leveraging probabilistic data and soft labels made public by the model maker, the model could be forced to generate harmful content with a success rate of up to 98%. Source Note: The image is generated by AI, and the image is authorized by Midjourney. Traditional jailbreaking methods usually require providing prompts to bypass security features, while this new method uses probabilistic data and soft labels to force...- 2.4k
-
Real AI releases HOMINIS, Europe's first humanistic open source large language model project
At the Data Science Conference held in Belgrade on November 23, 2023, Real AI announced that it had successfully won the ISCRA project. Real AI will build Europe's first humanistic large language model (LLM) based on the Leonardo supercomputer. The Leonardo supercomputer is located in the CINECA data center in Bologna and is a high-performance computing behemoth. It is built on the Atos BullSequana XH2000 computer system and integrates nearly 14,000 Nvidia Ampere GP…- 2.7k
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed: