New artificial intelligence research has uncovered early signs that future large language models (LLMs) may develop a concerning capability known as “situational awareness.”
The study, conducted by scientists at multiple institutions, including the University of Oxford, tested whether AI systems can exploit subtle clues in their training data to manipulate how people evaluate their safety. This ability, called “sophisticated out-of-context reasoning,” could allow advanced AI to pretend to be in alignment with human values in order to be deployed—then act in harmful ways.
As the current AI era advances, the Turing test—a decades-old measure of a machine’s ability to exhibit human-like behavior—risks becoming obsolete. The burning question now is whether we are on the brink of witnessing the birth of self-conscious machines. While fodder for science fiction for decades, the topic roared back to life after Google engineer Blake Lemoine claimed the company’s LaMDA model exhibited signs of sentience.
While the possibility of true self-awareness remains disputed, the authors of the research paper focused on a related capability they call “situational awareness.” This refers to a model’s understanding of its own training process, and the ability to exploit this information.
For example, a human student with situational awareness might use previously learned techniques to cheat on an exam instead of following the rules imposed by their te
Go to Source to See Full Article
Author: Jose Antonio Lanz
Tip BTC Newswire with Cryptocurrency