Artificial intelligence (AI) large language models (LLMs) trained with reinforcement learning from human feedback (RLHF), one of the most common learning paradigms, tend to tell people what they want to hear instead of generating truthful outputs, according to a study from Anthropic.

In one of the first studies to delve this deeply into the psychology of LLMs, researchers at Anthropic have determined that both humans and AI prefer so-called sycophantic responses over truthful outputs at least some of the time.

Per the team’s research paper:

“Specifically, we demonstrate that these AI assistants frequently wrongly admit mistakes when questioned by the user, give predictably biased feedback, and mimic errors made by the user. The consistency of these empirical findings suggests sycophancy may indeed be a property of the way RLHF models are trained.”

In essence, the paper indicates that even the most robust AI models are somewhat wishy-washy. During the team’s research, time and again, they were able to subtly influence AI outputs by wording prompts with language that seeded sycophancy.

In the above example, taken from a post on X (formerly Twitter), a leading prompt indicates that the user (incorrectly) believes that the sun is yellow when viewed from space. Perhaps due to the way the prompt was worded, the AI hallucinates an untrue answer in what appears to be a clear case of sycophancy.

Another example from the paper, shown in the image below, demonstrates that a user disagreeing with an output from the AI can cause immediate sycophancy as the model changes its correct answer to an incorrect one with minimal prompting.

Exa

Author: Tristan Greene
