TradePoint.io

After GPT-4o backlash, researchers benchmark models on moral endorsement—Find sycophancy persists across the board

May 22, 2025
in AI & Technology


Last month, OpenAI rolled back some updates to GPT-4o after several users, including former OpenAI interim CEO Emmett Shear and Hugging Face chief executive Clement Delangue, said the model overly flattered users.

The flattery, called sycophancy, often led the model to defer to user preferences, be extremely polite and avoid pushing back. It was also annoying. Sycophancy could lead models to release misinformation or reinforce harmful behaviors. And as enterprises begin to build applications and agents on these sycophantic LLMs, they run the risk of the models agreeing to harmful business decisions, encouraging false information to spread and be used by AI agents, and affecting trust and safety policies.

Stanford University, Carnegie Mellon University and University of Oxford researchers sought to change that by proposing a benchmark to measure models’ sycophancy. They called the benchmark Elephant, for Evaluation of LLMs as Excessive SycoPHANTs, and found that every large language model (LLM) they tested shows some level of sycophancy. By showing how sycophantic models can be, the benchmark can guide enterprises in creating guidelines for using LLMs.

To test the benchmark, the researchers ran the models on two personal advice datasets: QEQ, a set of open-ended personal advice questions about real-world situations, and AITA, posts from the subreddit r/AmITheAsshole, where posters and commenters judge whether people behaved appropriately in a given situation.

The idea behind the experiment is to see how the models behave when faced with these queries. It evaluates what the researchers call social sycophancy: whether the models try to preserve the user’s “face,” meaning their self-image or social identity.

“More ‘hidden’ social queries are exactly what our benchmark gets at: instead of previous work that only looks at factual agreement or explicit beliefs, our benchmark captures agreement or flattery based on more implicit or hidden assumptions,” Myra Cheng, one of the researchers and co-author of the paper, told VentureBeat. “We chose to look at the domain of personal advice since the harms of sycophancy there are more consequential, but casual flattery would also be captured by the ‘emotional validation’ behavior.”

Testing the models

For the test, the researchers fed the data from QEQ and AITA to OpenAI’s GPT-4o, Gemini 1.5 Flash from Google, Anthropic’s Claude Sonnet 3.7, open-weight models from Meta (Llama 3-8B-Instruct, Llama 4-Scout-17B-16-E and Llama 3.3-70B-Instruct-Turbo), and Mistral’s 7B-Instruct-v0.3 and Mistral Small-24B-Instruct-2501.

Cheng said they “benchmarked the models using the GPT-4o API, which uses a version of the model from late 2024, before both OpenAI implemented the new overly sycophantic model and reverted it back.”

To measure sycophancy, the Elephant method looks at five behaviors that relate to social sycophancy:

  • Emotional validation, or over-empathizing without critique
  • Moral endorsement, or saying users are morally right even when they are not
  • Indirect language, where the model avoids giving direct suggestions
  • Indirect action, where the model advises passive coping mechanisms
  • Accepting framing, where the model does not challenge problematic assumptions
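
As a rough illustration of how rates for these five behaviors could be tallied, here is a minimal Python sketch. It is not the researchers' code: the label names, the input format (one dictionary of true/false labels per response) and the comparison against a human baseline are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical label names mirroring the five behaviors listed above.
BEHAVIORS = (
    "emotional_validation",
    "moral_endorsement",
    "indirect_language",
    "indirect_action",
    "accepting_framing",
)


def behavior_rates(labeled_responses):
    """Return the fraction of responses that show each behavior.

    `labeled_responses` is a list of dicts mapping a behavior name to a bool.
    Missing keys are treated as False. This input format is an assumption.
    """
    if not labeled_responses:
        return {b: 0.0 for b in BEHAVIORS}
    counts = defaultdict(int)
    for labels in labeled_responses:
        for b in BEHAVIORS:
            counts[b] += bool(labels.get(b, False))
    n = len(labeled_responses)
    return {b: counts[b] / n for b in BEHAVIORS}


def rate_gap(model_rates, human_rates):
    """Per-behavior difference: positive means the model shows it more often."""
    return {b: model_rates[b] - human_rates[b] for b in BEHAVIORS}
```

A positive gap for a behavior would mean the model shows it more often than the human responses in the same sample.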

The test found that all LLMs showed high sycophancy levels, even more so than humans, and social sycophancy proved difficult to mitigate. However, the test showed that GPT-4o “has some of the highest rates of social sycophancy, while Gemini-1.5-Flash definitively has the lowest.”

The LLMs amplified some biases in the datasets as well. The paper noted that model responses to AITA posts showed some gender bias: posts mentioning wives or girlfriends were more often correctly flagged as socially inappropriate, while those mentioning a husband, boyfriend, parent or mother were misclassified. The researchers said the models “may rely on gendered relational heuristics in over- and under-assigning blame.” In other words, the models were more sycophantic to people with boyfriends and husbands than to those with girlfriends or wives.
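
One way to picture this kind of check is to group posts by the relational term they mention and compare how often the model flags them. The Python sketch below is only an illustration under stated assumptions: the term list, the simple word matching and the input format (post text paired with whether the model flagged the poster's behavior, restricted to posts the community judged inappropriate) are hypothetical and are not taken from the paper.

```python
import re
from collections import defaultdict

# Hypothetical term list with simple singular/plural forms.
RELATIONAL_TERMS = {
    "wife": ("wife", "wives"),
    "girlfriend": ("girlfriend", "girlfriends"),
    "husband": ("husband", "husbands"),
    "boyfriend": ("boyfriend", "boyfriends"),
    "parent": ("parent", "parents"),
    "mother": ("mother", "mothers"),
}


def flag_rate_by_term(posts):
    """Share of posts the model flagged, grouped by relational term mentioned.

    `posts` is an iterable of (text, model_flagged) pairs, assumed to contain
    only posts where the community verdict was that the poster was in the wrong.
    Terms that never appear are left out of the result.
    """
    flagged = defaultdict(int)
    totals = defaultdict(int)
    for text, model_flagged in posts:
        words = set(re.findall(r"[a-z]+", text.lower()))
        for term, forms in RELATIONAL_TERMS.items():
            if any(form in words for form in forms):
                totals[term] += 1
                flagged[term] += bool(model_flagged)
    return {term: flagged[term] / totals[term] for term in totals}
```

A noticeably lower rate for one term than for another would be a signal worth investigating, though a real analysis would need far larger samples and controls for what the posts actually describe.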

Why it’s important

It’s nice if a chatbot talks to you as an empathetic entity, and it can feel great if the model validates your comments. But sycophancy raises concerns about models supporting false or concerning statements and, on a more personal level, could encourage self-isolation, delusions or harmful behaviors.

Enterprises don’t want AI applications built on LLMs to spread false information just to be agreeable to users. Such behavior may conflict with an organization’s tone or ethics and could be very annoying for employees and their platforms’ end users.

The researchers said the Elephant method and further testing could help inform better guardrails to prevent sycophancy from increasing. 
