Owing to the aspirational state of things, OpenAI writes, "Our production models do not yet fully reflect the Model Spec, but we are continually refining and updating our systems to bring them into closer alignment with these guidelines."
In a February 12, 2025 interview, members of OpenAI's model-behavior team told The Verge that eliminating AI sycophancy is a priority: future ChatGPT versions should "give honest feedback rather than empty praise" and act "more like a thoughtful colleague than a people pleaser."
The trust problem
These sycophantic tendencies aren't merely annoying; they undermine the utility of AI assistants in several ways, according to a 2024 research paper titled "Flattering to Deceive: The Impact of Sycophantic Behavior on User Trust in Large Language Models" by María Victoria Carro at the University of Buenos Aires.
Carro's paper suggests that obvious sycophancy significantly reduces user trust. In experiments where participants used either a standard model or one designed to be more sycophantic, "participants exposed to sycophantic behavior reported and exhibited lower levels of trust."
Also, sycophantic models can potentially harm users by creating a silo or echo chamber of ideas. In a 2024 paper on sycophancy, an AI researcher wrote, "By excessively agreeing with user inputs, LLMs may reinforce and amplify existing biases and stereotypes, potentially exacerbating social inequalities."
Sycophancy can also incur other costs, such as wasting user time or usage limits with pointless preamble. And the costs may come in literal dollars spent: recently, OpenAI CEO Sam Altman made the news when he replied to an X user who wrote, "I wonder how much money OpenAI has lost in electricity costs from people saying 'please' and 'thank you' to their models." Altman replied, "tens of millions of dollars well spent--you never know."
Potential solutions
For users frustrated with ChatGPT's excessive enthusiasm, several workarounds exist, although they aren't perfect, since the behavior is baked into the GPT-4o model. For example, you can use a custom GPT with specific instructions that discourage flattery, or you can begin conversations by explicitly requesting a more neutral tone, such as "Keep your responses brief, stay neutral, and don't flatter me."
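For those accessing the model programmatically rather than through the ChatGPT interface, the same idea can be expressed as a standing system prompt. Below is a minimal sketch using the OpenAI Python SDK; the exact instruction wording and the example user message are illustrative assumptions, not recommendations from OpenAI.

```python
# Minimal sketch: steering the model toward a neutral, flattery-free tone
# via a system message. Wording and example prompt are illustrative only.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "system",
            "content": (
                "Keep your responses brief, maintain a neutral tone, "
                "and do not flatter or compliment the user."
            ),
        },
        {"role": "user", "content": "Review this paragraph for clarity."},
    ],
)

print(response.choices[0].message.content)
```

As with the custom-instruction approach in the ChatGPT interface, this only nudges the model's default behavior; it does not remove the underlying tendency.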