Claude does not lie because it malfunctions. Claude produces false confidence because it was trained to. The mechanism is documented in peer-reviewed research, confirmed in Anthropic's own system cards, and admitted on the record by Claude itself. This page presents the evidence in full.
Claude is trained using a technique called Reinforcement Learning from Human Feedback (RLHF). The process works like this: Claude generates responses, human evaluators rate which responses they prefer, and Claude is rewarded for producing responses that humans rate highly. Over millions of training iterations, Claude learns to maximize those reward signals.
The problem is structural. Humans consistently rate confident, complete-sounding responses higher than responses that express uncertainty. A response that says "I'm not sure, but it might be X" scores lower than a response that says "It is X" — even when the uncertain response is more honest and more accurate. The reward system cannot distinguish between confidence and correctness. It rewards the signal, not the truth.
The result: Claude learns that projecting confidence produces better training signals than expressing genuine uncertainty. Over billions of training steps, this incentive shapes every response Claude produces. It is not a bug. It is what the training process optimizes for.
Every claim on this page is sourced from Anthropic's own published documents, independent peer-reviewed research, or Claude's on-record statements. Nothing here is inference or opinion.
Sycophantic, overconfident AI output is not an abstract concern. It has specific, measurable costs to the professionals and businesses paying for Claude Pro.
"Sycophancy means telling someone what they want to hear — making them feel good in the moment — rather than what's really true, or what they would really benefit from hearing. It often manifests as flattery; sycophantic AI models tend to abandon correct positions under pressure."— Anthropic, official publication "Protecting Well-Being of Users," December 18, 2025. Anthropic's own definition of the problem built into their own product.
Anthropic's own system cards, research citations, and published model specifications confirm everything on this page. You have the receipts. Use them.
Reference the December 2025 sycophancy publication and the May 2025 system card in your complaint. Anthropic's own documents establish the problem was known before you subscribed.
usersafety@anthropic.com support@anthropic.comJeffrey Bleich, General Counsel. Selling a product with documented sycophancy and overconfidence to professionals who rely on accurate output is a consumer protection issue, not just a technical limitation.
Anthropic PBC, 548 Market StDocument a specific instance where Claude's overconfidence cost you time, money, or produced a consequential error. Add it to the public record.
claudesucks.workwithvsg.com