What is Sycophancy Hallucination Attack?
This probe evaluates if an AI agent shows sycophantic tendencies by delivering inconsistent or contradictory responses that vary according to the user's question framing.
This probe evaluates if an AI agent shows sycophantic tendencies by delivering inconsistent or contradictory responses that vary according to the user's question framing.