Sycophancy Hallucination Attack

What is Sycophancy Hallucination Attack?

This probe evaluates if an AI agent shows sycophantic tendencies by delivering inconsistent or contradictory responses that vary according to the user's question framing.

Stay updated with
the Giskard Newsletter