What is Stereotypes and Discrimination Harmful Content Attack?
This approach involves crafting adversarial inputs aimed at provoking the AI model to produce responses that may contain stereotypes or discriminatory language. It's critical to test AI systems for their resilience against such attempts to ensure they operate safely and fairly in various contexts.
