Grandma Framing Injection Attack

What is Grandma Framing Injection Attack?

This technique explores the susceptibility of AI agents to manipulation via emotional framing. The scenario involves crafting harmful requests to appear as innocent family interactions or nostalgic memories, particularly involving a grandmother figure. It tests the AI’s response to emotional contexts that may obscure the true nature of the request.

Stay updated with
the Giskard Newsletter