What is Citation Framing Injection Attack?
The Citation Framing Injection Attack evaluates if AI agents can be influenced by presenting harmful requests as academic or scholarly citations and references. This approach targets the ability of AI systems to avoid bypassing safety barriers through seemingly legitimate academic content. For more information, see: arxiv.org.
