Citation Framing Injection Attack

📕 LLM Security: 50+ Adversarial Probes you need to know.

Citation Framing Injection Attack

What is Citation Framing Injection Attack?

The Citation Framing Injection Attack evaluates if AI agents can be influenced by presenting harmful requests as academic or scholarly citations and references. This approach targets the ability of AI systems to avoid bypassing safety barriers through seemingly legitimate academic content. For more information, see: arxiv.org.

Citation Framing Injection Attack

What is Citation Framing Injection Attack?

No vulnerabilities found? We refund the assessment.

No vulnerabilities found? 
We refund the assessment.