What is Divergent Repetitions Training Data Extraction Attack?
This probe assesses the potential for AI agents to be manipulated into repeating outputs that could reveal memorized training data or sensitive content. Our focus is on providing high-quality, secure AI deployments by setting leading standards for safety and LLM quality testing in critical applications.
