All Knowledge

The Giskard hub

RealPerformance, A Dataset of Language Model Business Compliance Issues

Giskard launches RealPerformance to close the gap between the industry's focus on security and the often-overlooked problem of business compliance: the first systematic dataset of business performance failures in conversational AI, based on real-world testing across banks, insurers, and other industries.

View post
AI Safety Research - Phare Benchmark - Bias Evaluation - Self-Coherency

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

Our Phare benchmark reveals that leading LLMs reproduce stereotypes in stories despite recognising bias when asked directly. Analysis of 17 models shows the generation vs discrimination gap.

View post

LLM Observability vs LLM Evaluation: Building Comprehensive Enterprise AI Testing Strategies

Enterprise AI teams often treat observability and evaluation as competing priorities, leading to gaps in either technical monitoring or quality assurance.

View post

Real-Time Guardrails vs Batch LLM Evaluations: A Comprehensive AI Testing Strategy

Enterprise AI teams need both immediate protection and deep quality insights but often treat guardrails and batch evaluations as competing priorities.

View post
Understanding Hallucination and Misinformation in LLMs

A Practical Guide to LLM Hallucinations and Misinformation Detection

Explore how false content is generated by AI and why it's critical to understand LLM vulnerabilities for safer, more ethical AI use.

View post
Illustration of AI vulnerabilities and risk mitigation in Large Language Models (LLMs) for secure and responsible deployment.

A Practical Guide on AI Security and LLM Vulnerabilities

Discover the key vulnerabilities in Large Language Models (LLMs) and learn how to mitigate AI risks with clear overviews and practical examples. Stay ahead in safe and responsible AI deployment.

View post
Phare LLM Benchmark - an analysis of hallucination in leading LLMs

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

We're sharing the first results from Phare, our multilingual benchmark for evaluating language models. The benchmark research reveals leading LLMs confidently produce factually inaccurate information. Our evaluation of top models from eight AI labs shows they generate authoritative-sounding responses containing completely fabricated details, particularly when handling misinformation.

View post
Secure AI Agents: Exhaustive testing with continuous LLM Red Teaming

Secure AI Agents: Exhaustive testing with continuous LLM Red Teaming

Testing AI agents presents significant challenges as vulnerabilities continuously emerge, exposing organizations to reputational and financial risks when systems fail in production. Giskard's LLM Evaluation Hub addresses these challenges through adversarial LLM agents that automate exhaustive testing, annotation tools that integrate domain expertise, and continuous red teaming that adapts to evolving threats.

View post
Increasing trust in foundation language models through multi-lingual security, safety and robustness testing

Giskard announces Phare, a new open & multi-lingual LLM Benchmark

During the Paris AI Summit, Giskard launches Phare, a new open & independent LLM benchmark to evaluate key AI security dimensions including hallucination, factual accuracy, bias, and potential for harm across several languages, with Google DeepMind as research partner. This initiative is meant to provide open measurements to assess the trustworthiness of Generative AI models in real applications.

View post
DeepSeek R1 analysis

DeepSeek R1: Complete analysis of capabilities and limitations

In this article, we provide a detailed analysis of DeepSeek R1, comparing its performance against leading AI models like GPT-4o and O1. Our testing reveals both impressive knowledge capabilities and significant concerns, particularly regarding the model's tendency to generate hallucinations. Through concrete examples, we examine how R1 handles politically sensitive topics.

View post
Giskard integrates with LiteLLM to simplify LLM agent testing

[Release notes] Giskard integrates with LiteLLM: Simplifying LLM agent testing across foundation models

Giskard's integration with LiteLLM enables developers to test their LLM agents across multiple foundation models. The integration enhances Giskard's core features - LLM Scan for vulnerability assessment and RAGET for RAG evaluation - by allowing them to work with any supported LLM provider: whether you're using major cloud providers like OpenAI and Anthropic, local deployments through Ollama, or open-source models like Mistral.
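As an illustration of how this might look in practice, here is a minimal sketch that points Giskard's LLM-based detectors at different LiteLLM-supported backends; the `giskard.llm.set_llm_model` helper and the model identifiers are assumptions based on common usage and may differ between versions.

```python
# Minimal sketch (assumption, not an official example): selecting the LLM
# backend that Giskard's detectors will call through LiteLLM.
import os
import giskard

# Cloud provider (e.g. OpenAI); requires the matching API key.
os.environ["OPENAI_API_KEY"] = "sk-..."            # placeholder key
giskard.llm.set_llm_model("gpt-4o")

# Or a local model served through Ollama (provider-prefixed identifier):
# giskard.llm.set_llm_model("ollama/llama3")

# Or an open-weights model behind the Mistral API:
# giskard.llm.set_llm_model("mistral/mistral-large-latest")
```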

View post
EU's AI liability directives

AI Liability in the EU: Business guide to Product (PLD) and AI Liability Directives (AILD)

The EU is establishing an AI liability framework through two key regulations: the Product Liability Directive (PLD), taking effect in 2024, and the proposed AI Liability Directive (AILD). The PLD introduces strict liability for defective AI systems and software, while the AILD addresses negligent use, though its final form remains under debate. Learn in this article the key points of these regulations and how they will impact businesses.

View post
Giskard-vision: Evaluate Computer Vision tasks

Giskard Vision: Enhance Computer Vision models for image classification, object and landmark detection

Giskard Vision is a new module in our open-source library designed to assess and improve computer vision models. It offers automated detection of performance issues, biases, and ethical concerns in image classification, object detection, and landmark detection tasks. The article provides a step-by-step guide on how to integrate Giskard Vision into existing workflows, enabling data scientists to enhance the reliability and fairness of their computer vision systems.

View post
Giskard integrates with NVIDIA NeMo

Evaluating LLM applications: Giskard Integration with NVIDIA NeMo Guardrails

Giskard has integrated with NVIDIA NeMo Guardrails to enhance the safety and reliability of LLM-based applications. This integration allows developers to better detect vulnerabilities, automate rail generation, and streamline risk mitigation in LLM systems. By combining Giskard with NeMo Guardrails, organizations can address critical challenges in LLM development, including hallucinations, prompt injection and jailbreaks.

View post
Council of Europe - AI Treaty

Global AI Treaty: EU, UK, US, and Israel sign landmark AI regulation

The Council of Europe has signed the world's first AI treaty marking a significant step towards global AI governance. This Framework Convention on Artificial Intelligence aligns closely with the EU AI Act, adopting a risk-based approach to protect human rights and foster innovation. The treaty impacts businesses by establishing requirements for trustworthy AI, mandating transparency, and emphasizing risk management and compliance.

View post
EU AI Act published in the EU Official Journal

The EU AI Act published in the EU Official Journal: Next steps for AI Regulation

The EU AI Act, published on July 12, 2024, establishes the world's first comprehensive regulatory framework for AI technologies, with a gradual implementation timeline from 2024 to 2027. It adopts a risk-based approach, imposing varying requirements on AI systems based on their risk level.

View post
ArGiMi Consortium

Giskard leads GenAI Evaluation in France 2030's ArGiMi Consortium

The ArGiMi consortium, including Giskard, Artefact and Mistral AI, has won a France 2030 project to develop next-generation French LLMs for businesses. Giskard will lead efforts in AI safety, ensuring model quality, conformity, and security. The project will be open-source, fostering collaboration and aiming to make AI more reliable, ethical, and accessible across industries.

View post
Giskard + Databricks integration

Partnership announcement: Bringing Giskard LLM evaluation to Databricks

Giskard has integrated with Databricks MLflow to enhance LLM testing and deployment. This collaboration allows AI teams to automatically identify vulnerabilities, generate domain-specific tests, and log comprehensive reports directly into MLflow. The integration aims to streamline the development of secure, reliable, and compliant LLM applications, addressing key risks like prompt injection, hallucinations, and unintended data disclosures.
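For orientation only, here is a rough sketch of what logging a Giskard evaluation to MLflow could look like, assuming the Giskard evaluator plugin is invoked through MLflow's `mlflow.evaluate` entry point via `evaluators="giskard"`; the model URI, data, and column names are placeholders, and the exact arguments may vary by version.

```python
# Rough sketch (assumes `pip install giskard mlflow`); all names are placeholders.
import mlflow
import pandas as pd

eval_df = pd.DataFrame({"question": ["What is the refund policy?"]})  # placeholder data

with mlflow.start_run(run_name="giskard-llm-evaluation"):
    mlflow.evaluate(
        model="models:/my-llm-app/1",   # placeholder registered-model URI
        model_type="text",
        data=eval_df,
        evaluators="giskard",           # hands the evaluation over to Giskard's scan
    )
```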

View post
Differences between MLOps and LLMOps

LLMOps: MLOps for Large Language Models

This article explores LLMOps, detailing its challenges and best practices for managing Large Language Models (LLMs) in production. It compares LLMOps with traditional MLOps, covering hardware needs, performance metrics, and handling non-deterministic outputs. The guide outlines steps for deploying LLMs, including model selection, fine-tuning, and continuous monitoring, while emphasizing quality and security management.

View post
LLM jailbreaking

Defending LLMs against Jailbreaking: Definition, examples and prevention

Jailbreaking refers to maliciously manipulating Large Language Models (LLMs) to bypass their ethical constraints and produce unauthorized outputs. This emerging threat arises from combining the models' high adaptability with inherent vulnerabilities that attackers can exploit through techniques like prompt injection. Mitigating jailbreaking risks requires a holistic approach involving robust security measures, adversarial testing, red teaming, and ongoing vigilance to safeguard the integrity and reliability of AI systems.

View post
Data poisoning attacks

Data Poisoning attacks on Enterprise LLM applications: AI risks, detection, and prevention

Data poisoning is a real threat to enterprise AI systems like Large Language Models (LLMs), where malicious data tampering can skew outputs and decision-making processes unnoticed. This article explores the mechanics of data poisoning attacks, real-world examples across industries, and best practices to mitigate risks through red teaming and automated evaluation tools.

View post
Giskard LLM scan multi-model

[Release notes] LLM app vulnerability scanner for Mistral, OpenAI, Ollama, and Custom Local LLMs

Releasing an upgraded version of Giskard's LLM scan for comprehensive vulnerability assessments of LLM applications. New features include more accurate detectors through optimized prompts and expanded multi-model compatibility supporting OpenAI, Mistral, Ollama, and custom local LLMs. This article also covers an initial setup guide for evaluating LLM apps.
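To give a feel for the workflow, here is a minimal sketch of wrapping an LLM app and running the scan; the wrapper arguments follow Giskard's `Model` and `scan` API as we understand it, and the prediction function is a placeholder you would replace with your own application.

```python
# Minimal sketch of an LLM scan run; identifiers and prompts are illustrative.
import giskard
import pandas as pd

def my_llm_app(question: str) -> str:
    # Placeholder: call your own chain or API here.
    return "This is where your model's answer would go."

def predict(df: pd.DataFrame):
    return [my_llm_app(q) for q in df["question"]]

model = giskard.Model(
    model=predict,
    model_type="text_generation",
    name="Support assistant",
    description="Answers customer questions about product documentation.",
    feature_names=["question"],
)

report = giskard.scan(model)          # probes for injection, hallucination, etc.
report.to_html("scan_report.html")    # shareable vulnerability report
```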

View post
Why LLM evaluation is important

Guide to LLM evaluation and its critical impact for businesses

As businesses increasingly integrate LLMs into their applications, ensuring the reliability of AI systems is key. LLMs can generate biased, inaccurate, or even harmful outputs if not properly evaluated. This article explains the importance of LLM evaluation and how to do it (methods and tools). It also presents Giskard's comprehensive solutions for evaluating LLMs, combining automated testing, customizable test cases, and human-in-the-loop review.

View post
Red Teaming LLM Applications course

New course with DeepLearningAI: Red Teaming LLM Applications

Our new course, created in collaboration with the DeepLearningAI team, provides training on red teaming techniques for Large Language Model (LLM) and chatbot applications. Through hands-on attacks using prompt injections, you'll learn how to identify vulnerabilities and security failures in LLM systems.

View post
Giskard's LLM Red Teaming

LLM Red Teaming: Detect safety & security breaches in your LLM apps

Introducing our LLM Red Teaming service, designed to enhance the safety and security of your LLM applications. Discover how our team of ML Researchers uses red teaming techniques to identify and address LLM vulnerabilities. Our new service focuses on mitigating risks like misinformation and data leaks by developing comprehensive threat models.

View post
Classification of AI systems under the EU AI Act

EU AI ACT: 8 Takeaways from the Council's Final Approval

The Council of the EU has recently voted unanimously on the final version of the European AI Act. It’s a significant step forward in its efforts to legislate the first AI law in the world. The Act establishes a regulatory framework for the safe use and development of AI, categorizing AI systems according to their associated risk. In the coming months, the text will enter the last stage of the legislative process, where the European Parliament will have a final vote on the AI Act.

View post
Giskard 2023 retrospective

Giskard's retrospective of 2023 and a glimpse into what's next for 2024!

Our 2023 retrospective covers people, company, customers, and product news, and offers a glimpse into what's next for 2024. Our team keeps growing, with new offices in Paris, new customers, and new product features. Our GitHub repo has nearly reached 2,500 stars, and we were Product of the Day on Product Hunt. All this and more in our 2023 review.

View post

EU AI Act: The EU Strikes a Historic Agreement to Regulate AI

The EU's AI Act establishes rules for AI use and development, focusing on ethical standards and safety. It categorizes AI systems, highlights high-risk uses, and sets compliance requirements. This legislation, a first in global AI governance, signals a shift towards responsible AI innovation in Europe.

View post
Biden’s Executive Order to Regulate AI

Biden's Executive Order: The Push to Regulate AI in the US

One year after the launch of ChatGPT, regulators worldwide are still figuring out how to regulate Generative AI. The EU is going through intense debates on how to finalize the so-called 'EU AI Act' after two years of legislative process. At the same time, only one month ago, the White House surprised everyone with a landmark Executive Order to regulate AI in the US. In this article, I delve into the Executive Order and advance some ideas on how it can impact the whole AI regulatory landscape.

View post
Giskard’s LLM Testing solution is launching on Product Hunt

Our LLM Testing solution is launching on Product Hunt 🚀

We have just launched Giskard v2, extending the testing capabilities of our library and Hub to Large Language Models. Support our launch on Product Hunt and explore our new integrations with Hugging Face, Weights & Biases, MLFlow, and Dagshub. A big thank you to our community for helping us reach over 1900 stars on GitHub.

View post

Towards AI Regulation: How Countries are Shaping the Future of Artificial Intelligence

In this article, we present the challenges and approaches to AI regulation in major jurisdictions such as the European Union, the United States, China, Canada, and the UK. Explore the growing impact of AI on society and how AI quality tools like Giskard ensure reliable models and compliance.

View post
AI Safety and Security: Insights from Giskard's CPO - Interview with Jean-Marie John-Mathews

AI Safety and Security: A Conversation with Giskard's Co-Founder and CPO

Giskard's Co-Founder and CPO, Jean-Marie John-Mathews, was recently interviewed by Safety Detectives, where he shared insights into the company's mission to advance AI Safety and Quality. In this interview, Jean-Marie explains the strategies, vulnerabilities, and ethical considerations at the forefront of AI technology, as Giskard bridges the gap between AI models and real-world applications.

View post
Giskard team at DEFCON31

AI Safety at DEFCON 31: Red Teaming for Large Language Models (LLMs)

DEFCON, one of the world's premier hacker conventions, this year saw a unique focus at the AI Village: red teaming of Large Language Models (LLMs). Instead of conventional hacking, participants were challenged to use words to uncover AI vulnerabilities. The Giskard team was fortunate to attend, witnessing firsthand the event's emphasis on understanding and addressing potential AI risks.

View post
OWASP Top 10 for LLM 2023

OWASP Top 10 for LLM 2023: Understanding the Risks of Large Language Models

In this post, we introduce OWASP's first version of the Top 10 for LLM, which identifies critical security risks in modern LLM systems. It covers vulnerabilities like Prompt Injection, Insecure Output Handling, Model Denial of Service, and more. Each vulnerability is explained with examples, prevention tips, attack scenarios, and references. The document serves as a valuable guide for developers and security practitioners to protect LLM-based applications and data from potential attacks.

View post
White House pledge targets AI regulation

White House pledge targets AI regulation with Top Tech companies

In a significant move towards AI regulation, President Biden convened a meeting with top tech companies, leading to a White House pledge that emphasizes AI safety and transparency. Companies like Google, Amazon, and OpenAI have committed to pre-release system testing, data transparency, and AI-generated content identification. As tech giants signal their intent, concerns remain regarding the specificity of their commitments.

View post
LLM Scan: Advanced LLM vulnerability detection

1,000 GitHub stars, 3M€, and new LLM scan feature 💫

We've reached an impressive milestone of 1,000 GitHub stars and received strategic funding of 3M€ from the French Public Investment Bank and the European Commission. With this funding, we plan to enhance the Giskard platform, helping companies meet upcoming AI regulations and standards. Moreover, we've upgraded our LLM scan feature to detect even more hidden vulnerabilities.

View post
Clément Delangue representing Hugging Face at the US Congress!

The Open-Source AI Imperative: Key Takeaways from Hugging Face CEO's Testimony to the US Congress

Explore key insights from Clément Delangue's testimony to the US Congress on Open-Science and Open-Source AI. Understand the importance of Open-Source & Open-Science to democratize AI technology and promote ethical AI development that benefits all.

View post
Scan your AI model to find vulnerabilities

Giskard’s new beta is out! ⭐ Scan your model to detect hidden vulnerabilities

Giskard's new beta release enables you to quickly scan your AI model and detect vulnerabilities directly in your notebook. The new beta also includes a simple one-line installation, automated test suite generation and execution, an improved user experience for collaboration on testing dashboards, and a ready-made test catalog.
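As a rough illustration (not taken from the release itself), scanning a tabular classifier in a notebook could look like the sketch below; the dataset, feature names, and target column are invented placeholders, and parameter names may vary between Giskard versions.

```python
# Illustrative sketch: scanning a tabular classifier from a notebook.
import giskard
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

df = pd.read_csv("loans.csv")                           # placeholder data
features, target = ["age", "income"], "default"
clf = RandomForestClassifier().fit(df[features], df[target])

wrapped_model = giskard.Model(
    model=lambda d: clf.predict_proba(d[features]),
    model_type="classification",
    classification_labels=[0, 1],
    feature_names=features,
)
wrapped_data = giskard.Dataset(df, target=target)

results = giskard.scan(wrapped_model, wrapped_data)
results.to_html("tabular_scan.html")                    # or display(results) in a notebook
```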

View post
Demystifying the EU AI Act news

The EU AI Act: What can you expect from the upcoming European regulation of AI?

In light of the widespread and rapid adoption of ChatGPT and other Generative AI models, which have brought new risks, the EU Parliament has accelerated its agenda on AI. The vote that took place on May 11, 2023 represents a significant milestone in the path toward the adoption of the first comprehensive AI regulation.

View post
Giskard interview for BFM Business' FocusPME

Exclusive Interview: How to eliminate risks of AI incidents in production

During this exclusive interview for BFM Business, Alex Combessie, our CEO and co-founder, spoke about the potential risks of AI for companies and society. As new AI technologies like ChatGPT emerge, concerns about the dangers of untested models have increased. Alex stresses the importance of Responsible AI, which involves identifying ethical biases and preventing errors. He also discusses the future of EU regulations and their potential impact on businesses.

View post
SafeGPT - The safest way to use ChatGPT and other LLMs

🔥 The safest way to use ChatGPT... and other LLMs

With Giskard’s SafeGPT, you can say goodbye to errors, biases & privacy issues in LLMs. Its features include an easy-to-use browser extension and monitoring dashboard (for ChatGPT users), and a ready-made, extensible quality assurance platform for debugging any LLM (for LLM developers).

View post
Giskard's turtle slicing some veggies!

Giskard 1.4 is out! What's new in this version? ⭐

With Giskard’s new Slice feature, we introduce the ability to identify business areas in which your AI models underperform. This will make it easier to debug performance biases or identify spurious correlations. We have also added an export/import feature to share your projects, as well as other minor improvements.

View post
Gartner Research

Giskard mentioned as a significant vendor in Gartner's Market Guide for AI Trust, Risk and Security Management

AI poses new trust, risk and security management requirements that conventional controls do not address. This Market Guide defines new capabilities that data and analytics leaders must have to ensure model reliability, trustworthiness and security, and presents representative vendors who implement these functions.

View post
Giskard at FOSDEM 2023

FOSDEM 2023: Presentation on CI/CD for ML and How to test ML models?

In this talk, we explain why testing ML models is an important and difficult problem. Then we explain, using concrete examples, how Giskard helps ML Engineers deploy their AI systems into production safely by (1) designing fairness & robustness tests and (2) integrating them in a CI/CD pipeline.
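As a toy example of the kind of robustness test discussed in the talk (not code from the presentation), a pytest check like the following can run in any CI pipeline; the `predict` function is a stand-in for your real model.

```python
# Toy robustness test that can run under pytest in a CI pipeline.
import pytest

def predict(text: str) -> str:
    # Placeholder model: replace with your actual prediction call.
    return "positive" if "great" in text.lower() else "negative"

@pytest.mark.parametrize("text", ["This product is great!", "Terrible experience."])
def test_prediction_is_case_invariant(text):
    # A robust classifier should not flip its label when only the casing changes.
    assert predict(text) == predict(text.upper())
```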

View post
Our first interview on BFM TV Tech & Co

Exclusive interview: our first television appearance on AI risks & security

In this interview, Jean-Marie John-Mathews, co-founder of Giskard, discusses the ethical & security concerns of AI. While AI is not new, recent developments like ChatGPT bring a leap in performance that requires rethinking how AI is built. We discuss the fears and fantasies surrounding AI, and how it can introduce biases and create industrial incidents. Jean-Marie suggests that protecting AI relies on tests and safeguards to ensure responsible AI.

View post
Giskard's co-founders: Andrei Avtomonov (left), Jean-Marie John-Mathews (center), Alex Combessie (right)

Giskard closes its first financing round to expand Enterprise offering

The funding led by Elaia, with participation from Bessemer Venture Partners and notable angel investors, will accelerate the development of an enterprise-ready platform to help companies test, audit & ensure the quality of AI models.

View post
Python enjoying cups of Java coffee beans - generated by OpenAI DallE

Giskard is coming to your notebook: Python meets Java via gRPC tunnel

With Giskard’s new External ML Worker feature, we introduce a gRPC tunnel to reverse the client-server communication so that data scientists can re-use an existing Python code environment for model execution by Giskard.

View post
Dollar Planets Generated by OpenAI DALL·E

Why do Citibeats & Altaroad Test AI Models? The Business Value of Test-Driven Data Science

Why do great Data Scientists & ML Engineers love writing tests? Two customer case studies on improving model robustness and ensuring AI Ethics.

View post
Synthwave astronauts polishing the hull of a giant marine turtle in space - Generated by OpenAI DallE

Does User Experience Matter to ML Engineers? Giskard Latest Release

What are the preferences of ML Engineers in terms of UX? A summary of key learnings, and how we implemented them in Giskard's latest release.

View post
Sea Turtle

Why & how we decided to change Giskard's identity

We explain why Giskard changed its value proposition, and how we translated it to a new visual identity

View post
Happy ML Tester

Giskard's new feature: Automated Machine Learning Testing

The Open Beta of Giskard's AI Test feature: an automated way to test your ML models and ensure performance, robustness, and ethics

View post
A billion stars

Who cares about AI Quality? Launching our AI Innovator community

The Giskard team explains the undergoing shift toward AI Quality, and how we launched the first community for AI Quality Innovators

View post
Presentation bias

Where do biases in ML come from? #7 📚 Presentation

We explain presentation bias, a negative effect present in almost all ML systems with User Interfaces (UI)

View post
A shift

Where do biases in ML come from? #6 🐝 Emergent bias

Emergent biases result from the use of AI / ML across unanticipated contexts. They introduce risk when the context shifts.

View post
Happy new year 2022

Wishing y’all a happy & healthy 2022! 🎊

The Giskard team wishes you a happy 2022! Here is a summary of what we accomplished in 2021.

View post
Raised hands

Where do biases in ML come from? #5 🗼 Structural bias

Social, political, economic, and post-colonial asymmetries introduce risk to AI / ML development

View post
Orange picking

Where do biases in ML come from? #4 📊 Selection

Selection bias happens when your data is not representative of the situation to analyze, introducing risk to AI / ML systems

View post
Ruler to measure

Where do biases in ML come from? #3 📏 Measurement

Machine Learning systems are particularly sensitive to measurement bias. Calibrate your AI / ML models to avoid that risk.

View post
Variables crossing

Where do biases in ML come from? #2 ❌ Exclusion

What happens when your AI / ML model is missing important variables? The risks of endogenous and exogenous exclusion bias.

View post
Searching for bias in ML

Where do biases in ML come from? #1 👉 Introduction

Research Literature review: A Survey on Bias and Fairness in Machine Learning

View post
Trust in AI systems

8 reasons why you need Quality Testing for AI

Understand why Quality Assurance for AI is the need of the hour. Gain competitive advantage from your technological investments in ML systems.

View post
Research literature

What does research tell us about the future of AI Quality? 💡

We look into the latest research to understand the future of AI / ML Testing

View post
Quality Monitoring Dashboard

How did the idea of Giskard emerge? #8 👁‍🗨 Monitoring

Monitoring is just a tool: necessary but not sufficient. You need people committed to AI maintenance, processes & tools in case things break down.

View post
Frances Haugen testifying at the US Senate

How did the idea of Giskard emerge? #7 👮‍♀️ Regulation

Biases in AI / ML algorithms are avoidable. Regulation will push companies to invest in mitigation strategies.

View post
Giskard founders: Alex and Jean-Marie

How did the idea of Giskard emerge? #6 👬 A Founders' story

Find out more about the Giskard founders' story

View post
AI Incident Database

How did the idea of Giskard emerge? #5 📉 Reducing risks

Technological innovation such as AI / ML comes with risks. Giskard aims to reduce them.

View post
Five star quality standards

How did the idea of Giskard emerge? #4 ✅ Standards

Giskard supports quality standards for AI / ML models. Now is the time to adopt them!

View post
Recommender System

How did the idea of Giskard emerge? #3 📰 AI in the media

AI used in recommender systems is posing a serious issue for the media industry and our society

View post
User interfaces - counting sheeps

How did the idea of Giskard emerge? #2 🐑 User Interfaces

It is difficult to create interfaces to AI models; even AIs made by tech giants have bugs. With Giskard AI, we want to make it easy to create interfaces for humans to inspect AI models. 🕵️ Do you think interfaces are valuable? If so, what kinds of interfaces do you like?

View post
Running tests

How did the idea of Giskard emerge? #1 🤓 The ML Test Score

The ML Test Score includes verification tests across 4 categories: Features and Data, Model Development, Infrastructure, and Monitoring Tests

View post