Resources

Risk assessment for LLMs and AI agents: OWASP, MITRE Atlas, and NIST AI RMF explained

There are three major tools for assessing risks associated with LLMs and AI Agents: OWASP, MITRE Attack and NIST AI RMF. Each of them has its own approach to risk and security, while examining it from different angles with varying levels of granularity and organisational scope. This blog will help you understand them.

Resources

Risk assessment for LLMs and AI agents: OWASP, MITRE Atlas, and NIST AI RMF explained

Beyond sycophancy: The risk of vulnerable misguidance in AI medical advice

Tree of attacks with pruning: The automated method for jailbreaking LLMs

AI phishing attack in Australia: 270,000 fake government emails expose AI security gap

OWASP Top 10 for LLM 2025: Understanding the Risks of Large Language Models

Anthropic claims Claude Code was used for the first Autonomous AI cyber espionage campaign

Understanding single-turn, multi-turn, and dynamic agentic attacks in AI red teaming

Are AI browsers safe? A security and vulnerability analysis of OpenAI Atlas

When your AI agent tells you what you want to hear: Understanding Sycophancy in LLMs

Best AI Red Team Tools 2025: A practical guide to features and functions

LLM business alignment: Detecting AI hallucinations and misaligned agentic behavior in business systems

Cross Session Leak: when your AI assistant becomes a data breach

Function calling in LLMs: Testing agent tool usage for AI Security

GOAT Automated Red Teaming: Multi-turn attack techniques to jailbreak LLMs

[Release notes]: New LLM vulnerability scanner for dynamic & multi-turn Red Teaming

How LLM jailbreaking can bypass AI security with multi-turn attacks

RealPerformance, A Dataset of Language Model Business Compliance Issues

RAG Benchmarking: Comparing RAGAS, BERTScore, and Giskard for AI Evaluation

LLM Observability vs LLM Evaluation: Building Comprehensive Enterprise AI Testing Strategies

Real-Time Guardrails vs Batch LLM Evaluations: A Comprehensive AI Testing Strategy

A Practical Guide to LLM Hallucinations and Misinformation Detection

A Practical Guide on AI Security and LLM Vulnerabilities

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

How to implement LLM as a Judge to test AI Agents? (Part 2)

How to implement LLM as a Judge to test AI Agents? (Part 1)

Secure AI Agents: Exhaustive testing with continuous LLM Red Teaming

Giskard announces Phare, a new open & multi-lingual LLM Benchmark

DeepSeek R1: Complete analysis of capabilities and limitations

[Release notes] Giskard integrates with LiteLLM: Simplifying LLM agent testing across foundation models

AI Liability in the EU: Business guide to Product (PLD) and AI Liability Directives (AILD)

Giskard Vision: Enhance Computer Vision models for image classification, object an landmark detection

Evaluating LLM applications: Giskard Integration with NVIDIA NeMo Guardrails

Global AI Treaty: EU, UK, US, and Israel sign landmark AI regulation

L'Oréal leverages Giskard for advanced Facial Landmark Detection

The EU AI Act published in the EU Official Journal: Next steps for AI Regulation

Partnership announcement: Bringing Giskard LLM evaluation to Databricks

LLMOps: MLOps for Large Language Models

Defending LLMs against Jailbreaking: Definition, examples and prevention

Giskard + Grafana for Data Drift Monitoring

Data Poisoning attacks on Enterprise LLM applications: AI risks, detection, and prevention

[Release notes] LLM app vulnerability scanner for Mistral, OpenAI, Ollama, and Custom Local LLMs

Guide to LLM evaluation and its critical impact for businesses

New course with DeepLearningAI: Red Teaming LLM Applications

LLM Red Teaming: Detect safety & security breaches in your LLM apps

Data Drift Monitoring with Giskard

EU AI ACT: 8 Takeaways from the Council's Final Approval

Giskard's retrospective of 2023 and a glimpse into what's next for 2024!

How to find the best Open-Source LLM for your Customer Service Chatbot

EU AI Act: The EU Strikes a Historic Agreement to Regulate AI

Biden's Executive Order: The Push to Regulate AI in the US

Our LLM Testing solution is launching on Product Hunt 🚀

Mastering ML Model Evaluation with Giskard: From Validation to CI/CD Integration

How to address Machine Learning Bias in a pre-trained HuggingFace text classification model?

Towards AI Regulation: How Countries are Shaping the Future of Artificial Intelligence

Guide to Model Evaluation: Eliminating Bias in Machine Learning Predictions

AI Safety and Security: A Conversation with Giskard's Co-Founder and CPO

Opening the Black Box: Using SHAP values to explain and enhance Machine Learning models

AI Safety at DEFCON 31: Red Teaming for Large Language Models (LLMs)

OWASP Top 10 for LLM 2023: Understanding the Risks of Large Language Models

1,000 GitHub stars, 3M€, and new LLM scan feature 💫

Testing Machine Learning Classification models for fraud detection

Giskard’s new beta is out! ⭐ Scan your model to detect hidden vulnerabilities

🔥 The safest way to use ChatGPT... and other LLMs

Giskard 1.4 is out! What's new in this version? ⭐

How to evaluate and load a PyTorch model with Giskard?

FOSDEM 2023: Presentation on CI/CD for ML and How to test ML models?

Giskard is coming to your notebook: Python meets Java via gRPC tunnel

How to deploy a robust HuggingFace model for sentiment analysis into production?

Why do Citibeats & Altaroad Test AI Models? The Business Value of Test-Driven Data Science

Does User Experience Matter to ML Engineers? Giskard Latest Release

Why & how we decided to change Giskard's identity

Giskard's new feature: Automated Machine Learning Testing

Who cares about AI Quality? Launching our AI Innovator community

How to test ML models? #3 📈 Numerical data drift

Why & how we decided to make Giskard Open-Source

How to test ML models #2 🧱 Categorical data drift

How to test ML models? #1 👉 Introduction

Where do biases in ML come from? #7 📚 Presentation

Where do biases in ML come from? #6 🐝 Emergent bias