Weixuan Xiao

Claude Mythos: Analyzing Anthropic’s new frontier model for AI security
Blog

Claude Mythos: Analyzing Anthropic’s new frontier model for AI security

In this article, we analyze Anthropic's newly announced Claude Mythos model and its announced capabilities in automated vulnerability discovery and exploit generation. We explore how this frontier model impacts the cybersecurity landscape.

Weixuan Xiao - Machine Learning Engineer at Giskard
Weixuan Xiao
View post
OpenClaw security issues include data leakage & prompt injection
Blog

OpenClaw security vulnerabilities include data leakage and prompt injection risks

This article explores the critical security failures of the OpenClaw agentic AI, which allowed sensitive data to leak across user sessions and IM channels. It examines how architectural weaknesses in the Control UI and session management created direct paths for prompt injection and unauthorized tool use. Finally, it outlines the essential hardening steps and systematic red-teaming strategies required to transform a vulnerable "fun bot" into a secure enterprise assistant.

Weixuan Xiao - Machine Learning Engineer at Giskard
Weixuan Xiao
View post