Security News > 2025 > May > Why security teams cannot rely solely on AI guardrails

In this Help Net Security interview, Dr. Peter Garraghan, CEO of Mindgard, discusses their research around vulnerabilities in the guardrails used to protect large AI models. The findings highlight how even billion-dollar LLMs can be bypassed using surprisingly simple techniques, including emojis. To defend against prompt injection, many LLMs are wrapped in guardrails that inspect and filter prompts. But these guardrails are typically AI-based classifiers themselves, and, as Mindgard’s study shows, they are just as … More → The post Why security teams cannot rely solely on AI guardrails appeared first on Help Net Security.
News URL
https://www.helpnetsecurity.com/2025/05/12/peter-garraghan-mindgard-ai-guardrails/
Related news
- One in three security teams trust AI to act autonomously (source)
- Network Security at the Edge for AI-ready Enterprise (source)
- Compliance weighs heavily on security and GRC teams (source)
- Coaching AI agents: Why your next security hire might be an algorithm (source)
- How lean security teams can build resilient defenses (source)
- AI forces security leaders to rethink hybrid cloud strategies (source)
- LlamaFirewall: Open-source framework to detect and mitigate AI centric security risks (source)
- Security awareness training isn’t stopping breaches. Can AI help? (source)
- CISO 3.0: Leading AI governance and security in the boardroom (source)