905: Why RAG Makes LLMs Less Safe (And How to Fix It), with Bloomberg’s Dr. Sebastian Gehrmann

905: Why RAG Makes LLMs Less Safe (And How to Fix It), with Bloomberg’s Dr. Sebastian Gehrmann

Author: Jon Krohn July 15, 2025 Duration: 57:49
RAG LLMs are not safer: Sebastian Gehrmann speaks to Jon Krohn about his latest research into how retrieval-augmented generation (RAG) actually makes LLMs less safe, the three ‘H’s for gauging the effectivity and value of a RAG, and the custom guardrails and procedures we need to use to ensure our RAG is fit-for-purpose and secure. This is a great episode for anyone who wants to know how to work with RAG in the context of LLMs, as you’ll hear how to select the best model for purpose, useful approaches and taxonomies to keep your projects secure, and which models he finds safest when RAG is applied. Additional materials: ⁠⁠⁠⁠⁠⁠www.superdatascience.com/905⁠⁠ This episode is brought to you⁠ by, ⁠⁠⁠Adverity, the conversational analytics platform⁠⁠⁠ and by the ⁠⁠⁠Dell AI Factory with NVIDIA⁠⁠⁠. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (03:28) Findings from the paper “RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models” (09:35) What attack surfaces are in the context of AI (38:51) Small versus large models with RAG (46:27) How to select an LLM with safety in mind

Hosted by Dr. Jon Krohn, Super Data Science: ML & AI Podcast with Jon Krohn is a deep and accessible exploration of how artificial intelligence and machine learning are reshaping our world. Each episode features conversations with leading researchers, engineers, and entrepreneurs from both academia and industry, breaking down complex ideas into something tangible and relevant. You'll hear firsthand about emerging techniques, practical applications, and the evolving landscape of data-driven careers. The sheer volume of data in our world is growing at a staggering rate, and this podcast serves as a guide to understanding that expansion and finding your place within it. Rather than offering abstract theory, these discussions focus on real-world impact, from cutting-edge algorithms to the human stories behind major breakthroughs. Tune in for a thoughtful, nuanced look at the tools and trends that are defining the future, all through the lens of experts who are building that future every day. Whether you're actively working in the field or simply curious about the forces driving technological change, this podcast provides a consistent source of insight and inspiration, demystifying the science that is quietly transforming every aspect of our lives.
Author: Language: English Episodes: 100

Super Data Science: ML & AI Podcast with Jon Krohn
Podcast Episodes
936: LLMs Are Delighted to Help Phishing Scams [not-audio_url] [/not-audio_url]

Duration: 5:07
How much power – and risk – do we carry around with us in our pockets? A Reuters investigation about how easily LLMs can be utilized for online phishing scams is the subject of this week’s Five-Minute Friday with Jon Kro…
934: Is AI Replacing Junior Workers? [not-audio_url] [/not-audio_url]

Duration: 6:55
With the number of jobs dramatically slowing in the last year, many question if this decline is down to companies turning to AI for completing entry-level tasks in particular. Research published earlier this month by Yal…
932: Should You Build or Buy Your AI Solution? With Larissa Schneider [not-audio_url] [/not-audio_url]

Duration: 29:10
Larissa Schneider speaks to Jon Krohn in this Feature Friday about finding the right time to invest in AI solutions, and when it’s better to build them yourself. She discusses her work leading global strategy and operati…
930: In Case You Missed It in September 2025 [not-audio_url] [/not-audio_url]

Duration: 37:25
Jon Krohn’s highlights from this month of interviews focus on ways to future-proof your career, looking at the hardware that will get you the most mileage, the emerging roles that are well worth a look, and the developme…
928: The “Lethal Trifecta”: Can AI Agents Ever Be Safe? [not-audio_url] [/not-audio_url]

Duration: 5:55
Prompt injections, malicious code, and AI agents: In this week’s Five-Minute Friday, Jon Krohn looks into the current security weaknesses found in AI systems. A structural vulnerability that The Economist dubs a “lethal…
927: Automating Code Review with AI, feat. CodeRabbit’s David Loker [not-audio_url] [/not-audio_url]

Duration: 1:19:18
Earlier this year, David Loker joined CodeRabbit as their Director of AI. As more people come to write code with the help of large language models, David believes CodeRabbit will become a helpful assistant for code revie…