DeepMind’s RAG System with Animesh Chatterji and Ivan Solovyev

DeepMind’s RAG System with Animesh Chatterji and Ivan Solovyev

Author: softwareengineeringdaily.com March 12, 2026 Duration: 40:57
Retrieval-augmented generation, or RAG, has become a foundational approach to building production AI systems. However, deploying RAG in practice can be complex and costly. Developers typically have to manage vector databases, chunking strategies, embedding models, and indexing infrastructure. Designing effective RAG systems is also a moving target, as techniques and best practices evolve in step with rapidly advancing language models. Google DeepMind recently released the File Search Tool, a fully managed RAG system built directly into the Gemini API. File Search abstracts away the retrieval pipeline, allowing developers to upload documents, code, and other text data, automatically generate embeddings, and query their knowledge base. We wanted to understand how the DeepMind team designed a general-purpose RAG system that maintains high retrieval quality. Animesh Chatterji is a Software Engineer at Google DeepMind and Ivan Solovyev is a Product Manager at DeepMind, and they worked on File Search Tool. They joined the podcast with Sean Falconer to discuss the evolution of RAG, why simplicity and pricing transparency matter, how embedding models have improved retrieval quality, the tradeoffs between configurability and ease of use, and what’s next for multimodal retrieval across text, images, and beyond. Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from AI to quantum computing. Currently, Sean is an AI Entrepreneur in Residence at Confluent where he works on AI strategy and thought leadership. You can connect with Sean on LinkedIn.   Please click here to see the transcript of this episode. Sponsorship inquiries: sponsor@softwareengineeringdaily.com

For anyone curious about how the code running our world actually gets built, Software Engineering Daily offers a clear and consistent look behind the curtain. This isn't about hype cycles or surface-level news; it's a deep, technical conversation with the engineers, architects, and thinkers who are shaping our digital infrastructure. Each episode focuses on a specific technology, practice, or problem, breaking down complex systems into understandable parts. You'll hear detailed discussions on everything from database architectures and programming language design to the organizational challenges of scaling teams and the real-world trade-offs made in production systems. Hosted by softwareengineeringdaily.com, the podcast serves as a reliable source for developers who want to stay informed and inspired, translating the rapid pace of technological change into substantive, lasting knowledge. It’s for professionals who believe that understanding the "how" and "why" is just as important as knowing the "what." By dedicating time to thorough exploration, this podcast provides context that shorter formats simply cannot, making it an essential resource for anyone building the future, one line of code at a time. Tune in to hear unfiltered insights from the people on the front lines, discussing the tools and decisions that define modern software engineering.
Author: Language: en-us Episodes: 100

Software Engineering Daily
Podcast Episodes
Hype and Reality of the AI Coding Shift [not-audio_url] [/not-audio_url]

Duration: 1:00:04
AI coding tools have gone from novelty to core infrastructure in under three years. Today, many devs use AI daily, a substantial share of new code is AI-generated, and expectations for automation are rapidly increasing.…
Unlocking the Data Layer for Agentic AI with Simba Khadder [not-audio_url] [/not-audio_url]

Duration: 49:04
AI agents are increasingly capable of reasoning and performing autonomous work over long periods. However, as agents take on more complex, longer-horizon tasks, keeping them supplied with the right information becomes th…
Agentic Mesh with Eric Broda [not-audio_url] [/not-audio_url]

Duration: 49:23
AI agents are evolving from individual productivity tools into distributed systems components inside enterprises. The next frontier is coming into focus, and it involves large-scale ecosystems of collaborating agents emb…
New Relic and Agentic DevOps with Nic Benders [not-audio_url] [/not-audio_url]

Duration: 48:18
Observability emerged from the need to understand complex software systems, and involves tracking metrics, logs, and traces so engineers can detect and diagnose problems before they affect users. However, modern applicat…
Mobile App Security with Ryan Lloyd09 [not-audio_url] [/not-audio_url]

Duration: 54:52
Mobile apps have become a primary interface for critical services, including banking, payments, and healthcare. Unlike web applications, much of the logic and intellectual property in a mobile app lives directly on the u…
FastMCP with Adam Azzam and Jeremiah Lowin [not-audio_url] [/not-audio_url]

Duration: 1:07:03
The Model Context Protocol, or MCP, gives developers a common way to expose tools, data, and capabilities to large language models, and it has quickly become an important standard in agentic AI. FastMCP is an open source…
SED News: OpenCode, AI Code vs. Shipped Code, and the LiteLLM Breach [not-audio_url] [/not-audio_url]

Duration: 58:42
SED News is a monthly podcast from Software Engineering Daily where hosts Gregor Vand and Sean Falconer unpack the biggest stories shaping software engineering, Silicon Valley, and the broader tech industry. In this epis…
FreeBSD with John Baldwin [not-audio_url] [/not-audio_url]

Duration: 1:03:50
FreeBSD is one of the longest-running and most influential open-source operating systems in the world. It was born from the Berkeley Software Distribution in the early 1990s, it has powered everything from high-performan…
Cilium, eBPF, and Modern Kubernetes Networking with Bill Mulligan [not-audio_url] [/not-audio_url]

Duration: 59:29
Modern cloud-native systems are built on highly dynamic, distributed infrastructure where containers spin up and down constantly, services communicate across clusters, and traditional networking assumptions break down. L…
Games That Push Back with Bennett Foddy [not-audio_url] [/not-audio_url]

Duration: 1:08:33
Bennett Foddy is a legendary game designer known for creating wholly distinctive games such as QWOP, Getting Over It with Bennett Foddy, and the recently released Baby Steps. He’s also a former professor at the NYU Game…