Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

Author: Conviction May 1, 2026 Duration: 42:57
Baseten CEO and co-founder Tuhin Srivastava sits down with Sarah Guo and Elad Gil to discuss the rapid growth of AI inference demand, Baseten’s 30x growth, and why inference is becoming the strategic “last market.” Tuhin Srivastava argues the application layer will persist because companies with unique user signals can encode value into workflows and post-train specialized models, citing examples like Abridge and support workflows. The conversation covers GPU capacity constraints, Baseten’s multi-cloud fabric across 18 clouds and 90 clusters, long-term contracting dynamics, the importance of the software layer for stickiness, evolving workloads, multichip possibilities, and operational lessons at scale. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Tuhinone   Chapters: 00:31 Baseten growth 01:55 Why the app layer wins 05:57 Serving frontier customers 07:55 Open source model mix 09:21 Chinese models and geopolitics 13:07 Custom inference dominates 14:22 Post training acquisition 17:10 When to invest in custom models 18:35 Supply crunch and data centerse 22:25 Longer GPU Contracts 24:09 What Makes a Winner 26:07 Multi Chip Future 28:19 Runtime Roadmap 31:08 Scaling Edge Cases 33:48 Hiring and Leadership 36:44 Operations Pager Culture 38:19 Efficiency Drives Demand 40:41 Concierge Everything Future 42:34 Conclusion

Elad Gil and Sarah Guo guide conversations in No Priors: Artificial Intelligence | Technology | Startups that cut straight to the core of what's happening now. This isn't about abstract futures; it's grounded in dialogues with the very people building and shaping the field-leading AI engineers, pioneering researchers, and the founders turning theory into reality. Each episode tackles the pressing, often daunting questions that define this technological inflection point. You'll hear them explore the practical pathways and hurdles toward AGI, debate which industries are genuinely poised for transformation, and examine how the state-of-the-art in research translates into real-world products and societal shifts. The discussions naturally span the impact on commerce, culture, and the very structure of how we live and work. Produced by Conviction, this podcast serves as an essential, clear-eyed resource for anyone looking to move beyond the hype and understand the forces driving the AI revolution. Sarah Guo, a startup investor, and Elad Gil bring their direct experience to these conversations, ensuring every interview provides substantive insight you can use.
Author: Language: English Episodes: 100

No Priors: Artificial Intelligence | Technology | Startups
Podcast Episodes
Will we have Superintelligence by 2028? With Anthropic’s Ben Mann [not-audio_url] [/not-audio_url]

Duration: 41:25
What happens when you give AI researchers unlimited compute and tell them to compete for the highest usage rates? Ben Mann, Co-Founder, from Anthropic sits down with Sarah Guo and Elad Gil to explain how Claude 4 went fr…
AI is Making Enterprise Search Relevant, with Arvind Jain of Glean [not-audio_url] [/not-audio_url]

Duration: 31:34
Arvind Jain joins Sarah and Elad on this episode of No Priors. Arvind is the founder and CEO of Glean, an AI-powered enterprise search platform. He previously co-founded Rubrik and spent over a decade as an engineering l…
Gaming as the Future of Education with Duolingo CEO Luis von Ahn [not-audio_url] [/not-audio_url]

Duration: 32:14
On this episode of No Priors, Sarah talks to Luis von Ahn, founder and CEO of Duolingo, the world’s most popular education app with over 116 million monthly users and a market cap of approximately $17 billion. Controvers…