State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

Author: Conviction June 27, 2024 Duration: 34:08
This week on No Priors, Sarah Guo and Elad Gil sit down with Karan Goel and Albert Gu from Cartesia. Karan and Albert first met as Stanford AI Lab PhDs, where their lab invented Space Models or SSMs, a fundamental new primitive for training large-scale foundation models. In 2023, they Founded Cartesia to build real-time intelligence for every device. One year later, Cartesia released Sonic which generates high quality and lifelike speech with a model latency of 135ms—the fastest for a model of this class. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @krandiash | @_albertgu Show Notes:  (0:00) Introduction (0:28) Use Cases for Cartesia and Sonic  (1:32) Karan Goel & Albert Gu’s professional backgrounds (5:06) State Space Models (SSMs) versus Transformer Based Architectures  (11:51) Domain Applications for Hybrid Approaches  (13:10) Text to Speech and Voice (17:29) Data, Size of Models and Efficiency  (20:34) Recent Launch of Text to Speech Product (25:01) Multimodality & Building Blocks (25:54) What’s Next at Cartesia?  (28:28) Latency in Text to Speech (29:30) Choosing Research Problems Based on Aesthetic  (31:23) Product Demo (32:48) Cartesia Team & Hiring

Elad Gil and Sarah Guo guide conversations in No Priors: Artificial Intelligence | Technology | Startups that cut straight to the core of what's happening now. This isn't about abstract futures; it's grounded in dialogues with the very people building and shaping the field-leading AI engineers, pioneering researchers, and the founders turning theory into reality. Each episode tackles the pressing, often daunting questions that define this technological inflection point. You'll hear them explore the practical pathways and hurdles toward AGI, debate which industries are genuinely poised for transformation, and examine how the state-of-the-art in research translates into real-world products and societal shifts. The discussions naturally span the impact on commerce, culture, and the very structure of how we live and work. Produced by Conviction, this podcast serves as an essential, clear-eyed resource for anyone looking to move beyond the hype and understand the forces driving the AI revolution. Sarah Guo, a startup investor, and Elad Gil bring their direct experience to these conversations, ensuring every interview provides substantive insight you can use.
Author: Language: English Episodes: 100

No Priors: Artificial Intelligence | Technology | Startups
Podcast Episodes
Cloud Strategy in the AI Era with Matt Garman, CEO of AWS [not-audio_url] [/not-audio_url]

Duration: 42:58
In this episode of No Priors, hosts Sarah and Elad are joined by Matt Garman, the CEO of Amazon Web Services. They talk about the evolution of Amazon Web Services (AWS) from its inception to its current position as a maj…
The marketplace for AI compute with Jared Quincy Davis from Foundry [not-audio_url] [/not-audio_url]

Duration: 43:12
In this episode of No Priors, hosts Sarah and Elad are joined by Jared Quincy Davis, former DeepMind researcher and the Founder and CEO of Foundry, a new AI cloud computing service provider. They discuss the research pro…
The Best of 2024 (so far) with Sarah Guo and Elad Gil [not-audio_url] [/not-audio_url]

Duration: 25:56
Believe or not, we’re almost halfway through 2024. Sarah and Elad have spent the first of this year talking with some of the most innovative minds in the AI industry, so we’re taking a look at some of our favorite No Pri…
Can AI replace the camera? with Joshua Xu from HeyGen [not-audio_url] [/not-audio_url]

Duration: 27:26
AI video generation models still have a long way to go when it comes to making compelling and complex videos but the HeyGen team are well on their way to streamlining the video creation process by using a combination of…