State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

Author: Conviction June 27, 2024 Duration: 34:08
This week on No Priors, Sarah Guo and Elad Gil sit down with Karan Goel and Albert Gu from Cartesia. Karan and Albert first met as Stanford AI Lab PhDs, where their lab invented Space Models or SSMs, a fundamental new primitive for training large-scale foundation models. In 2023, they Founded Cartesia to build real-time intelligence for every device. One year later, Cartesia released Sonic which generates high quality and lifelike speech with a model latency of 135ms—the fastest for a model of this class. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @krandiash | @_albertgu Show Notes:  (0:00) Introduction (0:28) Use Cases for Cartesia and Sonic  (1:32) Karan Goel & Albert Gu’s professional backgrounds (5:06) State Space Models (SSMs) versus Transformer Based Architectures  (11:51) Domain Applications for Hybrid Approaches  (13:10) Text to Speech and Voice (17:29) Data, Size of Models and Efficiency  (20:34) Recent Launch of Text to Speech Product (25:01) Multimodality & Building Blocks (25:54) What’s Next at Cartesia?  (28:28) Latency in Text to Speech (29:30) Choosing Research Problems Based on Aesthetic  (31:23) Product Demo (32:48) Cartesia Team & Hiring

Elad Gil and Sarah Guo guide conversations in No Priors: Artificial Intelligence | Technology | Startups that cut straight to the core of what's happening now. This isn't about abstract futures; it's grounded in dialogues with the very people building and shaping the field-leading AI engineers, pioneering researchers, and the founders turning theory into reality. Each episode tackles the pressing, often daunting questions that define this technological inflection point. You'll hear them explore the practical pathways and hurdles toward AGI, debate which industries are genuinely poised for transformation, and examine how the state-of-the-art in research translates into real-world products and societal shifts. The discussions naturally span the impact on commerce, culture, and the very structure of how we live and work. Produced by Conviction, this podcast serves as an essential, clear-eyed resource for anyone looking to move beyond the hype and understand the forces driving the AI revolution. Sarah Guo, a startup investor, and Elad Gil bring their direct experience to these conversations, ensuring every interview provides substantive insight you can use.
Author: Language: English Episodes: 100

No Priors: Artificial Intelligence | Technology | Startups
Podcast Episodes
How YC fosters AI Innovation with Garry Tan [not-audio_url] [/not-audio_url]

Duration: 39:59
Garry Tan is a notorious founder-turned-investor who is now running one of the most prestigious accelerators in the world, Y Combinator. As the president and CEO of YC, Garry has been credited with reinvigorating the pro…
The Data Foundry for AI with Alexandr Wang from Scale [not-audio_url] [/not-audio_url]

Duration: 39:00
Alexandr Wang was 19 when he realized that gathering data will be crucial as AI becomes more prevalent, so he dropped out of MIT and started Scale AI. This week on No Priors, Alexandr joins Sarah and Elad to discuss how…
Music consumers are becoming the creators with Suno CEO Mikey Shulman [not-audio_url] [/not-audio_url]

Duration: 30:26
Mikey Shulman, the CEO and co-founder of Suno, can see a future where the Venn diagram of music creators and consumers becomes one big circle. The AI music generation tool trying to democratize music has been making wave…
The Future of AI Artistry with Suhail Doshi from Playground AI [not-audio_url] [/not-audio_url]

Duration: 24:31
Multimodal models are making it possible to create AI art and augment creativity across artistic mediums. This week on No Priors, Sarah and Elad talk with Suhail Doshi, the founder of Playground AI, an image generator an…

«1...678910