E186: Unlocking Your Unstructured Data with Typedef

Author: Robby (MTF); Tim (Essence VC) | November 20, 2025 | Duration: 42:05

In our latest episode, co-hosts Robby and Tim talk with Yoni Michael and Kostas Pardalis, Co-Founders of Typedef. Both have deep backgrounds in data infrastructure (Starburst, Tecton, etc.) and, after meeting through a "blind date" at Blue Bottle Coffee, decided to team up to address the growing brittleness of large-scale data pipelines, a problem made worse by the rise of AI.

They explain how traditional systems like Spark weren't designed for today's AI workloads, especially unstructured data and LLM inference. Their answer was Fenic: an open-source engine and DataFrame library built specifically for LLM workflows, multi-step reasoning, and agentic systems, without the operational complexity.

Their biggest lessons: start GTM early, talk to as many data leaders as possible, and keep validating. Those insights led directly to open-sourcing Fenic and building its MCP-powered developer experience.


Building a company around open source software is a unique and often misunderstood path, full of specific challenges and rare opportunities. The Open Source Startup Podcast digs into that journey directly with the people who have navigated it, moving beyond theory to the practical realities shared in conversation. Hosts Robby and Tim bring their distinct perspectives from MTF and Essence VC to these discussions, creating a space where founders speak candidly. You’ll hear from the architects behind names like HashiCorp, MongoDB, and Vercel, as well as leaders from Chronosphere, DBT, and mobile.dev, as they unpack their experiences. This podcast focuses on the pivotal decisions around community building, monetization strategies, and maintaining project ethos under the pressures of scaling a business. Each episode serves as a detailed case study, revealing how these companies turned publicly available code into sustainable, impactful enterprises. The dialogue naturally explores the tensions between open collaboration and commercial needs, offering a real-world blueprint that is both instructive and nuanced. For anyone curious about the intersection of community-driven development and venture-scale growth, this series provides an essential and unfiltered resource.
Language: English | Episodes: 100

Open Source Startup Podcast
Podcast Episodes
E142: Redefining Self-Serve Analytics with Dremio

Duration: 41:26
Tomer Shiran is Founder of Dremio, the data lakehouse platform for self-service analytics and AI based on open source frameworks Apache Arrow, which the Dremio team created, and Apache Iceberg. Dremio has raised over $40…
E139: Taking on AWS with an Open Source Alternative

Duration: 38:05
Umur Cubukcu is Co-Founder of Ubicloud, the open source and portable cloud that can reduce cloud spend by 3–10x. Their project, also called ubicloud, has over 3K stars and provides elastic compute, block storage, virtual…
E138: The Database Pioneer Behind Ingres, Postgres & DBOS

Duration: 38:28
Michael Stonebraker is a legendary database system pioneer as the founder of Ingres, Postgres, and now DBOS. His work while at Berkeley and then MIT has been central to many relational database companies. His new company…
E137: Monitoring Infrastructure with Chalk Marks

Duration: 40:13
John Viega is Co-Founder & CEO of Crash Override, the open source monitoring platform based on the Chalk project which has 22K stars on GitHub. Crash Override has raised $14M from investors including SYN Ventures, BVP &…
E136: Creating the Vector Database for AI Application Developers

Duration: 39:35
Jeff Huber is Co-Founder of Chroma, the open source vector database. Their open source project, also called chroma, has 13K stars on GitHub. Chroma has raised $20M from investors including Quiet Ventures and Bloomberg Be…
E135: Riding the Homebrew Wave

Duration: 42:31
John Britton & Mike McQuaid are Co-Founders of Workbrew, the company that provides additional features and support for companies using Homebrew. Homebrew's main project, brew, is a wildly popular open source project with…
E134: Making Complex Data RAG-Ready with Unstructured

Duration: 37:06
Brian Raymond is Founder & CEO of Unstructured, the platform to extract and transform complex data for use with every major vector database and LLM framework. Their open source project has 7K stars on GitHub and includes…
E133: Reinventing Authorization with Google's Zanzibar Paper

Duration: 39:25
Jake Moshenko is Co-Founder & CEO of AuthZed, the scalable authorization platform based on Google's Zanzibar white paper. Their open source permissions database spiceDB has 5K stars on GitHub and enables fine-grained acc…