E186: Unlocking Your Unstructured Data with Typedef

Author: Robby (MTF); Tim (Essence VC)
November 20, 2025
Duration: 42:05

In our latest episode, our co-hosts Robby and Tim talk with Yoni Michael and Kostas Pardalis, Co-Founders of Typedef. Both have deep backgrounds in data infrastructure (Starburst, Tecton, etc.) and, after meeting through a "blind date" at Blue Bottle Coffee, decided to team up to address the growing brittleness of large-scale data pipelines - issues made worse by the rise of AI.

They explain how traditional systems like Spark weren’t designed for today’s AI workloads, especially unstructured data and LLM inference. Fenic was their answer: an open-source engine and DataFrame library built specifically for LLM workflows, multi-step reasoning, and agentic systems - without the operational complexity.

Their biggest lessons: start GTM early, talk to as many data leaders as possible, and keep validating - insights that led directly to open-sourcing Fenic and building its MCP-powered developer experience.


Building a company around open source software is a unique and often misunderstood path, full of specific challenges and rare opportunities. The Open Source Startup Podcast digs into that journey directly with the people who have navigated it, moving beyond theory to the practical realities shared in conversation. Hosts Robby and Tim bring their distinct perspectives from MTF and Essence VC to these discussions, creating a space where founders speak candidly. You’ll hear from the architects behind names like HashiCorp, MongoDB, and Vercel, as well as leaders from Chronosphere, DBT, and mobile.dev, as they unpack their experiences. This podcast focuses on the pivotal decisions around community building, monetization strategies, and maintaining project ethos under the pressures of scaling a business. Each episode serves as a detailed case study, revealing how these companies turned publicly available code into sustainable, impactful enterprises. The dialogue naturally explores the tensions between open collaboration and commercial needs, offering a real-world blueprint that is both instructive and nuanced. For anyone curious about the intersection of community-driven development and venture-scale growth, this series provides an essential and unfiltered resource.
Language: English
Episodes: 100

Open Source Startup Podcast
Podcast Episodes
E132: From General Purpose to Specialized Databases

Duration: 40:03
Joran Dirk Greef is Founder & CEO of TigerBeetle, the open source financial transactions database. Their project, also called tigerbeetle, has over 7K stars and is a database designed for mission-critical workloads and p…
E130: Orchestrating AI Workloads with Union AI

Duration: 38:43
Ketan Umare is Co-Founder & CEO of Union AI, the scalable MLOps platform focused on AI orchestration based on the flyte open source project. Union AI has raised $29M from investors including NEA & Nava Ventures. In this…
E129: The Race to Help Build Custom AI Models

Duration: 38:39
Sahil Chaudhary is Founder of Glaive AI, the platform to build models that are faster, cheaper and outperform general purpose models with the help of synthetic data. In this episode, we discuss why education is so import…
E128: Simplifying Complex Infrastructure with Encore

Duration: 33:49
André Eriksson is Founder & CEO of Encore, the backend development platform for startups building event-driven and distributed systems. This is André's second time on the Open Source Startup Podcast (first episode here)…
E127: Reimagining VPNs with Tailscale

Duration: 43:19
Avery Pennarun is Co-Founder & CEO of Tailscale, the WireGuard-based VPN that reimagines secure, private networks. Tailscale has raised $115M from investors including Heavybit, Accel, CRV, and Insight. In this episode, w…
E126: RisingWave's Take on Launching a New Database

Duration: 40:27
Yingjun Wu is Founder of RisingWave, a new open source stream processing database. RisingWave has raised $40M from investors including Yunqi Partners. In this episode, we discuss RisingWave's approach versus Apache Flink…
E125: Let's Help Engineering Teams Productionize AI

Duration: 43:29
Andrew Hoh is Co-Founder of LastMile AI, the AI developer platform for engineering teams to productionize LLM applications. They take an "open periphery" stance on open source with projects like AIConfig to help develope…
E124: Re-Focusing on Security - the Sysdig Story

Duration: 44:51
Loris Degioanni is Founder & CTO of Sysdig, the observability and container security company behind the Falco and Sysdig open source projects. Both projects are widely adopted, with 7K GitHub Stars each. Sysdig is a $2.5…
E123: Real-time Video & Audio Infrastructure for Conversational AI

Duration: 44:15
Russ d'Sa is Founder of LiveKit, the real-time streaming audio, video, and data infrastructure platform for developers. Their open source project, also called livekit, provides the end-to-end stack for WebRTC and has ove…