E182: The Rise of ClickHouse

E182: The Rise of ClickHouse

Author: Robby (MTF); Tim (Essence VC) October 9, 2025 Duration: 47:02

In the episode, we sat down with ClickHouse Co-Founder Yury Izrailevsky to unpack how one of the fastest open-source databases in the world became the analytics engine of choice for 2,000 customers including Harvey, Canva, HP, and Supabase. From its Yandex origins to powering AI observability, Yury shares how ClickHouse balances open-source roots, cloud innovation, and a remote-first culture moving at breakneck speed.

ClickHouse's Series C valued the company at $6.35B earlier this year, and just yesterday they announced an extension to that round, just months after it was raised.

In this episode, we dig into:

  • Origins & Founding Story

    • ClickHouse began as an internal project at Yandex to power a Google Analytics–style platform, focused on performance and scale.

    • Open-sourced in 2016 - rapid global adoption laid the foundation for ClickHouse the company.

    • Yury first discovered ClickHouse while at Google; impressed by its speed, he later co-founded the company in 2021 alongside Aaron Katz (ex-Elastic) and the original creator Alexey Milovidov.

  • Why ClickHouse Stands Out

    • Column-oriented, open source OLAP database designed for massive-scale analytical processing.

    • Excels in performance, efficiency, and cost - ideal for large data volumes and real-time analytics (and now AI workloads).

    • Architectural choices:

      • Columnar storage = better compression and faster execution.

      • Separation of compute and storage enables elasticity, scalability, and resilience in the cloud.

  • Open Source vs. Cloud

    • Open-source version offers freedom and flexibility.

    • Cloud product delivers much lower total cost of ownership and fully managed experience.

    • Architectural parity between the two ensuring no vendor lock-in for customers.

    • Customers can run the same queries on both; most stay with cloud due to simplicity and cost efficiency.

  • Use Cases & Ecosystem

    • 4 main use cases:

      1. Real-time analytics

      2. Data Warehousing

      3. Observability

      4. AI / ML Workloads

  • Company Building & Culture

    • Fully remote from day one.

    • Prioritized experienced, self-sufficient engineers over early-career hires.

    • Built and launched GA version in less than a year - insane pace of innovation.

  • Innovation & Community

    • Monthly release cadence.

    • Hundreds of integrations and connectors.

    • Strong open-source and commercial community

  • Advice for Founders

    • Focus on what matters most

    • Hire mature, independent thinkers.

    • Move fast but maintain quality; ClickHouse Cloud achieved production-grade quality in record time.


Building a company around open source software is a unique and often misunderstood path, full of specific challenges and rare opportunities. The Open Source Startup Podcast digs into that journey directly with the people who have navigated it, moving beyond theory to the practical realities shared in conversation. Hosts Robby and Tim bring their distinct perspectives from MTF and Essence VC to these discussions, creating a space where founders speak candidly. You’ll hear from the architects behind names like HashiCorp, MongoDB, and Vercel, as well as leaders from Chronosphere, DBT, and mobile.dev, as they unpack their experiences. This podcast focuses on the pivotal decisions around community building, monetization strategies, and maintaining project ethos under the pressures of scaling a business. Each episode serves as a detailed case study, revealing how these companies turned publicly available code into sustainable, impactful enterprises. The dialogue naturally explores the tensions between open collaboration and commercial needs, offering a real-world blueprint that is both instructive and nuanced. For anyone curious about the intersection of community-driven development and venture-scale growth, this series provides an essential and unfiltered resource.
Author: Language: English Episodes: 100

Open Source Startup Podcast
Podcast Episodes
E142: Redefining Self-Serve Analytics with Dremio [not-audio_url] [/not-audio_url]

Duration: 41:26
Tomer Shiran is Founder of Dremio, the data lakehouse platform for self-service analytics and AI based on open source frameworks Apache Arrow, which the Dremio team created, and Apache Iceberg. Dremio has raised over $40…
E139: Taking on AWS with an Open Source Alternative [not-audio_url] [/not-audio_url]

Duration: 38:05
Umur Cubukcu is Co-Founder of Ubicloud, the open source and portable cloud that can reduce cloud spend by 3–10x. Their project, also called ubicloud, has over 3K stars and provides elastic compute, block storage, virtual…
E138: The Database Pioneer Behind Ingres, Postgres & DBOS [not-audio_url] [/not-audio_url]

Duration: 38:28
Michael Stonebraker is a legendary database system pioneer as the founder of Ingres, Postgres, and now DBOS. His work while at Berkeley and then MIT has been central to many relational database companies. His new company…
E137: Monitoring Infrastructure with Chalk Marks [not-audio_url] [/not-audio_url]

Duration: 40:13
John Viega is Co-Founder & CEO of Crash Override, the open source monitoring platform based on the Chalk project which has 22K stars on GitHub. Crash Override has raised $14M from investors including SYN Ventures, BVP &…
E136: Creating the Vector Database for AI Application Developers [not-audio_url] [/not-audio_url]

Duration: 39:35
Jeff Huber is Co-Founder of Chroma, the open source vector database. Their open source project, also called chroma, has 13K stars on GitHub. Chroma has raised $20M from investors including Quiet Ventures and Bloomberg Be…
E135: Riding the Homebrew Wave [not-audio_url] [/not-audio_url]

Duration: 42:31
John Britton & Mike McQuaid are Co-Founders of Workbrew, the company that provides additional features and support for companies using Homebrew. Homebrew's main project, brew, is a wildly popular open source project with…
E134: Making Complex Data RAG-Ready with Unstructured [not-audio_url] [/not-audio_url]

Duration: 37:06
Brian Raymond is Founder & CEO of Unstructured, the platform to extract and transform complex data for use with every major vector database and LLM framework. Their open source project has 7K stars on GitHub and includes…
E133: Reinventing Authorization with Google's Zanzibar Paper [not-audio_url] [/not-audio_url]

Duration: 39:25
Jake Moshenko is Co-Founder & CEO of AuthZed, the scalable authorization platform based on Google's Zanzibar white paper. Their open source permissions database spiceDB has 5K stars on GitHub and enables fine-grained acc…