Context-Aware SQL and Metadata with Shinji Kim

Context-Aware SQL and Metadata with Shinji Kim

Author: softwareengineeringdaily.com September 4, 2025 Duration: 41:37
A common challenge in data-rich organizations is that critical context about the data is often hard to capture and even harder to keep up to date. As more people across the organization use data and data models get more complex, simply finding the right dataset can be slow and create bottlenecks. Select Star is a data discovery and metadata platform that builds a continuously updated knowledge graph of an organization’s data by analyzing both its structure and how it’s actually used. It enriches data with context such as popularity, lineage, and semantic models, making it easier for AI and teams to discover, trust, and use the right data. These enriched metadata layers are also highly valuable for large language models, significantly improving the accuracy of generated SQL queries. Shinji Kim is the founder and CEO of Select Star, and she joins Sean Falconer to discuss solving metadata curation challenges, managing data context at scale, using LLMs for SQL generation, emerging trends in metadata management, and more. Full Disclosure: This episode is sponsored by Select Star. Sean's been an academic, startup founder, and Googler. He has published works covering a wide range of topics from AI to quantum computing. Currently, Sean is an AI Entrepreneur in Residence at Confluent where he works on AI strategy and thought leadership. You can connect with Sean on LinkedIn.   Please click here to see the transcript of this episode. Sponsorship inquiries: sponsor@softwareengineeringdaily.com

For anyone curious about how the code running our world actually gets built, Software Engineering Daily offers a clear and consistent look behind the curtain. This isn't about hype cycles or surface-level news; it's a deep, technical conversation with the engineers, architects, and thinkers who are shaping our digital infrastructure. Each episode focuses on a specific technology, practice, or problem, breaking down complex systems into understandable parts. You'll hear detailed discussions on everything from database architectures and programming language design to the organizational challenges of scaling teams and the real-world trade-offs made in production systems. Hosted by softwareengineeringdaily.com, the podcast serves as a reliable source for developers who want to stay informed and inspired, translating the rapid pace of technological change into substantive, lasting knowledge. It’s for professionals who believe that understanding the "how" and "why" is just as important as knowing the "what." By dedicating time to thorough exploration, this podcast provides context that shorter formats simply cannot, making it an essential resource for anyone building the future, one line of code at a time. Tune in to hear unfiltered insights from the people on the front lines, discussing the tools and decisions that define modern software engineering.
Author: Language: en-us Episodes: 100

Software Engineering Daily
Podcast Episodes
Turning Agent Autonomy into Productivity with Chris Weichel [not-audio_url] [/not-audio_url]

Duration: 1:01:21
A common challenge in software development is creating and maintaining robust development environments. The rise of AI agents has amplified this complexity by adding new demands around permission controls, environment is…
Homebrew and macOS Package Management with Mike McQuaid [not-audio_url] [/not-audio_url]

Duration: 1:06:58
Homebrew is a widely used package manager that simplifies the installation of open-source software on macOS. It was created in response to the growing demand for a lightweight, developer-friendly tool suited to an increa…
Engineering in the Age of Agents with Yechezkel Rabinovich [not-audio_url] [/not-audio_url]

Duration: 50:05
Modern software platforms are increasingly composed of diverse microservices, third-party APIs, and cloud resources. The distributed nature of these systems makes it difficult for engineers to gain a clear view of how th…
Static Analysis for Ruby with Jake Zimmerman [not-audio_url] [/not-audio_url]

Duration: 48:15
Dynamic languages, like Ruby, Python, and JavaScript, determine the types of variables at runtime rather than at compile time. This flexibility allows for rapid development and concise code, but it also makes it harder t…
Scaling AI in Enterprise Codebases with Guy Gur-Ari [not-audio_url] [/not-audio_url]

Duration: 52:04
The rise of language-model coding assistants has led to the creation of the vibe coding paradigm. In this mode of software development, AI agents take a plain language prompt and generate entire applications, which drama…
Orkes and Agentic Workflow Orchestration with Viren Baraiya [not-audio_url] [/not-audio_url]

Duration: 46:44
Modern software systems are composed of many independent microservices spanning frontends, backends, APIs, and AI models, and coordinating and scaling them reliably is a constant challenge. A workflow orchestration platf…
Turbopuffer with Simon Hørup Eskildsen [not-audio_url] [/not-audio_url]

Duration: 51:29
Vector search has become a foundational technology for AI applications, enabling everything from semantic code search to contextual retrieval for large language models. However, a major challenge with vector databases ha…
Building an Indie Hit in Godot with Jay Baylis and Tom Coxon [not-audio_url] [/not-audio_url]

Duration: 43:07
Cassette Beasts is a turn-based monster-battling RPG that lets players record creatures onto cassette tapes and transform into them during battle. The game was an indie hit, and is also one of the most successful games b…
Rethinking GraphQL Frontends with Robert Balicki [not-audio_url] [/not-audio_url]

Duration: 38:50
A challenge in modern frontend application design is efficiently fetching and managing GraphQL data while keeping UI components responsive and maintainable. Developers often face issues like over-fetching, under-fetching…