Technical advances in document understanding

Technical advances in document understanding

Author: Practical AI LLC December 2, 2025 Duration: 49:18

Chris and Daniel unpack how AI-driven document processing has rapidly evolved well beyond traditional OCR with many technical advances that fly under the radar. They explore the progression from document structure models to language-vision models, all the way to the newest innovations like Deepseek-OCR. The discussion highlights the pros and cons of these various approaches focusing on practical implementation and usage.

Featuring:

Sponsors:

  • Shopify – The commerce platform trusted by millions. From idea to checkout, Shopify gives you everything you need to launch and scale your business—no matter your level of experience. Build beautiful storefronts, market with built-in AI tools, and tap into the platform powering 10% of all U.S. eCommerce. Start your one-dollar trial at shopify.com/practicalai
  • Fabi.ai - The all-in-one data analysis platform for modern teams. From ad hoc queries to advanced analytics, Fabi lets you explore data wherever it lives—spreadsheets, Postgres, Snowflake, Airtable and more. Built-in Python and AI assistance help you move fast, then publish interactive dashboards or automate insights delivered straight to Slack, email, spreadsheets or wherever you need to share it. Learn more and get started for free at fabi.ai
  • Framer – Design and publish without limits with Framer, the free all-in-one design platform. Unlimited projects, no tool switching, and professional sites—no Figma imports or HTML hassles required. Start creating for free at framer.com/design with code `PRACTICALAI` for a free month of Framer Pro.

Upcoming Events: 


There's a lot of noise out there about artificial intelligence, but cutting through the hype to find what's genuinely useful can be a challenge. That's the space where Practical AI operates. Hosted by the team at Practical AI LLC, this technology podcast moves beyond abstract theory to explore how AI, machine learning, and large language models are actually being applied right now. Each episode features unscripted conversations with a diverse mix of experts, developers, business leaders, and curious minds. You'll hear tangible discussions about implementing machine learning systems, the realities of MLOps, the evolution of neural networks, and the practical implications of breakthroughs in deep learning and GANs. The dialogue is grounded in real-world scenarios, focusing on how these technologies solve problems, drive productivity, and create value in accessible ways. Whether you're a professional building models, a business person integrating AI tools, or an enthusiast eager to understand the landscape, this podcast offers a clear, conversational entry point. It’s about making sense of a complex field through the lens of practical application, demystifying the concepts that are shaping our world without losing sight of how they work on the ground.
Author: Language: en-us Episodes: 100

Practical AI
Podcast Episodes
Workforce dynamics in an AI-assisted world [not-audio_url] [/not-audio_url]

Duration: 44:06
We unpack how AI is reshaping hiring decisions, shifting job roles, and creating new expectations for professionals — from engineers to marketers. They explore the rise of AI-assisted teams, the growing compensation bubb…
Reimagining actuarial science with AI [not-audio_url] [/not-audio_url]

Duration: 40:59
In this episode, Chris sits down with Igor Nikitin, CEO and co-founder of Nice Technologies, to explore how AI and modern engineering practices are transforming the actuarial field and setting the stage for the future of…
Agentic AI for Drone & Robotic Swarming [not-audio_url] [/not-audio_url]

Duration: 46:27
In this episode of Practical AI, Chris and Daniel explore the fascinating world of agentic AI for drone and robotic swarms, which is Chris's passion and professional focus. They unpack how autonomous vehicles (UxV), dron…
AI in the shadows: From hallucinations to blackmail [not-audio_url] [/not-audio_url]

Duration: 44:50
In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasing concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break d…
Finding Nemotron [not-audio_url] [/not-audio_url]

Duration: 46:23
In this episode, we sit down with Joey Conway to explore NVIDIA's open source AI, from the reasoning-focused Nemotron models built on top of Llama, to the blazing-fast Parakeet speech model. We chat about what makes open…
AI hot takes and debates: Autonomy [not-audio_url] [/not-audio_url]

Duration: 45:36
Can AI-driven autonomy reduce harm, or does it risk dehumanizing decision-making? In this “AI Hot Takes & Debates” series episode, Daniel and Chris dive deep into the ethical crossroads of AI, autonomy, and military appl…
Behind-the-Scenes: VC Funding for AI Startups [not-audio_url] [/not-audio_url]

Duration: 41:48
It seems like we are bombarded by news about millions of dollars pouring into AI startups, which have crazy valuations. In this episode, Chris and Dan dive deep into the highs, lows, and hard choices behind funding an AI…
AI-Automated Film Making [not-audio_url] [/not-audio_url]

Duration: 43:02
An recent article in Variety was titled: "Sylvester Stallone-Backed Largo.ai Teams With Brilliant Pictures for ‘World’s First Fully AI-Automated Film Company’". Obviously this caught our attention! We sit down with Sami…
Federated learning in production (part 2) [not-audio_url] [/not-audio_url]

Duration: 45:25
Chong Shen from Flower Labs joins us to discuss what it really takes to build production-ready federated learning systems that work across data silos. We talk about the Flower framework and it's architecture (supernodes,…
Federated learning in production (part 1) [not-audio_url] [/not-audio_url]

Duration: 44:38
In this first of a two part series of episodes on federated learning, we dive into the evolving world of federated learning and distributed AI frameworks with Patrick Foley from Intel. We explore how frameworks like Open…