YOLOv9: Computer vision is alive and well

YOLOv9: Computer vision is alive and well

Author: Practical AI LLC March 6, 2024 Duration: 42:46

While everyone is super hyped about generative AI, computer vision researchers have been working in the background on significant advancements in deep learning architectures. YOLOv9 was just released with some noteworthy advancements relevant to parameter efficient models. In this episode, Chris and Daniel dig into the details and also discuss advancements in parameter efficient LLMs, such as Microsofts 1-Bit LLMs and Qualcomm’s new AI Hub.


Sponsors:

  • Changelog News – A podcast+newsletter combo that’s brief, entertaining & always on-point. Subscribe today
  • Fly.ioThe home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs
  • SentryLaunch week! New features and products all week long (so get comfy)! Tune in to Sentry’s YouTube and Discord daily at 9am PT to hear the latest scoop. Too busy? No problem - enter your email address to receive all the announcements (and win swag along the way). Use the code CHANGELOG when you sign up to get $100 OFF the team plan. 
  • Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You literally can’t get any faster! 

Featuring:

Show Notes:

YOLOv9:

Parameter efficient LLMs:

Upcoming Events: 


There's a lot of noise out there about artificial intelligence, but cutting through the hype to find what's genuinely useful can be a challenge. That's the space where Practical AI operates. Hosted by the team at Practical AI LLC, this technology podcast moves beyond abstract theory to explore how AI, machine learning, and large language models are actually being applied right now. Each episode features unscripted conversations with a diverse mix of experts, developers, business leaders, and curious minds. You'll hear tangible discussions about implementing machine learning systems, the realities of MLOps, the evolution of neural networks, and the practical implications of breakthroughs in deep learning and GANs. The dialogue is grounded in real-world scenarios, focusing on how these technologies solve problems, drive productivity, and create value in accessible ways. Whether you're a professional building models, a business person integrating AI tools, or an enthusiast eager to understand the landscape, this podcast offers a clear, conversational entry point. It’s about making sense of a complex field through the lens of practical application, demystifying the concepts that are shaping our world without losing sight of how they work on the ground.
Author: Language: en-us Episodes: 100

Practical AI
Podcast Episodes
The perplexities of information retrieval [not-audio_url] [/not-audio_url]

Duration: 46:06
Daniel & Chris sit down with Denis Yarats, Co-founder & CTO at Perplexity, to discuss Perplexity’s sophisticated AI-driven answer engine. Denis outlines some of the deficiencies in search engines, and how Perplexity’s ap…
Using edge models to find sensitive data [not-audio_url] [/not-audio_url]

Duration: 38:28
We’ve all heard about breaches of privacy and leaks of private health information (PHI). For healthcare providers and those storing this data, knowing where all the sensitive data is stored is non-trivial. Ramin, from Ta…
Rise of the AI PC & local LLMs [not-audio_url] [/not-audio_url]

Duration: 35:35
We’ve seen a rise in interest recently and a number of major announcements related to local LLMs and AI PCs. NVIDIA, Apple, and Intel are getting into this along with models like the Phi family from Microsoft. In this ep…
AI in the U.S. Congress [not-audio_url] [/not-audio_url]

Duration: 40:48
At the age of 72, U.S. Representative Don Beyer of Virginia enrolled at GMU to pursue a Master’s degree in C.S. with a concentration in Machine Learning.Rep. Beyer is Vice Chair of the bipartisan Artificial Intelligence…
First impressions of GPT-4o [not-audio_url] [/not-audio_url]

Duration: 43:28
Daniel & Chris share their first impressions of OpenAI’s newest LLM: GPT-4o and Daniel tries to bring the model into the conversation with humorously mixed results. Together, they explore the implications of Omni’s new f…
Full-stack approach for effective AI agents [not-audio_url] [/not-audio_url]

Duration: 47:02
There’s a lot of hype about AI agents right now, but developing robust agents isn’t yet a reality in general. Imbue is leading the way towards more robust agents by taking a full-stack approach; from hardware innovations…
Autonomous fighter jets?! [not-audio_url] [/not-audio_url]

Duration: 41:11
Yep, you heard that right. Autonomous fighter jets are in the news. Chris and Daniel discuss a modified F-16 known as the X-62A VISTA and autonomous vehicles/ systems more generally. They also comment on the Linux Founda…
Private, open source chat UIs [not-audio_url] [/not-audio_url]

Duration: 38:25
We recently gathered some Practical AI listeners for a live webinar with Danny from LibreChat to discuss the future of private, open source chat UIs. During the discussion we hear about the motivations behind LibreChat,…
Mamba & Jamba [not-audio_url] [/not-audio_url]

Duration: 41:13
First there was Mamba… now there is Jamba from AI21. This is a model that combines the best non-transformer goodness of Mamba with good ‘ol attention layers. This results in a highly performant and efficient model that A…
Udio & the age of multi-modal AI [not-audio_url] [/not-audio_url]

Duration: 38:52
2024 promises to be the year of multi-modal AI, and we are already seeing some amazing things. In this “fully connected” episode, Chris and Daniel explore the new Udio product/service for generating music. Then they dig…