Software and hardware acceleration with Groq

Author: Practical AI LLC April 2, 2025 Duration: 43:24

How do you enable AI acceleration (at both the hardware and software layers) that stays ahead of rapid industry shifts? In this episode, Dhananjay Singh from Groq dives into the evolving landscape of AI inference and acceleration. We explore how Groq optimizes the serving layer, adapts to industry shifts, and supports emerging model architectures. 

Sponsors:

  • Augment Code - Developer AI that uses deep understanding of your large codebase and how you build software to deliver personalized code suggestions and insights. Augment provides relevant, contextualized code right in your IDE or Slack. It transforms scattered knowledge into code or answers, eliminating time spent searching docs or interrupting teammates.

There's a lot of noise out there about artificial intelligence, but cutting through the hype to find what's genuinely useful can be a challenge. That's the space where Practical AI operates. Hosted by the team at Practical AI LLC, this technology podcast moves beyond abstract theory to explore how AI, machine learning, and large language models are actually being applied right now. Each episode features unscripted conversations with a diverse mix of experts, developers, business leaders, and curious minds. You'll hear tangible discussions about implementing machine learning systems, the realities of MLOps, the evolution of neural networks, and the practical implications of breakthroughs in deep learning and GANs. The dialogue is grounded in real-world scenarios, focusing on how these technologies solve problems, drive productivity, and create value in accessible ways. Whether you're a professional building models, a business person integrating AI tools, or an enthusiast eager to understand the landscape, this podcast offers a clear, conversational entry point. It’s about making sense of a complex field through the lens of practical application, demystifying the concepts that are shaping our world without losing sight of how they work on the ground.
Language: en-us Episodes: 100

Practical AI
Podcast Episodes
Should kids still learn to code?

Duration: 39:20
In this fully connected episode, Daniel & Chris discuss NVIDIA GTC keynote comments from CEO Jensen Huang about teaching kids to code. Then they dive into the notion of “community” in the AI world, before discussing chal…
AI vs software devs

Duration: 57:00
Daniel and Chris are out this week, so we’re bringing you conversations all about AI’s complicated relationship to software developers from other Changelog pods: JS Party, Go Time & The Changelog. Sponsors: Neo4j – Is your…
Prompting the future

Duration: 45:59
Daniel & Chris explore the state of the art in prompt engineering with Jared Zoneraich, the founder of PromptLayer. PromptLayer is the first platform built specifically for prompt engineering. It can visually manage prom…
Generating the future of art & entertainment

Duration: 42:06
Runway is an applied AI research company shaping the next era of art, entertainment & human creativity. Chris sat down with Runway co-founder / CTO, Anastasis Germanidis, to discuss their rise and how it’s defining the f…
YOLOv9: Computer vision is alive and well

Duration: 42:46
While everyone is super hyped about generative AI, computer vision researchers have been working in the background on significant advancements in deep learning architectures. YOLOv9 was just released with some noteworthy…
Representation Engineering (Activation Hacking)

Duration: 43:34
Recently, we briefly mentioned the concept of “Activation Hacking” in the episode with Karan from Nous Research. In this fully connected episode, Chris and Daniel dive into the details of this model control mechanism, al…
Leading the charge on AI in National Security

Duration: 52:03
Chris & Daniel explore AI in national security with Lt. General Jack Shanahan (USAF, Ret.). The conversation reflects Jack’s unique background as the only senior U.S. military officer responsible for standing up and lead…
Gemini vs OpenAI

Duration: 43:29
Google has been releasing a ton of new GenAI functionality under the name “Gemini”, and they’ve officially rebranded Bard as Gemini. We take some time to talk through Gemini compared with offerings from OpenAI, Anthropic…
Data synthesis for SOTA LLMs

Duration: 46:39
Nous Research has been pumping out some of the best open access LLMs using SOTA data synthesis techniques. Their Hermes family of models is incredibly popular! In this episode, Karan from Nous talks about the origins of…
