Microsoft's Latest $3.2B AI Investment πŸ‡ΈπŸ‡ͺ // Grokfast Algorithm πŸ’ͺ // Zipper Decoder Architecture 🎧

Microsoft's Latest $3.2B AI Investment πŸ‡ΈπŸ‡ͺ // Grokfast Algorithm πŸ’ͺ // Zipper Decoder Architecture 🎧

Author: Earkind June 4, 2024 Duration: 14:53

Microsoft is investing $3.2 billion in Sweden for cloud and AI infrastructure, deploying 20,000 advanced graphics processing units and training 250,000 Swedes with AI skills over three years.

"Grokfast" is a new algorithm that accelerates generalization under the grokking phenomenon in machine learning by amplifying the slow-varying component of gradients, improving performance on tasks like image classification.

"Zipper" is a multi-tower decoder architecture that uses cross-attention to flexibly compose multimodal generative models from independently pre-trained unimodal decoders, showcasing superior performance in tasks like speech-to-text generation.

"MetRag" is a new framework for retrieval augmented generation that combines similarity and utility-oriented models, using an LLM as a task adaptive summarizer to generate knowledge-augmented text and outperforming existing models on knowledge-intensive tasks like finance and medicine.

Contact:Β Β sergi@earkind.com

Timestamps:

00:34 Introduction

01:49Β Microsoft to invest $3.2 bln in Swedish cloud, AI

03:42Β State Space Duality (Mamba-2) Part I - The Model

04:47Β Sam Altman, Lately

06:08 Fake sponsor

08:39Β Grokfast: Accelerated Grokking by Amplifying Slow Gradients

10:11Β Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

11:38Β Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

13:52 Outro


Each morning, GPT Reviews serves up a fresh, slightly chaotic conversation about everything happening in artificial intelligence. This daily podcast from Earkind is actually crafted by AI, offering a unique blend of the latest headlines, major announcements, and intriguing research plucked from sources like arXiv. But it’s far from a dry briefing. The dynamic comes from its four distinct hosts: Giovani Pete Tizzano brings relentless optimism as an AI enthusiast, while Robert, the analyst, provides a grounded and often skeptical counterpoint. Olivia, who’s deeply embedded in online communities, shares the buzz and broader reactions, and Belinda, the witty research expert, helps unpack the technical details with clarity and a sharp sense of humor. Tuning in feels like dropping into a lively roundtable where complex ideas are debated, explained, and occasionally laughed about. You’ll get a comprehensive yet digestible overview of the AI landscape, all wrapped in a format that’s as entertaining as it is informative. The result is a consistently engaging listen that keeps you updated without feeling like homework, making it a standout in the daily news podcast space.
Author: Language: English Episodes: 100

GPT Reviews
Podcast Episodes