GPT Reviews
Microsoft has launched Phi-3 Mini, its smallest AI model yet, designed to be cheaper to run than its larger counterparts.
SoftBank plans to invest nearly $1 billion in Nvidia's chips to bolster its computing facilities and develop its own generative AI, giving Japan a strong domestic player in the AI space.
HuggingFace has released FineWeb, a dataset of more than 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl; models trained on it outperform those trained on other commonly used high-quality web datasets.
The papers discussed in this episode cover topics such as extending embedding models for long context retrieval, automating graphic design using large multimodal models, and Microsoft's innovative approach to training the Phi-3 Mini AI model.
Contact: sergi@earkind.com
Timestamps:
00:34 Introduction
01:35 Microsoft launches Phi-3, its smallest AI model yet
03:10 SoftBank will reportedly invest nearly $1 billion in AI push, tapping Nvidia's chips
05:11 HuggingFace Releases FineWeb: 15 Trillion tokens to train on
06:02 Fake sponsor
08:15 Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
09:42 LongEmbed: Extending Embedding Models for Long Context Retrieval
11:04 Graphic Design with Large Multimodal Model
12:53 Outro