#199 - OpenAI's 03-mini, Gemini Thinking, Deep Research, s1

#199 - OpenAI's 03-mini, Gemini Thinking, Deep Research, s1

Author: Skynet Today February 12, 2025 Duration: 1:37:46
Our 199th episode with a summary and discussion of last week's big AI news! Recorded on 02/09/2025 Join our brand new Discord here! https://discord.gg/nTyezGSKwP Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. In this episode: - OpenAI's deep research feature capability launched, allowing models to generate detailed reports after prolonged inference periods, competing directly with Google's Gemini 2.0 reasoning models.  - France and UAE jointly announce plans to build a massive AI data center in France, aiming to become a competitive player within the AI infrastructure landscape.  - Mistral introduces a mobile app, broadening its consumer AI lineup amidst market skepticism about its ability to compete against larger firms like OpenAI and Google.  - Anthropic unveils 'Constitutional Classifiers,' a method showing strong defenses against universal jailbreaks; they also launched a $20K challenge to find weaknesses. Timestamps + Links: (00:00:00) Intro / Banter (00:02:27) News Preview (00:03:28) Response to listener comments Tools & Apps (00:08:01) OpenAI now reveals more of its o3-mini model’s thought process (00:16:03) Google’s Gemini app adds access to ‘thinking’ AI models (00:21:04) OpenAI Unveils A.I. Tool That Can Do Research Online (00:31:09) Mistral releases its AI assistant on iOS and Android (00:36:17) AI music startup Riffusion launches its service in public beta (00:39:11) Pikadditions by Pika Labs lets users seamlessly insert objects into videos Applications & Business (00:41:19) Softbank set to invest $40 billion in OpenAI at $260 billion valuation, sources say (00:47:36) UAE to invest billions in France AI data centre (00:50:34) Report: Ilya Sutskever’s startup in talks to fundraise at roughly $20B valuation (00:52:03) ASML to Ship First Second-Gen High-NA EUV Machine in the Coming Months, Aiming for 2026 Production (00:54:38) NVIDIA’s GB200 NVL 72 Shipments Not Under Threat From DeepSeek As Hyperscalers Maintain CapEx; Meanwhile, Trump Tariffs Play Havoc With TSMC’s Pricing Strategy Projects & Open Source (00:56:49) The Allen Institute for AI (AI2) Releases Tülu 3 405B: Scaling Open-Weight... (01:00:06) SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (01:03:56) PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models (01:08:26) OpenEuroLLM: Europe’s New Initiative for Open-Source AI Development Research & Advancements (01:10:34) LIMO: Less is More for Reasoning (01:16:39) s1: Simple test-time scaling (01:19:17) ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning (01:23:55) Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Policy & Safety (01:26:50) US sets AI safety aside in favor of 'AI dominance' (01:29:39) Almost Surely Safe Alignment of Large Language Models at Inference-Time (01:32:02) Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming (01:33:16) Anthropic offers $20,000 to whoever can jailbreak its new AI safety system See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Keeping up with artificial intelligence can feel like drinking from a firehose. Every week brings a new breakthrough, a surprising application, or an urgent ethical debate. Last Week in AI, from the team at Skynet Today, is here to turn that torrent into a clear, digestible stream. Instead of getting lost in the noise, you'll get a thoughtful rundown of the developments that actually have impact, explained without unnecessary jargon. Each episode feels like a conversation with well-informed friends who have done the homework for you, sifting through research papers, product launches, and industry announcements to highlight what's substantive. You'll hear nuanced discussions that go beyond the headlines, considering the real-world implications of new models, policy shifts, and corporate moves in the tech landscape. This podcast doesn't just tell you what happened; it provides context on why it matters for developers, businesses, and society at large. It’s an efficient way to stay informed and critically engaged with a field that is reshaping our world at a breathtaking pace. Tune in for a consistently insightful analysis that makes the complex world of AI feel accessible and relevant, week after week.
Author: Language: English Episodes: 100

Last Week in AI
Podcast Episodes
#191 - Sora leak, Pixtral Large, OpenAI email archives [not-audio_url] [/not-audio_url]

Duration: 1:42:11
Our 191st episode with a summary and discussion of last week's big AI news! Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladsto…
#190 - AI scaling struggles, OpenAI Agents, Super Weights [not-audio_url] [/not-audio_url]

Duration: 1:37:21
Our 190th episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's Hosted by Andrey Kurenkov and Jeremie Harris. Note from Andrey: this one is coming out a bit later than planned…
#189 - Chat.com, FrontierMath, Relaxed Transformers, Trump & AI [not-audio_url] [/not-audio_url]

Duration: 1:42:46
Our 189th episode with a summary and discussion of last week's big AI news! Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladsto…
#188 - ChatGPT+Search, OpenAI+AMD, SimpleQA, π0 [not-audio_url] [/not-audio_url]

Duration: 1:51:50
Our 188th episode with a summary and discussion of last week's big AI news! Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladsto…
#186 - Adobe AI Tools, Tesla's Cybercab, Nobel Prizes [not-audio_url] [/not-audio_url]

Duration: 1:33:54
Our 186th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov and guest host Jon Krohn from the SuperDataScience Podcast. Check out Jon’s upcoming agent-focused event here - AI Ca…
#185 - Movie Gen, ChatGPT Canvas, OpenAI's VC Round, SB 1047 Vetoed [not-audio_url] [/not-audio_url]

Duration: 1:29:37
Our 185th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov and guest host Gavin Purcell from the AI for Humans podcast. Read out our text newsletter and comment on the podcast…
#183 - OpenAI o1, Adobe vid gen, Reflection 70B, DeepMind AlphaProteo [not-audio_url] [/not-audio_url]

Duration: 1:48:16
Our 183rd episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov and Jeremie Harris. Note: once again, apologies from Andrey on this one coming out late. Starting with the next one w…
# 182 - Alexa 2.0, MiniMax, Surskever raises $1B, SB 1047 approved [not-audio_url] [/not-audio_url]

Duration: 1:38:47
Our 182nd episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov and Jeremie Harris. Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. If you would l…