#196 - Nvidia Digits, Cosmos, PRIME, ICLR, InfAlign


Author: Skynet Today January 13, 2025 Duration: 1:46:34
Our 196th episode with a summary and discussion of last week's* big AI news!
*and sometimes last last week's
Recorded on 01/10/2024
Join our brand new Discord here! https://discord.gg/nTyezGSKwP
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai
Check out our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- Nvidia announced a $3,000 personal AI supercomputer called Digits, featuring the GB10 Grace Blackwell Superchip, aiming to lower the barrier for developers working on large models.
- The U.S. Department of Justice finalized a rule restricting the transmission of specific data types to countries of concern, including China and Russia, under Executive Order 14117.
- Meta allegedly trained Llama on pirated content from LibGen, with internal concerns about the legality confirmed through court filings.
- Microsoft paused construction on a section of a large data center project in Wisconsin to reassess it in light of recent technological changes.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links:
(00:00:00) Intro / Banter
(00:04:52) Sponsor Break
Tools & Apps
(00:05:55) Nvidia announces $3,000 personal AI supercomputer called Digits
(00:10:23) Meta removes AI character accounts after users criticize them as ‘creepy and unnecessary’
Applications & Business
(00:16:16) NVIDIA Is Reportedly Focused Towards “Custom Chip” Manufacturing, Recruiting Top Taiwanese Talent
(00:21:54) AI start-up Anthropic closes in on $60bn valuation
(00:25:38) Why OpenAI is Taking So Long to Launch Agents
(00:30:08) TSMC Set to Expand CoWoS Capacity to Record 75,000 Wafers in 2025, Doubling 2024 Output
(00:33:10) Microsoft 'pauses construction' on part of data center site in Mount Pleasant, Wisconsin
(00:37:23) Google folds more AI teams into DeepMind to ‘accelerate the research to developer pipeline’
Projects & Open Source
(00:41:59) Cosmos World Foundation Model Platform for Physical AI
(00:48:21) Microsoft releases Phi-4 language model on Hugging Face
Research & Advancements
(00:50:16) PRIME: Online Reinforcement Learning with Process Rewards
(00:58:29) ICLR: In-Context Learning of Representations
(01:07:38) Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
(01:11:44) METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring
(01:15:45) TransPixar: Advancing Text-to-Video Generation with Transparency
(01:18:03) The amount of compute used to train frontier models has been growing at a breakneck pace of over 4x per year since 2018, resulting in an overall scale-up of more than 10,000x! But what factors are enabling this rapid growth?
Policy & Safety
(01:23:45) InfAlign: Inference-aware language model alignment
(01:28:44) Mark Zuckerberg gave Meta’s Llama team the OK to train on copyrighted works, filing claims
(01:33:19) Anthropic gives court authority to intervene if chatbot spits out song lyrics
(01:35:57) US government says companies are no longer allowed to send bulk data to these nations
(01:39:10) Trump announces $20B plan to build new data centers in the US
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Keeping up with artificial intelligence can feel like drinking from a firehose. Every week brings a new breakthrough, a surprising application, or an urgent ethical debate. Last Week in AI, from the team at Skynet Today, is here to turn that torrent into a clear, digestible stream. Instead of getting lost in the noise, you'll get a thoughtful rundown of the developments that actually have impact, explained without unnecessary jargon. Each episode feels like a conversation with well-informed friends who have done the homework for you, sifting through research papers, product launches, and industry announcements to highlight what's substantive. You'll hear nuanced discussions that go beyond the headlines, considering the real-world implications of new models, policy shifts, and corporate moves in the tech landscape. This podcast doesn't just tell you what happened; it provides context on why it matters for developers, businesses, and society at large. It’s an efficient way to stay informed and critically engaged with a field that is reshaping our world at a breathtaking pace. Tune in for a consistently insightful analysis that makes the complex world of AI feel accessible and relevant, week after week.
Language: English Episodes: 100

Last Week in AI
Podcast Episodes
#180 - Ideogram v2, Imagen 3, AI in 2030, Agent Q, SB 1047

Duration: 2:05:23
Our 180th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) If you would like to ge…
#179 - Grok 2, Gemini Live, Flux, FalconMamba, AI Scientist

Duration: 1:58:26
Our 179th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) If you would like to ge…
#178 - More Not-Acquihires, More OpenAI drama, More LLM Scaling Talk

Duration: 2:05:37
Our 178th episode with a summary and discussion of last week's big AI news! NOTE: this is a re-upload with fixed audio, my bad on the last one! - Andrey With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) an…
#177 - Instagram AI Bots, Noam Shazeer -> Google, FLUX.1, SAM2

Duration: 1:52:58
Our 177th episode with a summary and discussion of last week's big AI news! With guest co-host Jon Krohn from the super data science podcast (https://www.superdatascience.com/podcast)! If you'd like to listen to the inte…
#176 - BIG WEEK for OSS! SearchGPT, Llama 3.1 405B, Mistral Large 2

Duration: 1:25:45
Our 176th episode with a summary and discussion of last week's big AI news! NOTE: apologies for this episode coming out about a week late, things got in the way of editing it... With hosts Andrey Kurenkov (https://twitte…
#175 - GPT-4o Mini, OpenAI's Strawberry, Mixture of A Million Experts

Duration: 1:47:29
Our 175th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) In this episode of Last…
#174 - Odyssey Text-to-Video, Groq LLM Engine, OpenAI Security Issues

Duration: 2:04:25
Our 174th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) In this episode of Last…
#173 - Gemini Pro, Llama 400B, Gen-3 Alpha, Moshi, Supreme Court

Duration: 1:49:49
Our 173rd episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) See full episode notes…
#172 - Claude and Gemini updates, Gemma 2, GPT-4 Critic

Duration: 1:51:04
Our 172nd episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) Read out our text newsl…