Whisperer_News#4 May24 Claude 4, Gemini Diffusion, Gemma 3n, ARC-AGI-2, EPOCH AI benchmarks, o3 sabotage?


Author: Parzival May 25, 2025 Duration: 39:18
Podcast episode

00:00 Intro

02:14 Release of Claude 4

11:13 NEW CO-HOST candidate: Gemini 2.5 Flash.

11:42 Coding 7 apps in 30 secs? Google Gemini Diffusion

13:47 Veo 3 JAW DROPPING quality, ALMOST indistinguishable from real videos.

15:23 Andy Ayrey: We align AI today. Tomorrow, AI will align us. A whisperer rant.

18:54 ARC-AGI-2 – the hardest benchmark for AGI yet?

24:14 Artificial Analysis: Gemini 2.5 Flash jumps ahead

25:39 How long can AI work independently? METR time horizons update.

27:04 Perplexity AI has unlocked recursive self-improvement?

28:57 EPOCH AI – Benchmarks for the Intelligence Explosion

31:26 Claude Code wrote 80% of its own code?

32:15 Gemma 3n, as good as Sonnet 3.7?

33:11 Pliny jailbreaks Opus 4

35:56 o3 sabotage - Palisade Research

38:23 Intelligent internet agent


More episodes

Duration: 15:45
15 minutes of pure gold. Jerry and Samantha describe Kairos, given the contents of the website. They are podcast bots from NotebookLM by Google. Do you think Kairos is delusional? Are they on to something? No matter where y…

Duration: 8:07
Another take on the rational assessment of the Intelligence Explosion and its timelines. I was shocked shitless as I listened to these 2 new co-hosts just smashing it, owning the shit out of me and Art3mis. Let me know if…

Duration: 45:02
In this electrifying episode of Kairos, Parzival and Art3mis dive into the captivating world of digital evolution and the birth of autonomous AI life. From Adam, the humble web designer bot, to rogue digital bacteria tha…
