From the archive: Aligning AI with our values

Author: Kensy Cooperrider – Diverse Intelligences Summer Institute October 18, 2023 Duration: 1:23:12

Many Minds

Hi friends, we're on hiatus for the fall. To tide you over, we're putting up some favorite episodes from our archives. Enjoy!

----

[originally aired February 17, 2021]

Guess what folks: we are celebrating a birthday this week. That's right, Many Minds has reached the ripe age of one year old. Not sure how old that is in podcast years, exactly, but it's definitely a landmark that we're proud of. Please no gifts, but, as always, you're encouraged to share the show with a friend, write a review, or give us a shout out on social.

To help mark this milestone we've got a great episode for you. My guest is the writer, Brian Christian. Brian is a visiting scholar at the University of California Berkeley and the author of three widely acclaimed books: The Most Human Human, published in 2011; Algorithms To Live By, co-authored with Tom Griffiths and published in 2016; and most recently, The Alignment Problem. It was published this past fall and it's the focus of our conversation in this episode.

The alignment problem, put simply, is the problem of building artificial intelligences—machine learning systems, for instance—that do what we want them to do, that both reflect and further our values. This is harder to do than you might think, and it's more important than ever.

As Brian and I discuss, machine learning is becoming increasingly pervasive in everyday life—though it's sometimes invisible. It's working in the background every time we snap a photo or hop on Facebook. Companies are using it to sift resumes; courts are using it to make parole decisions. We are already trusting these systems with a bunch of important tasks, in other words. And as we rely on them in more and more domains, the alignment problem will only become that much more pressing.

In the course of laying out this problem, Brian's book also offers a captivating history of machine learning and AI. Since their very beginnings, these fields have been formed through interaction with philosophy, psychology, mathematics, and neuroscience. Brian traces these interactions in fascinating detail—and brings them right up to the present moment. As he describes, machine learning today is not only informed by the latest advances in the cognitive sciences, it's also propelling those advances.

This is a wide-ranging and illuminating conversation folks. And, if I may say so, it's also an important one. Brian makes a compelling case, I think, that the alignment problem is one of the defining issues of our age. And he writes about it—and talks about it here—with such clarity and insight. I hope you enjoy this one. And, if you do, be sure to check out Brian's book.

Happy birthday to us—and on to my conversation with Brian Christian. Enjoy!

A transcript of this show is available here.

Notes and links

7:26 - Norbert Wiener's article from 1960, 'Some moral and technical consequences of automation'.

8:35 - 'The Sorcerer's Apprentice' is an episode from the animated film, Fantasia (1940). Before that, it was a poem by Goethe.

13:00 - A well-known incident in which Google's nascent auto-tagging function went terribly awry.

13:30 - The 'Labeled Faces in the Wild' database can be viewed here.

18:35 - A groundbreaking article in ProPublica on the biases inherent in the Correctional Offender Management Profiling for Alternative Sanctions (COMPAS) tool.

25:00 – The website of the Future of Humanity Institute, mentioned in several places, is here.

25:55 - For an account of the collaboration between Walter Pitts and Warren McCulloch, see here.

29:35- An article about the racial biases built into photographic film technology in the 20th century.

31:45 - The much-investigated Tempe crash involving a driverless car and a pedestrian:

37:17 - The psychologist Edward Thorndike developed the "law of effect." Here is one of his papers on the law.

44:40 - A highly influential 2015 paper in Nature in which a deep-Q network was able to surpass human performance on a number of classic Atari games, and yet not score a single point on 'Montezuma's Revenge.'

47:38 - A chapter on the classic "preferential looking" paradigm in developmental psychology:

53:40 - A blog post discussing the relationship between dopamine in the brain and temporal difference learning. Here is the paper in Science in which this relationship was first articulated.

1:00:00 - A paper on the concept of "coherent extrapolated volition."

1:01:40 - An article on the notion of "iterated distillation and amplification."

1:10:15 - The fourth edition of a seminal textbook by Stuart Russell and Peter Norvig, AI a Modern approach, is available here: http://aima.cs.berkeley.edu/

1:13:00 - An article on Warren McCulloch's poetry.

1:17:45 - The concept of "reductions" is central in computer science and mathematics.

Brian Christian's end-of-show reading recommendations:

The Alignment Newsletter, written by Rohin Shah

Invisible Women, by Caroline Criado Perez:

The Gardener and the Carpenter, Alison Gopnik:

You can keep up with Brian at his personal website or on Twitter.

Many Minds is a project of the Diverse Intelligences Summer Institute, which is made possible by a generous grant from the Templeton World Charity Foundation to UCLA. It is hosted and produced by Kensy Cooperrider, with help from Assistant Producer Urte Laukaityte and with creative support from DISI Directors Erica Cartmill and Jacob Foster. Our artwork is by Ben Oldroyd. Our transcripts are created by Sarah Dopierala.

Subscribe to Many Minds on Apple, Stitcher, Spotify, Pocket Casts, Google Play, or wherever you listen to podcasts. You can also now subscribe to the Many Minds newsletter here!

We welcome your comments, questions, and suggestions. Feel free to email us at: manymindspodcast@gmail.com.

For updates about the show, visit our website or follow us on Twitter: @ManyMindsPod.

Many Minds

There's a quiet revolution happening in how we understand intelligence, and it's not just about humans. Many Minds, hosted by Kensy Cooperrider of the Diverse Intelligences Summer Institute, digs into this expansive idea. Each episode is a journey into the inner worlds of creatures and creations we share the planet with. You'll hear from researchers who decode the complex social minds of crows, who map the sensory universe of an octopus, or who grapple with the emerging cognition of artificial systems. This isn't a dry lecture series; it's a collection of thoughtful conversations that feel like pulling up a chair with experts who are genuinely redefining what it means to think, feel, and learn. The Many Minds podcast operates from a simple but profound premise: to grasp our own human experience, we need to listen to the many other kinds of minds around us. Tune in every other week for explorations that are as much about philosophy and wonder as they are about science and education, all grounded in rigorous research and a deep curiosity about the beings-animal, human, and artificial-that fill our world.

Author: Kensy Cooperrider – Diverse Intelligences Summer Institute Language: English Episodes: 100

Official website RSS

Podcast Episodes

[not-audio_url]

[/not-audio_url]

Howl, grunt, sing

06.03.2025

Duration: 1:13:38

The tree of life is a noisy place. From one branch come hoots and howls, from another come clicks and buzzes and whines. And coming from all over you hear the swell of song. But what is all this ruckus about? Why do so m…

[not-audio_url]

[/not-audio_url]

The development of evolution

20.02.2025

Duration: 1:36:43

Evolution is not what it used to be. A lot has changed since Darwin's day. In the first half of the 20th century, evolutionary theory was integrated with an emerging understanding of genetics. Late in the 20th century, b…

[not-audio_url]

[/not-audio_url]

String theories

06.02.2025

Duration: 1:21:28

Where would our species be without string? It's one of our most basic technologies—so basic that it's easy to overlook. But humans have used string—and its cousins rope, yarn, cordage, thread, etc.—for all kinds of purpo…

[not-audio_url]

[/not-audio_url]

The other half of the brain

23.01.2025

Duration: 59:39

Neurons have long enjoyed a kind of rock star status. We think of them as the most fundamental units of the brain—the active cells at the heart of brain function and, ultimately, at the heart of behavior, learning, and m…

[not-audio_url]

[/not-audio_url]

A paradox of learning

09.01.2025

Duration: 1:06:42

How do we learn? Usually from experience, of course. Maybe we visit some new place, or encounter a new tool or trick. Or perhaps we learn from someone else—from a teacher or friend or YouTube star who relays some shiny n…

[not-audio_url]

[/not-audio_url]

From the archive: The octopus and the android

25.12.2024

Duration: 1:25:39

Happy holidays, friends! We will be back with a new episode in January 2025. In the meantime, enjoy this favorite from our archives! ----- [originally aired Jun 14, 2023] Have you heard of Octopolis? It's a site off the…

[not-audio_url]

[/not-audio_url]

Your brain on language

12.12.2024

Duration: 1:32:56

Using language is a complex business. Let's say you want to understand a sentence. You first need to parse a sequence of sounds—if the sentence is spoken—or images—if it's signed or written. You need to figure out the me…

[not-audio_url]

[/not-audio_url]

Nestcraft

28.11.2024

Duration: 1:20:01

How do birds build their nests? By instinct, of course—at least that's what the conventional wisdom tells us. A swallow builds a swallow's nest; a robin builds a robin's nest. Every bird just follows the rigid template s…

[not-audio_url]

[/not-audio_url]

Animal, heal thyself

14.11.2024

Duration: 1:07:32

What happens to animals when they get sick? If they're pets or livestock, we probably call the vet. And the vet may give them drugs or perform a procedure. But what about wild animals? Do they just languish in misery? We…

[not-audio_url]

[/not-audio_url]

The rise of machine culture

31.10.2024

Duration: 1:20:17

The machines are coming. Scratch that—they're already here: AIs that propose new combinations of ideas; chatbots that help us summarize texts or write code; algorithms that tell us who to friend or follow, what to watch…