Advanced Topics in Computer Science: AI Safety and Alignment

COS 597Q

1242

Info tab content

What existential and social risks do modern AI systems and methods pose? How can we mitigate these risks and ensure that AI systems are safe and aligned with the intention of their builder? What is currently being done to ensure that AI systems are safe? Are current safeguards sufficient and if not how can we improve upon them? These are the questions we consider in this advanced seminar. Topics include algorithms and optimization-based methods for alignment, reinforcement learning from human feedback, the mind-body problem as applied to AI, accountability and the free choice problem, and more.

Instructors tab content

Instructor
Elad Hazan

Sections tab content

Section L01

Type: Lecture
Section: L01
Status: O
Enrollment: 35
Capacity: 50
Class Number: 23020
Schedule: W 01:30 PM-04:20 PM - Robertson Hall 016