Advanced Topics in Computer Science: AI Safety and Alignment
COS 597Q
1242
1242
Info tab content
What existential and social risks do modern AI systems and methods pose? How can we mitigate these risks and ensure that AI systems are safe and aligned with the intention of their builder? What is currently being done to ensure that AI systems are safe? Are current safeguards sufficient and if not how can we improve upon them? These are the questions we consider in this advanced seminar. Topics include algorithms and optimization-based methods for alignment, reinforcement learning from human feedback, the mind-body problem as applied to AI, accountability and the free choice problem, and more.
Instructors tab content
Sections tab content
Section L01
- Type: Lecture
- Section: L01
- Status: O
- Enrollment: 35
- Capacity: 50
- Class Number: 23020
- Schedule: W 01:30 PM-04:20 PM - Robertson Hall 016