
Advanced Topics in Computer Science: AI Safety and Alignment

COS 597Q

What existential and social risks do modern AI systems and methods pose? How can we mitigate these risks and ensure that AI systems are safe and aligned with the intentions of their builders? What is currently being done to ensure that AI systems are safe? Are current safeguards sufficient, and if not, how can we improve upon them? These are the questions we consider in this advanced seminar. Topics include algorithms and optimization-based methods for alignment, reinforcement learning from human feedback, the mind-body problem as applied to AI, accountability and the free choice problem, and more.

Section L01