Advanced Topics in Computer Science: Systems for Serving Generative AI
COS 597K
1252
Info tab content
Generative machine learning models, from large language models for chatbots to diffusion models for text-to-image generation, have become key players in important decisions and tasks across societal sectors. Unfortunately, their ability to engage in conversation and spark creativity is coupled with high costs (in both money and compute resources) and sometimes sluggish performance, especially given the millions of requests that such models must serve in production. This research-centric course examines a wide range of systems optimizations that enable large-scale serving of generative models.
Sections tab content
Section C01
- Type: Class
- Section: C01
- Status: O
- Enrollment: 10
- Capacity: 20
- Class Number: 23435
- Schedule: W 10:00 AM-11:20 AM - Computer Science Building 402