Skip to main content
Facilities Mobile homeCourses home
Detail

Special Topics in Statistics and Operations Research: Transformers and Large Language Models

ORF 570

1254
Info tab content
This course explores cutting-edge aspects of transformers and large language models, which have revolutionized natural language processing and various other domains in artificial intelligence. Key topics include transformer architecture fundamentals, self-attention mechanisms and positional encodings, probabilistic foundations of language modeling and sequence prediction, pretraining strategies and transfer learning in language models, scaling laws and the implications of model size on performance, fine-tuning techniques for specific tasks and domains, and efficiency improvements and model compression techniques.
Instructors tab content
Sections tab content

Section L01

  • Type: Lecture
  • Section: L01
  • Status: O
  • Enrollment: 0
  • Capacity: 40
  • Class Number: 40488
  • Schedule: TTh 03:00 PM-04:20 PM