Special Topics in Statistics and Operations Research: Transformers and Large Language Models
ORF 570
1254
Info tab content
This course explores cutting-edge aspects of transformers and large language models, which have revolutionized natural language processing and various other domains in artificial intelligence. Key topics include transformer architecture fundamentals, self-attention mechanisms and positional encodings, probabilistic foundations of language modeling and sequence prediction, pretraining strategies and transfer learning in language models, scaling laws and the implications of model size on performance, fine-tuning techniques for specific tasks and domains, and efficiency improvements and model compression techniques.
Sections tab content
Section L01
- Type: Lecture
- Section: L01
- Status: O
- Enrollment: 0
- Capacity: 40
- Class Number: 40488
- Schedule: TTh 03:00 PM-04:20 PM