Advanced Topics in MultiModal Machine Learning
11-877 • Spring 2023 • Carnegie Mellon University
Multimodal machine learning (MMML) is a vibrant multi-disciplinary research field which addresses some of the original goals of artificial intelligence by integrating and modeling multiple communicative modalities, including language, vision, and acoustic. This research field brings some unique challenges for multimodal researchers given the heterogeneity of the data and the contingency often found between modalities. This course is designed to be a graduate-level course covering recent research papers in multimodal machine learning, including technical challenges with representation, alignment, reasoning, generation, co-learning and quantification. The main goal of the course is to increase critical thinking skills, knowledge of recent technical achievements, and understanding of future research directions.
- Time: Friday 11:00am-12:30pm
- Location: GHC 5222
- Discussion and Q&A: Piazza
- Assignment submissions: Canvas (for registered students only)
- Contact: Students should ask all course-related questions on Piazza, where you will also find announcements.
- Instructor Louis-Philippe Morency
- Email: morency@cs.cmu.edu
- Instructor Paul Liang
- Email: pliang@cs.cmu.edu
Announcements
Jan 21, 2023 | Welcome to 11-877, Advanced Topics in Multimodal Machine Learning, Spring 2023! |