Schedule
Date | Lecture | Topics | ||
---|---|---|---|---|
9/1 |
Lecture 1.1: Course introduction [ slides | video ] |
Research and technical challenges |
||
9/3 |
Lecture 1.2: Multimodal applications and datasets [ slides | video ] |
Research tasks and datasets |
||
9/8 |
Lecture 2.1: Basic concepts: neural networks [ slides | video ] |
Language, visual and acoustic |
||
9/10 |
Lecture 2.2: Basic concepts: network optimization [ slides | video ] |
Gradients and backpropagation |
||
9/15 |
Lecture 3.1: Visual unimodal representations [ slides | video ] |
Convolutional kernels and CNNs |
||
9/17 |
Lecture 3.2: Language unimodal representations [ slides | video ] |
Gated networks and LSTM |
||
9/22 |
Lecture 4.1: Multimodal representation learning [ slides | video ] |
Multimodal auto-encoders |
||
9/24 |
Lecture 4.2: Coordinated representations [ slides | video ] |
Deep canonical correlation analysis |
||
9/29 |
Lecture 5.1: Multimodal alignment [ slides | video ] |
Explicit - dynamic time warping |
||
10/1 |
Lecture 5.2: Alignment and representation [ slides | video ] |
Multi-head attention |
||
10/6 | Lecture 6.1: First project assignment (live working sessions instead of lectures) | |||
10/8 | Lecture 6.2: First project assignment (live working sessions instead of lectures) | |||
10/13 |
Lecture 7.1: Alignment and translation [ slides | video ] |
Module networks |
||
10/15 |
Lecture 7.2: Probabilistic graphical models [ slides | video ] |
Dynamic Bayesian networks |
||
10/20 |
Lecture 8.1: Discriminative graphical models [ slides | video ] |
Conditional random fields |
||
10/22 |
Lecture 8.2: Deep Generative Models [ slides | video ] |
Variational auto-encoder |
||
10/27 |
Lecture 9.1: Reinforcement learning [ slides | video ] |
Markov decision process |
||
10/29 |
Lecture 9.2: Multimodal RL [ slides | video ] |
Policy gradients |
||
11/3 |
Lecture 10.1: Fusion and co-learning [ slides | video ] |
Multi-kernel learning and fusion |
||
11/5 |
Lecture 10.2: New research directions [ slides | video ] |
Recent approaches in multimodal ML |
||
11/10 | Lecture 11.1: Mid-term project assignment (live working sessions instead of lectures) | |||
11/12 | Lecture 11.2: Mid-term project assignment (live working sessions instead of lectures) | |||
11/17 |
Lecture 12.1: Embodied Language Grounding [ slides | video ] |
Connecting language to action |
||
11/19 |
Lecture 12.2: Multimodal Human-inspired Language Learning [ slides | video ] |
Grounded language learning |
||
11/24 | Lecture 13.1: Thanksgiving week (no lectures) | |||
11/26 | Lecture 13.2: Thanksgiving week (no lectures) | |||
12/1 |
Lecture 14.1: Learning to connect text and images [ slides | video ] |
Discourse approaches, text & images |
||
12/3 |
Lecture 14.2: Bias and fairness [ slides | video ] |
Computational Ethics |
||
12/8 | Lecture 15.1: Final project assignment (live working sessions instead of lectures) | |||
12/10 | Lecture 15.2: Final project assignment (live working sessions instead of lectures) |