Multimodal Language Understanding aims to use information from different sources such as text, speech, images, and gestures, to enhance language processing tasks. As we naturally use multiple forms of communication in our daily interactions, enabling machines to do the same enhances their understanding of human communication. For example, sentiment analysis can be improved by incorporating tone of voice or facial expressions alongside text. In this class, we will explore techniques for modeling multiple modalities, identify tasks that benefit from multimodal input, and discuss the challenges when handling multiple modalities.
This course will include reading, writing, and discussion and is intended for students from Computer Science, Linguistics, and related areas. Knowledge in AI is required, including having taken introductory courses in AI, ML or NLP.
Feel free to email at [email protected] if you have any questions.
Building C7 3 - Seminar room 1.14, Mondays 8:30-10:00, First class starts on 20th April
| Tentative date | Topics/Agenda of Discussion | Tentative Papers |
|---|---|---|
| 20/04 | Introduction Class | |
| 27/04 | PREPERATION (No seminar) | To send top three preferences of topic via email by 24/04. This counts as final registration. Total slots is 16. Applicants via the SIC seminar system should also send this. Will be notified via email regarding assigned topic and papers by 27/04. |
| 04/05-11/05 | PREPERATION (No seminar) | Mandatory Feedback Meetings for Presenters until 08/06 to be scheduled during this week |
| 18/05 | The Bag-of-Words Problem in Vision | 1. When and why vision-language models behave like bags-of-words, and what to do about it? |
4 credits • 10% - Attendance and participation: Attendance to all talks and active participation in class • 20% - Weekly Quesions: Send via email, Moderation (Moderators will get the questions sent and you have to collate and chair the discussion by bringing out main questions to presenter) • 70% - Paper Presentation:
Paper Presentation: Lead the discussion on the assigned paper.
7 Credits • 10% - Attendance and participation:(as above) • 20% - Weekly Questions: Send via email, Moderation • 30% - Paper Presentation (as above) • 40% - Hands-on Implementation and Writeup (6-8 pages)
Hands-on Implementation and Writeup: