TDT13 - Advanced Text Analytics and Language Understanding - Kunnskapsbasen
TDT13 - Advanced Text Analytics and Language Understanding
Norsk: Avansert tekstanalyse og språkforståelse, 2024
Given by Björn Gambäck.
The course consists of a set of regular lectures and student presentations.
Note that the course content in general is tailored towards the needs of the students writing a Master's Thesis in Language Technology and that the course in 2024 is open only to those students. Previous years it's been possible to accommodate other students, but not this year, due to the renovation work on the IT-building, which restricts available rooms and their size.
Not in 2024: Students writing a thesis on some other topic are of also welcome to follow the course, but should contact the lecturer before registering.
In 2024: Students who do not follow the course may request access to the lecture slides.
Preference will be given to students have taken the course TDT4310 (Intelligent Text Analytics and Language Understanding/Intelligent tekstanalyse og språkforståelse), or something similar (e.g., during an exchange visit abroad). However, an introduction to/overview of language technology will be included at the beginning of the course.
This year, we will in particular discuss:
- word embeddings and word-space modelling,
- transfer learning, transformers and self-attention,
- computational linguistic creativity,
- semantic representations and processing,
- and in general classification algorithms for language processing, applied to issues such as:
- sentiment analysis
- author profiling
- hate speech
- native language identification
- figurative language
Course Material
The course material (slides, articles, etc.) will be published in the course Teams group.
Examination
The grading will be based on the oral student presentations and a written report on the same subject, with presentation/report themes selected by the students together with the lecturer. The project and presentations can be carried out individually or in groups of two students working together.
Course Schedule
The course will start on Thursday 29.8 at 13:15-15 (week 35) in room 201, which is on the 2nd floor of IT Syd (Map).
Preliminary schedule (with all meetings in room 201):
- Lecture 1 (29.8, 13:15): Introduction (to the course, to language, and to Language Technology)
- Lecture 2 (16.9, 13:15): Machine Learning and Deep Learning for Natural Language Processing
- Lecture 3 (17.9, 13:15): Linguistic Meaning, Semantics and Sentiment Analysis
- Lecture 4 (7.10, 10:15): Digital Forensics, Computational Linguistic Creativity and Evaluation
- Student thesis topic presentations (7.10, 13:15)
- Student TDT13 project presentations
The student presentations (examination) will be tentatively scheduled at different time slots during weeks 40/41 and 46, also in room 201.
The lectures and presentations will be onsite, but will possibly be available to follow also online (though only in case students registered for the course have valid reasons not to attend a specific lecture in person; the student presentations will be in-person only).
Visiting hour
by appointment
For more information about the course, please contact Prof. Björn Gambäck.