Teorimoduler - IDI

TDT13 - Advanced Text Analytics and Language Understanding

Norsk: Avansert tekstanalyse og språkforståelse, 2022

Given by Björn Gambäck.

The course consists of a set of regular lectures and student presentations.

Note that the course content in general is tailored towards the needs of the students writing a Master's Thesis in Language Technology and/or have taken the course TDT4310 (Intelligent Text Analytics and Language Understanding/Intelligent tekstanalyse og språkforståelse), or something similar (e.g., during an exchange visit abroad).
Other students are of course very welcome to follow the course, but are adviced to contact the lecturer beforehand.
(However, a short introduction to/overview of language technology will be included at the beginning of the course.)

This year we will in particular discuss:

  • semantic representations and processing,
  • word embeddings and word-space modelling,
  • transfer learning, transformers and self-attention,
  • and in general classification algorithms for language processing, applied to issues such as:
    • sentiment analysis
    • author profiling
    • hate speech detection
    • native language identification

Course Material

The course material (lecture recordings and slides, articles, etc.) will be published in the course Teams group.


The grading will be based on the oral student presentations and a written report on the same subject, with presentation/report themes selected by the students together with the lecturer.

Course Schedule

The course will start in week 38, on Tuesday 20.9 at 12:15-14:00 in room 454, which is on the 4th floor of IT-bygget (Map).

The lectures thereafter will be every second Tuesday at 11:15-13:00, also in room 454 (so tentatively 4.10, 18.10, 1.11 and 15.11).

The student presentations (examination) will be scheduled at different time slots during the week 28.11 - 2.12.

The lectures and presentations will primarily be onsite, but tentatively available in hybrid format (in-person and online in parallel). All meetings in the course will be in Room 454 (IT-bygget, 4th floor), which is the IDI department's largest meeting room (so social distancing should be no problem).

Preliminary schedule:

  1. Tuesday September 20, 12:15-14:00, room 454 + Teams: Lecture 1, Introduction
  2. Tuesday October 4, 11:15-13:00, room 454 + Teams: Lecture 2, Machine Learning and Deep Learning for Natural Language Processing
  3. Tuesday October 18, 11:15-13:00, room 454 + Teams: Student thesis topic presentations
  1. Tuesday November 1, 11:15-13:00, room 454 + Teams: Lecture 3, Linguistic Meaning, Evaluation and Sentiment Analysis
  2. Tuesday November 15, 11:15-13:00, room 454 + Teams: Lecture 4, Semantics and Digital Forensics

Visiting hour

by appointment

For nærmere informasjon om emnet, kontakt faglærer Björn Gambäck.

954 Visninger