Wikier

Teorimoduler - IDI

TDT13 - Advanced Text Analytics and Language Understanding

Norsk: Avansert tekstanalyse og språkforståelse, 2023

Given by Björn Gambäck.

The course consists of a set of regular lectures and student presentations.

Note that the course content in general is tailored towards the needs of the students writing a Master's Thesis in Language Technology.
Students writing a thesis on some other topic are of course also very welcome to follow the course, but should contact the lecturer before registering.


Overall, preference will be given to students have taken the course TDT4310 (Intelligent Text Analytics and Language Understanding/Intelligent tekstanalyse og språkforståelse), or something similar (e.g., during an exchange visit abroad). However, an introduction to/overview of language technology will be included at the beginning of the course.

This year we will in particular discuss:

  • semantic representations and processing,
  • word embeddings and word-space modelling,
  • transfer learning, transformers and self-attention,
  • and in general classification algorithms for language processing, applied to issues such as:
    • sentiment analysis
    • author profiling
    • hate speech detection
    • native language identification

Course Material

The course material (lecture recordings and slides, articles, etc.) will be published in the course Teams group.

Examination

The grading will be based on the oral student presentations and a written report on the same subject, with presentation/report themes selected by the students together with the lecturer.

Course Schedule

The course will start in week 35, on Wednesday 30.8 at 10:15-12:00 in room 454, which is on the 4th floor of IT-bygget (Map).

The lectures thereafter will be on Friday 1.9, Wednesday 20.9 and Friday 22.9, all of them also at 10:15-12:00 in room 454.

The student presentations (examination) will be scheduled at different time slots during weeks 40 and 48 (tentatively 4.10 and 6.10 resp 29.11-1.12), also in room 454.

The lectures and presentations will be onsite, but possibly available to follow also online and/or recorded.

Preliminary schedule:

  1. Wednesday August 30, 10:15-12:00, room 454: Lecture 1, Introduction
  2. Friday September 1, 10:15-12:00, room 454: Lecture 2, Machine Learning and Deep Learning for Natural Language Processing
  3. Wednesday September 20, 10:15-12:00, room 454: Lecture 3, Linguistic Meaning, Evaluation and Sentiment Analysis
  4. Friday September 22, 10:15-12:00, room 454: Lecture 4, Semantics and Digital Forensics
  5. Wednesday October 4, 10:15-12:00, room 454: Student thesis topic presentations
  6. Friday October 6, 10:15-12:00, room 454: Student thesis topic presentations
  7. Wednesday November 29, 10:15-12:00, room 454: Student TDT13 project presentations
  8. Thursday November 30, 10:15-12:00 OR 13:15-15, room 454: Student TDT13 project presentations
  9. Friday December 1, 10:15-12:00, room 454: Student TDT13 project presentations

Visiting hour

by appointment


For more information about the course, please contact Prof. Björn Gambäck.

2730 Visninger