Welcome to Big Data Integration
Organizer: Matteo Lissandrini, Katja Hose
Lecturers: Giovanni Simonini, University of Modena and Reggio Emilia (Italy)
ECTS: 2
Date/Time: June 2024
Deadline: 10 May 2024
Max no. Of participants: 20
Description: The course aims at illustrating recent advancements in the field of big data integration from both the practical and methodological perspective. In particular, the focus will be on tools and techniques for large and heterogenous datasets, such as data lakes and open data. The main tackled topics will be: (i) Data discovery; (ii) Entity Resolution, i.e., the task of identifying and integrating records that refer to the same real-world entity in different datasets when an explicit identifier is not provided; (iii) data preparation, i.e., the set of preprocessing operations performed to transform the data at the structural and syntactical level.
Prerequisites: Familiarity with a programming language.
Learning objectives: Students will learn core techniques and technologies for the tasks of (i) Data discovery; (ii) Entity Resolution; (iii) data preparation.
Disclaimer:
DDSA has explicit permission from Arcanic and the owners of the https://phdcourses.dk/ website to display the courses on ddsa.dk.