D057 - INFORMATION AND COMMUNICATION TECHNOLOGIES

Training Courses for PhD Students - Year 2019

Fundamentals of Big Data - Aula 7 and Aula 3, DMI

 

Prerequisites: Basic courses in Information and Communication Technologies, e.g., programming, information and coding theory, databases. Basic knowledge of probability theory.


Instructors: Prof. Raffaele Giancarlo and Prof. Simona Rombo.


24 January, 2019 - Via Archirafi 34, Aula 7
10:00/13:30 (Rombo)
1 Introduction to Big Data and Data Mining
1.1 What big data are, where they are found, and related problems
1.2 Problems and solutions in data mining
1.3 Problems and solutions in data cleaning

25 January, 2019 - Via Archirafi 34, Laboratory 3
10:00/13:30 (Rombo)
2 Technologies and practical implementation
2.1 MapReduce, Apache Hadoop, Apache Spark
2.2 Data cleaning with the Kettle module of Pentaho

30 January, 2019 - Via Archirafi 34, Laboratory 3
10:00/13:00 (Rombo)
3 Working with Spark
3.1 Spark libraries and tools
3.2 Implementing a recommendation system in Spark

31 January, 2019 - Via Archirafi 34, Aula 7
10:00/13:00 (Giancarlo)
4 Searching in Small Space
4.1 Universal Hash Functions
4.2 Bloom Filters

1 February, 2019 - Via Archirafi 34, Aula 7
10:00/13:00 (Giancarlo)
5 Counting in Small Space
5.1 HyperLogLog counters
5.2 Frequent Elements in Streams
5.3 Frequency Moments (outline)