Presenter: Luca Colomba
Monday, April 27th, 2022 17:00
Location: SmartData@Covivio
Clustering is among the most popular unsupervised data mining tasks. Within the years, density-based clustering algorithms proved to be one of the most prominent strategies to identify clusters and noisy points. Such methodology encompasses a large variety of clustering algorithms that focus on the identification of core points, i.e. points in cluster centers. The main drawback is the difficulty in handling clusters with different densities and thus several variants of the DBSCAN algorithm were proposed.
Recently, the data mining community has explored alternative approaches towards cluster identification, aimed at identifying cluster borders instead of cluster centers. In this SmartTalk, the proposed Bridge-Aware Clustering algorithm is presented: it relies on the concept of bridge point to separate different clusters, i.e. points whose neighborhood includes points of different clusters, and label propagation. To correctly identify bridge points without any prior cluster information, we leverage on a well-known unsupervised outlier detection technique.
We validated our methodology with over 20 datasets with different density distributions.
Biography: Luca Colomba is a second-year PhD Student at Dipartimento di Automatica e Informatica, Politecnico di Torino and member of the SmartData@Polito center. His research focuses on big data analytics, applications of machine learning and data mining techniques to spatio-temporal data. He received his MSc degree in Computer Engineering from Politecnico di Torino in 2019.