Social networks allow personalities and companies to communicate directly with the public, without the filter of traditional media. People rely on social networks to keep up to date with their hobbies or interest. Recently the political debate has also moved online. Here we present an ongoing study on the interactions […]
Open Datasets & Libraries
This repository contains data and information regarding for the paper M. Trevisan, D. Giordano, I. Drago, M. Mellia, M. Munafò, “Five Years at the Edge: Watching Internet From the ISP Network,” in IEEE/ACM Transactions on Networking, vol. 28, no. 2, pp. 561-574, April 2020, doi: 10.1109/TNET.2020.2967588. A preliminary version of the […]
We make available to the community our categorization scripts (e.g., how categories or sections assigned by editors to articles are grouped together to form broad categories) as well as a sample of 80K categorized articles using this method (dataset Published-articles in our paper). This should give the reader (who masters […]
This simulator is a trace-drive simulator which relies on the real data coming from Car2go, a car sharing provider working in 25 cities spread around the world. Please notice that the code is available, but the data is subjected to some restrictions. The simulator has been used in the following […]
The data are available to the community in anonymized form for further investigation at the following link (external) The dataset has been used for the following papers: A. Morichetta, M. Trevisan, L. Vassio, J. Krickl. Understanding web pornography usage from traffic analysis. Computer Networks, Elsevier, 189, 107909, 2021. DOI: 10.1016/j.comnet.2021.107909 […]
Create publication-quality plots with a simple interface over matplotlib. Are you bored of copying and pasting the code to make a plot every time? Try this! This module provides only one (highly customizable) function to plot some data. It uses matplotlib in its internal, but helps in setting all graphical […]
In this page we show the results of applying LENTA to a HTTPS trace we collected from volunteers. For the details on the methodology – please check the paper Morichetta, Andrea, and Marco Mellia. “LENTA: Longitudinal Exploration for Network Traffic Analysis.” 2018 30th International Teletraffic Congress (ITC 30). Vol. 1. IEEE, […]
This repository contains the source code of the core part of AWESoME (Github). In particular, you can find the source code of the training and the classification modules. Moreover, a sample dataset is provided. It contains the traffic generated by an automatic browser visiting a (quite large) set of popular […]
This page contains the DNS datasets used by REMeDy for automatic identification of DNS manipulations. The source code of REMeDy can be found here. These datasets come from passive measurements and have been collected in real operational networks. They are CSV file, where an entry corresponds to a DNS response. […]
Summary Here you can find scripts and datasets of anonymized clickstreams presented in the following papers: Vassio, L., Drago, I., Mellia, M., Ben-Houidi, Z., and Lamali, M.L.. (2018) You, the Web and Your Device: Longitudinal Characterization of Browsing Habits. ACM Transactions on the Web 12, 4, Article 24 (September 2018), 30 […]