Towards NLP-based Processing of Honeypot Logs

This webpage contains supplementary material for the paper:

“Towards NLP-based Processing of Honeypot Logs”

Published in: 2022 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW)

The data

You can find the data at this link:


Each line represents one interaction between Cowrie and the attacker. The column “session_id” is the session identifier. The column “statements” contains the list of statements associated with that interaction. The column “timestamps” contains the datetimes at which the interaction occurred. To obtain the entire bash session (from login to logout), group the lines by session_id.
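The grouping step can be sketched in plain Python as follows. This is a minimal illustration, not the paper's code: the function name `rebuild_sessions`, the sample rows, and the on-disk format are all assumptions. In particular, it assumes that each row has already been parsed so that “statements” is a list of strings and “timestamps” is a list of ISO-8601 strings, and that the earliest timestamp of a row can be used to order interactions within a session.

```python
from collections import defaultdict
from datetime import datetime


def rebuild_sessions(rows):
    """Group interaction rows by session_id and return the ordered statements per session."""
    interactions = defaultdict(list)
    for row in rows:
        # Assumption: "timestamps" is a list of ISO-8601 strings;
        # the earliest one is used to order the interaction.
        first_seen = min(datetime.fromisoformat(ts) for ts in row["timestamps"])
        interactions[row["session_id"]].append((first_seen, row["statements"]))

    sessions = {}
    for session_id, items in interactions.items():
        # Sort the interactions of one session chronologically.
        items.sort(key=lambda item: item[0])
        # Flatten the per-interaction statement lists into one session.
        sessions[session_id] = [stmt for _, stmts in items for stmt in stmts]
    return sessions


# Made-up rows for illustration only; they are not taken from the dataset.
sample_rows = [
    {"session_id": "s1", "statements": ["uname -a"],
     "timestamps": ["2020-01-01T10:00:05"]},
    {"session_id": "s2", "statements": ["ls"],
     "timestamps": ["2020-01-01T11:00:00"]},
    {"session_id": "s1", "statements": ["cd /tmp", "ls"],
     "timestamps": ["2020-01-01T10:00:01", "2020-01-01T10:00:02"]},
]

sessions = rebuild_sessions(sample_rows)
print(sessions["s1"])
```

If the columns are stored as serialized strings rather than lists, they would need to be parsed before being passed to this function.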

The code

You can find the paper’s code at this link:

Toward NLP-Processing code