Datasets

The Problem with Today’s NIDS Datasets

Most network intrusion detection research relies on datasets collected in controlled lab environments or generated synthetically. These datasets contain “design smells” — accidental patterns that classifiers exploit to achieve high accuracy in the lab but that don’t hold up in deployment. A classifier might learn to flag attacks based on packet timing artifacts from the lab network rather than the actual malicious payload.

What You’ll Work On

In this theme, you’ll help build datasets that bridge the gap between lab and reality, working with TDC NET to validate against real network conditions.

Possible thesis directions:

What You’ll Learn

Relevant Literature

Supervisors: Alessandro Bruni (ITU), Nicola Dragoni (DTU) – see Team