Dataset Overview
This site provides an overview of available datasets providing network data that can be used for anomalie detection in computer networks.
Datasets Based on Research Network Traffic
Comprehensive, Multi-Source Cyber-Security Events
Link: https://csr.lanl.gov/data/cyber1/
Description: The dataset consists of network flow data collected within Los Alamos National Laboratoy's corporate, internal computer network. For more details see official website (see Link).
Content: Anonymized network flow data exported by several key routers within the internal computer network.
Format: CSV
Features: Time, Duration, Source Computer, Source Port, Destination Computer, Destination Port, Packet Count, Byte Count
Availability: Freely available
Date: 2015
FRPG Continous Flow Data
Link: https://ant.isi.edu/datasets/all.html
Description: This dataset contains of flows collected on a 1 Gbps link between FRGP.net and CenturyLink. The flows have origin in several academic institutions.
Content: tbd
Format: Argus file format
Features: tbd
Availability: Needs Requesting
Date: 2009-2020
SimpleWeb NetFlow Data
Link: https://www.simpleweb.org/wiki/index.php/Traces#NetFlow_Traces
Description: The flows this dataset contains of were exported by the central access router connecting a university to its ISP over a 10 Gbps link with actual loads between 650 Mbps and 1 Gbps.
Content: tbd
Format: NetFlow v5 Data
Features: Ingress interface, Source IP, Destination IP, IP Protocol, Source Port Destination Port, IP Type of Service
Availability: Freely available
Date: 2007
Kyoto2006+ Dataset
Link: http://www.takakura.com/Kyoto_data/
Description: The raw traffic data is obtained by honeypot systems that are deployed in Kyoto University. The dataset was built on 3 years of real traffic data recorded by the reployed honeypots.
Content: tbd
Format: Tabular (CSV-like)
Features: 24 features including all the features Netflow v9 has except packet count . See link for more information.
Availability: Freely available
Date: 2009
UNIBS
Link: http://netweb.ing.unibs.it/~ntw/tools/traces/
Description: This dataset contains of traces that were collected by the tool tcpdump running on the edge router of the campus network of the University of Brescia. The data was collected between 30.09.2009 and 2.10.2009. The recorded traffic was simulated by several workstations running the GT client daemon. The bandwirth were around 100 Mbps.
Content: tbd
Format: PCAP
Features: Time, Duration, Source IP, Source Port, Destination IP, Destination Port, Protocol, Packet Count, Byte Count
Availability: Needs Requesting
Date: 2009