Catch Me If You Can Dataset

This Dataset supports the CPSIOT 2020 Workshop Paper "Catch Me If You Can: An In-Depth Study of CVE Discovery Time and Inconsistencies for Managing Risk in Critical Infrastructures" by Richard J. Thomas, Joe Gardiner, Tom Chothia, Awais Rashid, Manolis Samanis and Joshua Perrett, a collaboration between the University of Birmingham and the University of Bristol.

Referencing this Dataset

We encourage the use of our Dataset by the research community. If you do use it, we ask that you cite the Dataset and credit the University of Birmingham and the Bristol Cyber Security Group.

This Dataset is licensed under the Creative Commons Attribution 4.0 International license.

The citation and its BibTeX entry are given below.

R.J. Thomas, J. Gardiner, T. Chothia, A. Rashid, M. Samanis and J. Perrett. (2020) "Catch Me If You Can: An In-Depth Study of CVE Discovery Time and Inconsistencies for Managing Risks in Critical Infrastructure" in: Proceedings of the ACM Workshop on Cyber-Physical Systems Security & IOT Security and Privacy.
@InProceedings{cpsiotsec2020,
  author    = "Thomas, Richard J. and Gardiner, Joe and Chothia, Tom and Rashid, Awais and Samanis, Manolis and Perrett, Joshua",
  title     = "Catch Me If You Can: An In-Depth Study of CVE Discovery Time and Inconsistencies for Managing Risks in Critical Infrastructure",
  booktitle = {Proceedings of the ACM Workshop on Cyber-Physical Systems Security \& IOT Security and Privacy},
  year      = "2020"
}

The Dataset

Everything you need to know about this Dataset.

Dataset Information:

The 'Catch Me If You Can' Dataset was curated by scraping CISA ICS-CERT Advisories, the NIST NVD CVE feeds, MITRE CVE exports and the MITRE CWE list. The workflow that imports the data held in these sources to form our Dataset is given in our paper.

This Dataset contains all ICS advisories between 2011 and March 2020. Some key statistics are given below:

Data Schema

The Dataset has been broken down for simple referencing and for intuitive use. The schema is given below, with a description of the fields contained in this Dataset. Each schema has a matching SQL and CSV file for download.

Dataset Releases

This Dataset is available as SQL and CSV files for database servers and integration with other data analysis tools and software.

Please note: this Dataset contains only pre-processed data, and not the device sales, release and firmware update data that was used in our paper.

Full Dataset

The full dataset of ICS Advisories, CVE and CPE information and other data through to March 2020.

Full Dataset Files

Schema File             SQL  CSV  JSON
cpsiot2020-cpe_listing  SQL  CSV  JSON
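
As a rough illustration of how a CSV release could be pulled into an analysis script, the sketch below reads a CSV export with Python's standard library and counts values in one column. It is a minimal sketch and not part of the Dataset's own tooling: the helper names load_rows and count_by are hypothetical, and the column names "cve_id" and "vendor" and the sample rows are placeholders rather than the Dataset's actual schema or contents.

```python
import csv
import io
from collections import Counter
from typing import TextIO


def load_rows(source: TextIO) -> list[dict[str, str]]:
    """Read a CSV export into a list of dicts keyed by the header row."""
    return list(csv.DictReader(source))


def count_by(rows: list[dict[str, str]], column: str) -> Counter:
    """Count how often each value appears in the given column."""
    return Counter(row[column] for row in rows)


# Hypothetical sample standing in for a real export. The column names
# "cve_id" and "vendor" are assumptions, not the Dataset's actual schema.
SAMPLE = """cve_id,vendor
CVE-0000-0001,ExampleVendor
CVE-0000-0002,ExampleVendor
CVE-0000-0003,OtherVendor
"""

rows = load_rows(io.StringIO(SAMPLE))
vendor_counts = count_by(rows, "vendor")
print(vendor_counts.most_common())
```

To use a downloaded file instead of the in-memory sample, open it with open(path, newline="", encoding="utf-8") and pass the file object to load_rows. The path and the column passed to count_by should be taken from the downloaded CSV and its schema description.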

Have Questions?

If you have any questions, please feel free to get in touch with us. Our contact addresses are in the paper.