File(s) under embargo
Supplementary data for the thesis "Development and Validation of Explainable Machine-Learning Prediction Systems: A Study of Biomedical and Clinical Data"
The files contain the datasets used in Chapters 3 to 5 of the thesis, as described below.
Chapter 3 includes a dataset of patients admitted with Clostridioides difficile infection (CDI) in Hong Kong from 2009 to 2014.
Chapter 4 includes a list of protein structures derived from UniProt (www.uniprot.org, release 2021_03) and their corresponding enzyme functions. The protein structure files can be downloaded from the open-access Protein Data Bank (www.rcsb.org). A list of AlphaFold 2 predicted structures is also included; the corresponding structural data can be downloaded from www.alphafold.com.
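As a rough illustration of how structure files for the listed entries might be retrieved, the sketch below builds download URLs for the Protein Data Bank and optionally fetches a file. It is not part of the thesis materials. The URL pattern `https://files.rcsb.org/download/<ID>.<ext>`, the function names `rcsb_url` and `fetch_structure`, and the example ID `1CRN` are assumptions chosen for illustration; the actual IDs should be taken from the list supplied with the dataset.

```python
import re
import urllib.request
from pathlib import Path

# Assumed RCSB file-download URL pattern; the dataset description does not specify one.
RCSB_DOWNLOAD = "https://files.rcsb.org/download/{pdb_id}.{ext}"


def rcsb_url(pdb_id: str, ext: str = "pdb") -> str:
    """Build a download URL for a four-character PDB ID (hypothetical helper)."""
    pdb_id = pdb_id.strip().upper()
    # Classic PDB IDs are one digit followed by three alphanumeric characters.
    if not re.fullmatch(r"[0-9][A-Z0-9]{3}", pdb_id):
        raise ValueError(f"not a four-character PDB ID: {pdb_id!r}")
    if ext not in {"pdb", "cif"}:
        raise ValueError(f"unsupported file extension: {ext!r}")
    return RCSB_DOWNLOAD.format(pdb_id=pdb_id, ext=ext)


def fetch_structure(pdb_id: str, dest_dir: str = ".", ext: str = "pdb") -> Path:
    """Download one structure file into dest_dir and return its path (needs network access)."""
    url = rcsb_url(pdb_id, ext)
    target = Path(dest_dir) / url.rsplit("/", 1)[-1]
    urllib.request.urlretrieve(url, str(target))
    return target


if __name__ == "__main__":
    # Example ID only; replace with IDs read from the dataset's structure list.
    print(rcsb_url("1crn"))
```

Building the URL separately from downloading makes it possible to check the full list of IDs before any network request is made.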
Chapter 5 contains a list of PDB structures derived from UniProt (release 2023_01).