This README.txt file was generated on 2021/06/16 by ZHENG Lichuan

-------------------
GENERAL INFORMATION
-------------------

1. Title of Dataset: Marcus_Dataset_RNASeq

2. Author Information

   First Author Contact Information
      Name: ZHENG Lichuan
      Faculty: Department of Paediatrics and Adolescent Medicine, LKS Faculty of Medicine
      Email: u3006674@connect.hku.hk

   Corresponding Author Contact Information
      Name: ZHENG Lichuan
      Faculty: Department of Paediatrics and Adolescent Medicine, LKS Faculty of Medicine
      Email: u3006674@connect.hku.hk

   Author Contact Information (if applicable)
      Name:
      Faculty:
      Email:

---------------------
DATA & FILE OVERVIEW
---------------------

Directory of Files:

   A. Filename: Transcript_DE.xlsx
      Short description: Lists the 86 genes corresponding to differentially expressed (DE) transcripts, with transcript IDs, log2 fold change values and adjusted p-values for the Jining cohort and the Public cohort.

   B. Filename: Transcript_ToppgeneEnrichment.xlsx
      Short description: Contains the ToppGene enrichment results (gene ontology terms, pathways and diseases), with the p-value and adjusted p-value of each term.

Additional Notes on File Relationships, Context, or Content (for example, if a user wants to reuse and/or cite your data, what information would you want them to know?): N/A

File Naming Convention: DataType_Analysis
   Example: Transcript_DE.xlsx

-----------------------------------------
DATA DESCRIPTION FOR: [Transcript_DE.xlsx]
-----------------------------------------

1. Number of variables: 11

2. Number of cases/rows: 87

3. Missing data codes: N/A
      Code/symbol      Definition
      Code/symbol      Definition

4. Variable List

   A. Name: geneName
      Description: HGNC gene name

   B. Name: transcriptENST
      Description: Ensembl transcript ID

   C. Name: transcriptType
      Description: Transcript type

   D. Name: log2FoldChange_G_JN
      Description: Gene-level log2 fold change in the Jining cohort

   E. Name: padj_G_JN
      Description: Gene-level adjusted p-value in the Jining cohort

   F. Name: log2FoldChange_T_JN
      Description: Transcript-level log2 fold change in the Jining cohort

   G. Name: padj_T_JN
      Description: Transcript-level adjusted p-value in the Jining cohort

   H. Name: log2FoldChange_G_Rep
      Description: Gene-level log2 fold change in the Public cohort

   I. Name: padj_G_Rep
      Description: Gene-level adjusted p-value in the Public cohort

   J. Name: log2FoldChange_T_Rep
      Description: Transcript-level log2 fold change in the Public cohort

   K. Name: padj_T_Rep
      Description: Transcript-level adjusted p-value in the Public cohort

-----------------------------------------
DATA DESCRIPTION FOR: [Transcript_ToppgeneEnrichment.xlsx]
-----------------------------------------

1. Number of variables: 4

2. Number of cases/rows: 155

3. Missing data codes: N/A
      Code/symbol      Definition
      Code/symbol      Definition

4. Variable List

   A. Name: ID
      Description: ToppGene enrichment term ID

   B. Name: Name
      Description: ToppGene description of the corresponding term

   C. Name: p-value
      Description: p-value of the enrichment term

   D. Name: q-value Bonferroni
      Description: q-value of the enrichment term (p-value adjusted by Bonferroni correction)

--------------------------
METHODOLOGICAL INFORMATION
--------------------------

1. Software-specific information: N/A

   Name:
   Version:
   System Requirements:
   Open Source? (Y/N):

   (if available and applicable)
   Executable URL:
   Source Repository URL:
   Developer:
   Product URL:
   Software source components:

   Additional Notes (such as, will this software not run on certain operating systems?):

2. Equipment-specific information: N/A

   Manufacturer:
   Model:

   (if applicable)
   Embedded Software / Firmware Name:
   Embedded Software / Firmware Version:

   Additional Notes:
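
-------------------------
EXAMPLE: CHECKING COLUMNS
-------------------------

The variable list for Transcript_DE.xlsx can be turned into a small sanity check. The sketch below is illustrative only and is not part of the original analysis. It assumes the sheet has already been read into a list of dictionaries keyed by column name (for example with a spreadsheet library of your choice). The helper names missing_columns and replicated_transcripts are hypothetical, and the 0.05 threshold is an arbitrary illustration, not a cutoff documented for this dataset.

```python
# Column names exactly as listed in the variable list for Transcript_DE.xlsx.
DE_COLUMNS = [
    "geneName", "transcriptENST", "transcriptType",
    "log2FoldChange_G_JN", "padj_G_JN",
    "log2FoldChange_T_JN", "padj_T_JN",
    "log2FoldChange_G_Rep", "padj_G_Rep",
    "log2FoldChange_T_Rep", "padj_T_Rep",
]


def missing_columns(header):
    """Return the documented columns that are absent from a header row."""
    present = set(header)
    return [name for name in DE_COLUMNS if name not in present]


def replicated_transcripts(rows, alpha=0.05):
    """Keep rows whose transcript-level change is significant in both cohorts
    and points in the same direction.

    alpha is an illustrative threshold (assumption), not taken from the dataset.
    """
    kept = []
    for row in rows:
        lfc_jn = row["log2FoldChange_T_JN"]
        lfc_rep = row["log2FoldChange_T_Rep"]
        significant = row["padj_T_JN"] < alpha and row["padj_T_Rep"] < alpha
        same_direction = (lfc_jn > 0) == (lfc_rep > 0)
        if significant and same_direction:
            kept.append(row)
    return kept
```

Calling missing_columns on the first row of the sheet should return an empty list if the file matches this README. replicated_transcripts assumes every numeric cell is populated, which is consistent with the "Missing data codes: N/A" entry above.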