<p dir="ltr">Code repository for each chapter in candidate's PhD thesis:</p><ul><li>Chapter 2 Scope+</li><li>Chapter 3 Vulture</li><li>Chapter 4 AtlasAgent</li><li>Chapter 6 GeneAgent</li></ul><p dir="ltr">For each chapter, here is detailed description on the dataset:</p><p dir="ltr">Chapter 2: Scope+ (Published) </p><ul><li>Scope+ is an open-source, generalizable architecture for large-scale single-cell RNA-seq atlas analysis at both sample and cell levels. The source code and documentation are publicly available: </li><li>GitHub repository: https://github.com/holab-hku/scopeplus (Full source code, installation guides, and example workflows) </li><li>Related tool (Covidscope): https://github.com/holab-hku/Covidscope (A specialized extension of Scope+ for COVID-19 single-cell data analysis) </li></ul><p dir="ltr">Chapter 3: Vulture (Published) </p><ul><li>Vulture is a cloud-enabled, scalable framework for mining microbial reads from public single-cell RNA-seq datasets. The implementation and resources are accessible at: </li><li>GitHub repository: https://github.com/holab-hku/Vulture (Implementation code, documentation, and deployment resources) </li></ul><p dir="ltr">Chapter 4: AtlasAgent (Preprint available) </p><ul><li>AtlasAgent introduces a vision–language model (VLM) framework for the evaluation of atlas-scale single-cell data integration. The codebase and preprint are available at: </li><li>GitHub repository: https://github.com/holab-hku/AtlasAgent (Prototype code, preprint reference, and evaluation pipelines) </li></ul><p dir="ltr">Chapter 5: RagOmics (Not publicly available) </p><ul><li>RagOmics is a no-code, multi-agent platform for automated and scalable single-cell analysis. At present, the system is under active development and not yet released publicly. This Chapter's work is under preparation, due to confidentiality the code will be relased once the manuscript is on preprint or published. </li></ul><p dir="ltr">Chapter 6: GeneAgent (Preprint available) </p><ul><li>GeneAgent presents a prompt-only large language model (LLM) pipeline for extracting gene–disease–phenotype relationships from biomedical literature. The prototype and associated preprint are accessible at: </li><li>GitHub repository: https://github.com/holab-hku/GeneAgent (Code and resources for reproducing preprint results)</li></ul><p dir="ltr">Supporting Data to Thesis <b>Scalable and AI-empowered computational methods for large-scale single-cell omics data analytics</b></p><p dir="ltr"><br>Please find README for details on navigating the code repository</p>