RESEARCH DATA INTEGRATION

BII_Research-BDHD-RDI-2023

Research

Data science is an important component of biomedical and translational research, where data of multiple modalities are being constantly generated at unprecedented scale. The Research Data Integration group in the Biomedical Datahub Division aims to bridge the complexity of computational biology and data science with the needs of biologists and clinicians to drive biological discoveries and predict translational outcomes. To achieve this goal, we are developing capabilities to analyze and integrate multi-omics, imaging and clinical data generated by biomedical institutes in A*STAR, healthcare institutions and national platforms in Singapore to improve the usability and interpretability of large-scale multimodal datasets of cancer and other diseases.

Our group develops a wide range of computational, statistical, annotation and machine learning approaches to analyze large-scale multi-modal datasets generated from patients and patient derived models (xenografts, organoids and cell lines). We work closely with biologists, bioinformaticians and clinicians to build clinical and experimental data resources for diverse diseases that can scale research analysis and inform treatment options in the clinic, thereby improving patient outcomes.

Our focus is to manage, analyze, integrate and interpret large-scale biomedical and biological multi-modal data generated through large collaborative programs, to drive biomedical discoveries and scale data science efforts in Bioinformatics Institute. We are building data coordination centers and end-to-end data integration platforms for disease-related research programs, comprising multi-modal data. The novelty lies in bringing together and transforming Asian-centric datasets generated from multi-ethnic Singaporean patients across our local healthcare clusters, and ensuring sustainability for long term use. The strategic significance and impact of a centralized data effort can create synergy across multi-disciplinary collaborations to drive new discoveries for clinical translation and create harmonized datasets for future AI applications. We are driving, in partnership with clinical and wet-lab collaborators, high-impact scientific discoveries to address clinical unmet needs. Currently, our team comprise different computational expertise to build the data infrastructure that suits end-to-end data integration for data resources of different diseases and programs.

BII_Research-BDHD-RDI_figure-1 BII_Research-BDHD-RDI_figure-2

Members

 Senior Principal Scientist WOO Xing Yi   |    [View Bio]
 Senior Scientist GOH Jia Ni Janice
 Scientist LEE Wei Qi Audrey
 Research Officer PHUA Xuan Ming Brandon
 Research Officer CHEN Junqi
 Senior Research Officer (Collaborator) ZHANG Qinze Arthur

Selected Publications


  1. Woo et al. A Genomically and Clinically Annotated Patient Derived Xenograft (PDX) Resource for Preclinical Research in Non-Small Cell Lung Cancer. Cancer Research 82(22): 4126–4138 (2022).

  2. Sargent et al. Genetically Diverse Mouse Platform to Xenograft Cancer Cells. Disease Models & Mechanisms 15(9): dmm049457 (2022).

  3. Koc et al. PDXNet portal: patient-derived Xenograft model, data, workflow and tool discovery, NAR Cancer 4(2), zcac014 (2022).  

  4. Guillen et al. A breast cancer patient-derived xenograft and organoid platform for drug discovery and precision oncology. Nature Cancer 3, 232–250 (2022).

  5. Sun et al. Comprehensive characterization of 536 patient-derived xenograft models prioritizes candidates for targeted treatment. Nature Communications 12, 5086 (2021).

  6. Woo et al. Conservation of copy number profiles during engraftment and passaging of patient-derived cancer xenografts. Nature Genetics 53, 86–99 (2021). 

  7. Evrard et al. Systematic establishment of robustness and standards in patient-derived xenograft experiments and analysis. Cancer Research 80(11):2286-2297 (2020).

  8. Woo et al. Genomic data analysis workflows for tumors from patient-derived xenografts (PDXs): challenges and guidelines. BMC Medical Genomics 12(1):92 (2019).

  9. Menghi et al. The tandem duplicator phenotype as a distinct genomic configuration in cancer. Proceedings of the National Academy of Sciences 113(17): E2373-E2382 (2016). 

  10. Inaki et al. Systems consequences of amplicon formation in human breast cancer. Genome research 24(10): 1559-1571 (2014). 

  11. Hillmer et al. Comprehensive long-span paired-end-tag mapping reveals characteristic patterns of structural variations in epithelial cancer genomes. Genome Research 21:676-687 (2011).

  12. Inaki et al. Transcriptional consequences of genomic structural aberrations in breast cancer. Genome Research 21:665-675 (2011).