Advancing AI for multi-omics and clinical data integration in basic and translational cancer research

Yang, Z., Guan, F., Bronk, L. & Zhao, L. Multi-omics approaches for biomarker discovery in predicting the response of esophageal cancer to neoadjuvant therapy: a multidimensional perspective. Pharmacol. Ther. 254, 108591 (2024).

Article  CAS  PubMed  Google Scholar 

The ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium. Pan-cancer analysis of whole genomes. Nature 578, 82–93 (2020). This consortium paper provides one of the most comprehensive genomic and transcriptomic datasets across multiple cancers, forming a foundational resource for pan-cancer computational analyses.

Article  CAS  Google Scholar 

Charlotte, J. H. & Jeffrey, M. D. Artificial intelligence and machine learning in clinical medicine. N. Engl. J. Med. 388, 1201–1208 (2023).

Article  Google Scholar 

Shi, Z., Lei, J. T., Elizarraras, J. M. & Zhang, B. Mapping the functional network of human cancer through machine learning and pan-cancer proteogenomics. Nat. Cancer 6, 205–222 (2025).

Article  CAS  PubMed  Google Scholar 

Zeng, Q. et al. Understanding tumour endothelial cell heterogeneity and function from single-cell omics. Nat. Rev. Cancer 23, 544–564 (2023).

Article  CAS  PubMed  Google Scholar 

Gomes, B. & Ashley, E. A. Artificial intelligence in molecular medicine. N. Engl. J. Med. 388, 2456–2465 (2023).

Article  CAS  PubMed  Google Scholar 

Lipkova, J. et al. Artificial intelligence for multimodal data integration in oncology. Cancer Cell 40, 1095–1110 (2022). This review offers a foundational synopsis of the strategies, applications and challenges of using AI for multimodal data integration in oncology.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Binder, A. et al. Morphological and molecular breast cancer profiling through explainable machine learning. Nat. Mach. Intell. 3, 355–366 (2021).

Article  Google Scholar 

Zhao, M. et al. Integrated multiomics signatures to optimize the accurate diagnosis of lung cancer. Nat. Commun. 16, 84 (2025). This paper provides a compelling clinical use-case, demonstrating that a multi-omics model integrating radiomics and cell-free DNA fragmentomics improves the diagnostic accuracy for indeterminate pulmonary nodules.

Article  PubMed  PubMed Central  Google Scholar 

Alcazer, V. et al. Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France. Lancet Digit. Health 6, e323–e333 (2024).

Article  CAS  PubMed  Google Scholar 

Wang, T. et al. MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification. Nat. Commun. 12, 3445 (2021).

Article  CAS  PubMed  PubMed Central  Google Scholar 

Osipov, A. et al. The Molecular Twin artificial-intelligence platform integrates multi-omic data to predict outcomes for pancreatic adenocarcinoma patients. Nat. Cancer 5, 299–314 (2024). This paper introduces the ‘Molecular Twin’ concept, demonstrating a powerful platform that successfully integrates thousands of multi-omics features to accurately predict survival in pancreatic cancer.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Tsai, P. C. et al. Histopathology images predict multi-omics aberrations and prognoses in colorectal cancer patients. Nat. Commun. 14, 2102 (2023). A key demonstration of the genotype–phenotype link, this work shows that deep learning on standard histopathology images can predict a wide range of multi-omics aberrations, including gene expression and copy number alterations.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Boehm, K. M. et al. Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer. Nat. Cancer 3, 723–733 (2022). This study provides a key example of how fusing histopathological, radiological and clinicogenomic features can improve prognostic accuracy over any single modality alone.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Niazi, M. K. K., Parwani, A. V. & Gurcan, M. N. Digital pathology and artificial intelligence. Lancet Oncol. 20, e253–e261 (2019).

Article  PubMed  PubMed Central  Google Scholar 

Sunami, K. et al. A learning program for treatment recommendations by molecular tumor boards and artificial intelligence. JAMA Oncol. 10, 95–102 (2024).

Article  PubMed  PubMed Central  Google Scholar 

Chen, R. J. et al. Pan-cancer integrative histology-genomic analysis via multimodal deep learning. Cancer Cell 40, 865–878.e6 (2022).

Article  CAS  PubMed  PubMed Central  Google Scholar 

Jee, J. et al. Automated real-world data integration improves cancer outcome prediction. Nature 636, 728–736 (2024). This paper details the creation of a massive, multi-modal real-world dataset (MSK-CHORD) and shows that integrating NLP-derived features from clinical notes improves survival prediction beyond genomics or stage alone.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Vanguri, R. S. et al. Multimodal integration of radiology, pathology and genomics for prediction of response to PD-(L)1 blockade in patients with non-small cell lung cancer. Nat. Cancer 3, 1151–1164 (2022). This study demonstrated that a model integrating radiology, pathology and genomics can predict immunotherapy response in NSCLC more accurately than established biomarkers like TMB or PDL1.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Qiu, W. et al. Deep profiling of gene expression across 18 human cancers. Nat. Biomed. Eng. 9, 333–355 (2025).

Article  CAS  PubMed  Google Scholar 

Chang, T. G., Park, S., Schaffer, A. A., Jiang, P. & Ruppin, E. Hallmarks of artificial intelligence contributions to precision oncology. Nat. Cancer 6, 417–431 (2025).

Article  PubMed  PubMed Central  Google Scholar 

Unger, M. & Kather, J. N. Deep learning in cancer genomics and histopathology. Genome Med. 16, 44 (2024).

Article  PubMed  PubMed Central  Google Scholar 

LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).

Article  CAS  PubMed  Google Scholar 

Bravo González-Blas, C. et al. Single-cell spatial multi-omics and deep learning dissect enhancer-driven gene regulatory networks in liver zonation. Nat. Cell Biol. 26, 153–167 (2024).

Article  PubMed  PubMed Central  Google Scholar 

Wornow, M. et al. Zero-shot clinical trial patient matching with LLMs. NEJM AI 2, AIcs2400360 (2025).

Article  Google Scholar 

Andani, S. et al. Histopathology-based protein multiplex generation using deep learning. Nat. Mach. Intell. 7, 1292–1307 (2025).

Article  PubMed  PubMed Central  Google Scholar 

Kondepudi, A. et al. Foundation models for fast, label-free detection of glioma infiltration. Nature 637, 439–445 (2025).

Article  CAS  PubMed  Google Scholar 

Wang, J. et al. An interpretable artificial intelligence framework for designing synthetic lethality-based anti-cancer combination therapies. J. Adv. Res. 65, 329–343 (2024).

Article  CAS  PubMed  Google Scholar 

Su, X. et al. Interpretable identification of cancer genes across biological networks via transformer-powered graph representation learning. Nat. Biomed. Eng. 9, 371–389 (2025). This paper presents a novel approach using a transformer-based model on biological networks to not only predict cancer driver genes with high accuracy but also to provide interpretable insights into their regulatory mechanisms.

Article  PubMed  Google Scholar 

Prelaj, A. et al. Artificial intelligence for predictive biomarker discovery in immuno-oncology: a systematic review. Ann. Oncol. 35, 29–65 (2024).

Article  CAS  PubMed  Google Scholar 

Wang, K. et al. A full life cycle biological clock based on routine clinical data and its impact in health and diseases. Nat. Med. 31, 4225–4235 (2025).

Article  CAS  PubMed  PubMed Central 

Comments (0)

No login
gif