Refining the genetic landscape of anophthalmia and microphthalmia: a comprehensive framework with deep learning and updated gene panels

Abstract

Importance Anophthalmia and microphthalmia (A/M) are rare congenital eye disorders with a low molecular diagnosis rate, which limits clinical management and genetic counselling. Improved detection and interpretation of pathogenic variants is essential for advancing diagnosis and care in affected individuals. Objective To improve the molecular diagnostic yield in A/M patients by refining the methodology of variant investigation and association using an updated rigorously curated gene panel, and a refined bioinformatic pipeline incorporating structural variant detection, in silico Artificial Intelligence assisted predictive tools, and molecular dynamics simulations. Methodology We curated an updated A/M gene panel through a systematic literature review and screened for rare variants in these genes using data from the UK's 100,000 Genomes Project, a national whole-genome sequencing initiative conducted by Genomics England. The cohort comprised 306 individuals recruited to the Rare Disease programme with a clinical diagnosis of anophthalmia or microphthalmia, recorded either as the primary phenotype or within HPO, SNOMED, or ICD-10 terms. Variants, including loss-of-function, missense, RNA splicing, and structural variants, were annotated with deep learning tools (AlphaMissense, SpliceAI), and missense variants were further assessed using REVEL, Missense3D, and molecular dynamics simulations. Results We identified pathogenic or likely pathogenic variants in 37 (12.1%) individuals, with an additional 23 (7.5%) harbouring strong candidate variants of uncertain significance. Our literature review identified the biggest contributors to A/M phenotypes to be MFRP, OTX2, PRSS56 and SOX2, each with over 100 patients reported in the literature, with a total number of 124 genes found be associated to A/M. Variants from our screen were most often found in genes with high A/M association, but also included novel findings within genes with a weaker association to A/M such as ACTG1, HDAC6, RERE and SIX3; adding support to their disease relevance. Conclusions and Relevance This study increased the diagnostic yield in A/M patients recruited to the 100KGP, and provides further evidence of genotype-phenotype associations within the aetiology of A/M. We also provide an updated framework for enhancing clinical genetic diagnosis in A/M that may inform broader strategies for other complex congenital disorders. However, as molecular diagnosis of A/M remains low, further research in understanding the genetic aetiology of A/M is necessary.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This research was supported by the NIHR Moorfields Biomedical Research Centre (GA), the Moorfields Eye Charity Career Development award (GR001155 to RMY), Springboard Awards (GR001210 and GR001662 to RMY), PhD Studentship (GR001661 to MIM), Medical Research Council (MR/X001067/1 to RMY), FONDECYT (1221843 to RMY), Biotechnology and Biological Sciences Research Council (BB/T002212/1 to FF as principal investigator) and the British Heart Foundation grant (RG/F/22/110079 to FF and OGC).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The East of England, Cambridge South Research Ethics Committee of the UK Health Research Authority gave ethical approval for this work (REC reference: 14/EE/1112). Access to the data was granted under Genomics England's Data Access Agreement (Project ID: RR439).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced in the present work are contained in the manuscript.

Comments (0)

No login
gif