Detecting pathogenic structural variation in families with undiagnosed rare disease in a national genome project

Abstract

Background Whole-genome sequencing (WGS) projects for rare disease diagnosis typically yield a diagnostic rate of approximately 25-40%, dependent particularly on patient selection and the extent of prior genetic testing. The Scottish Genomes Partnership (SGP) is a collaborative research programme involving four Scottish Regional Genetics Centres, four Scottish Medical Schools, and Genomics England’s 100,000 Genomes Project. It aims to facilitate genome sequencing and diagnosis for patients in the Scottish NHS with suspected rare Mendelian diseases. Within SGP, short-read sequencing (SRS) achieved a diagnostic rate of 23% in affected families.

Methods To increase the diagnostic yield, we applied Oxford Nanopore Technologies (ONT) long-read sequencing (LRS) to a cohort of 24 SGP families (74 individuals) who remained undiagnosed after SRS. We also re-analysed previously generated SRS data to identify pathogenic structural variants (SVs). We benchmarked several existing software tools for SV detection using LRS and defined key requirements for sample processing and DNA quality. Custom SV prioritisation and bioinformatics pipelines were developed to integrate SV discovery with genotype-phenotype analysis.

Results Benchmarking showed that minimap2 + cuteSV was optimal for single-sample SV discovery, while minimap2 + Sniffles2 performed best for family-based analysis. SV calling across the cohort yielded 60,022 filtered SVs spanning autosomes and sex chromosomes. Each family had between 23,024 and 25,009 SVs genome-wide (median: 23,814). A total of 392 SVs genome-wide and 8 within a disease-gene panel were prioritised across autosomal dominant/de novo, recessive, compound heterozygous, and X-linked modes, with counts varying between families. In three exemplar families, pathogenic or likely pathogenic de novo SVs were identified in both LRS and SRS data: one at the DLX5/6 locus, one in AUTS2, and one in FN1. We provide genome-wide de novo SVs and compound heterozygous (SV + SNV) variants, and deposit raw and processed sequencing data for all families in the Genomics England Research Environment to support future gene discovery.

Conclusions This study demonstrates that in-depth SV analysis can increase molecular diagnostic rates in rare disease patients with presumed monogenic aetiology. Pathogenic or likely pathogenic de novo SVs were identified in three families, resolving the diagnostic odyssey for at least two of the 24 families.

Competing Interest Statement

Timothy J. Aitman is a Council Member and Trustee of the UK Academy of Medical Sciences and is cofounder and equity holder of BioCaptiva plc.

Funding Statement

The Scottish Genomes Partnership was funded by the Chief Scientist Office of the Scottish Government Health Directorates (SGP/1 and SGP/2) and the Medical Research Council Whole Genome Sequencing for Health and Wealth Initiative (MC/PC/15080). P.D. was additionally supported by funding awarded to J.C.T. (Grant Ref: MR/W01761X/1) and by the National Institute of Health Research (NIHR) Oxford Biomedical Research Centre (BRC).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Samples were collected from 73 individuals for the long-read sequence study within the Scottish Genomes Partnership programme project "NHS Scotland in 100,000 genomes study". All gave informed consent for use of samples and phenotype data for research purposes, including genome sequencing, data store and de-identified data sharing. Pedigree details and details of the clinical phenotype were included for three families, all of whom gave additional consent for inclusion of their data and clinical features in this publication. This research study was approved by North of Scotland Research Ethics Committee (16/NS/0137) and Scotland A Research Ethics Committee (17/SS/0113); the Public Benefit and Privacy Panel (1516-0377); and NHS Scotland health board Research and Development departments.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Comments (0)

No login
gif