Systematic evaluation of de novo mutation calling tools using whole genome sequencing data

Shah, Anushi and Monger, Steven and Troup, Michael and Ip, Eddie K K and Giannoulatou, Eleni (2025) Systematic evaluation of de novo mutation calling tools using whole genome sequencing data. Briefings in Bioinformatics, 26 (6). ISSN 1467-5463

Full text not available from this repository.
Link to published document: https://doi.org/10.1093/bib%2Fbbaf543

Abstract

Abstract

De novo mutations (DNMs) are genetic alterations that occur for the first time in an offspring. DNMs have been found to be a significant cause of severe developmental disorders. With the widespread use of next-generation sequencing (NGS) technologies, accurate detection of DNMs is crucial. Several bioinformatics tools have been developed to call DNMs from NGS data, but no study to date has systematically compared these tools. We used both real whole genome sequencing (WGS) data from a trio from the 1000 Genomes Project (1000G) and an in-house simulated trio dataset to evaluate five DNM calling tools: DeNovoGear, TrioDeNovo, PhaseByTransmission, VarScan 2, and DeNovoCNN. For DNMs called in the real dataset, we observed 8.4% concordance of variants between all tools, while 83.8% of DNMs variants were identified by only one caller. For simulated trio WGS dataset spiked with 100 DNMs, the concordance rate was also low at 3.9%. DeNovoGear achieved the highest F1 score on the real 1000G dataset, while DeNovoCNN had the highest F1 score on the simulated data. Our study provides valuable recommendations for the selection and application of DNM callers on WGS trio data.

Item Type: Article
Subjects: R Medicine > R Medicine (General)
Depositing User: Repository Administrator
Date Deposited: 22 Dec 2025 00:24
Last Modified: 22 Dec 2025 00:24
URI: http://eprints.victorchang.edu.au/id/eprint/1795

Actions (login required)

View Item View Item