U.S. flag

An official website of the United States government, Department of Justice.

NCJRS Virtual Library

The Virtual Library houses over 235,000 criminal justice resources, including all known OJP works.
Click here to search the NCJRS Virtual Library

A novel phylogenetic approach for de novo discovery of putative nuclear mitochondrial (pNumt) haplotypes

NCJ Number
303153
Journal
Forensic Science International-Genetics Volume: 43 Dated: 2019
Author(s)
U. Smart; et al
Date Published
2019
Annotation

Since current approaches for parsing true variation (i.e. signal) from noise broadly involve estimating a baseline value of the latter, below which all sequence data are ignored, the current study sought to deliver a more objective criterion for setting such thresholds, i.e., a novel approach based on phylogenetic principles. 

Abstract

The proposed method deconstructs a special category of noise from true mitochondrial genome data, namely nuclear insertions of mitochondrial DNA (Numts). This bioinformatic approach leverages the relationship of massively parallel sequence reads and can discover putative Numts (pNumts) in the absence of a reference genome. The new method was tested on a whole mitochondrial genome dataset (n = 41 individuals from an admixed population sample from Rio de Janeiro) and led to the discovery of 451 pNumt variants. Comparison of these pNumts haplotypes against an existing Numt database revealed 147 exact matches to previously discovered Numts, while 122 haplotypes differed only by a single base pair and none matched exclusively to the mitochondrial genome. In general, these sequences were considerably more divergent from the mitochondrial genome than from those of the Numt database, supporting that the novel pNumts were probably hitherto uncatalogued variants. Unlike previous techniques, this method appears to be able to detect both polymorphic and fixed Numt sequences. It was also found that the region containing the D-Loop and associated Promoters (DLP) in the human mitochondrial genome, which harbors markers of forensic genetics importance, is the origin of several Numts. Although currently designed for the mitochondrial genome, this novel approach has the potential to be expanded to other scenarios that might require construing signal from noise, including the deconvolution of mixtures, thus significantly improving how analytical thresholds may be established. (publisher abstract modified)