Since current approaches for parsing true variation (i.e. signal) from noise broadly involve estimating a baseline value of the latter, below which all sequence data are ignored, the current study sought to deliver a more objective criterion for setting such thresholds, i.e., a novel approach based on phylogenetic principles.
The proposed method deconstructs a special category of noise from true mitochondrial genome data, namely nuclear insertions of mitochondrial DNA (Numts). This bioinformatic approach leverages the relationship of massively parallel sequence reads and can discover putative Numts (pNumts) in the absence of a reference genome. The new method was tested on a whole mitochondrial genome dataset (n = 41 individuals from an admixed population sample from Rio de Janeiro) and led to the discovery of 451 pNumt variants. Comparison of these pNumts haplotypes against an existing Numt database revealed 147 exact matches to previously discovered Numts, while 122 haplotypes differed only by a single base pair and none matched exclusively to the mitochondrial genome. In general, these sequences were considerably more divergent from the mitochondrial genome than from those of the Numt database, supporting that the novel pNumts were probably hitherto uncatalogued variants. Unlike previous techniques, this method appears to be able to detect both polymorphic and fixed Numt sequences. It was also found that the region containing the D-Loop and associated Promoters (DLP) in the human mitochondrial genome, which harbors markers of forensic genetics importance, is the origin of several Numts. Although currently designed for the mitochondrial genome, this novel approach has the potential to be expanded to other scenarios that might require construing signal from noise, including the deconvolution of mixtures, thus significantly improving how analytical thresholds may be established. (publisher abstract modified)
Downloads
Similar Publications
- Just Science Podcast: Just Using Inadvertently Photographed Ridge Detail as Evidence
- Enhancing Fault Ride-Through Capacity of DFIG-Based WPs by Adaptive Backstepping Command Using Parametric Estimation in Non-Linear Forward Power Controller Design
- Development and Validation of a Method for Analysis of 25 Cannabinoids in Oral Fluid and Exhaled Breath Condensate