The authors used their previously published next generation sequencing (NGS) mRNA approach for body fluid identification to analyze 183 body fluids/tissues, including mock casework samples. The resulting data set was used to build a probabilistic model that predicts the origin of a stain. This approach uses partial least squares followed by linear discriminant analysis to classify samples into six commonly occurring forensic body fluids. The model differs from the ones previously suggested in that it incorporates quantitative information (NGS read counts) rather than just presence/absence of markers. The suggested approach also enables visualization of important markers and their correlation with the different body fluids. This model was compared to previously published methods to show that the inclusion of read count information improves the prediction. Finally, the model was applied to mixed body fluid samples to test its ability to identify the individual components in a mixture. (publisher abstract modified)
Downloads
Similar Publications
- Sex Estimation Using Metrics of the Innominate: A Test of the DSP2 Method
- ILIAD: A Suite of Automated Snakemake Workflows for Processing Genomic Data for Downstream Applications
- Large-scale Selection of Highly Informative Microhaplotypes for Ancestry Inference and Population Specific Informativeness