U.S. flag

An official website of the United States government, Department of Justice.

NCJRS Virtual Library

The Virtual Library houses over 235,000 criminal justice resources, including all known OJP works.
Click here to search the NCJRS Virtual Library

Hidden Markov Models for DNA Sequencing

NCJ Number
309116
Author(s)
Petros Boufounos; Sameh El-Difrawy; Dan Ehrlich
Date Published
2002
Length
4 pages
Annotation

This research presents using Hidden Markov Models to resolve DNA basecalling problems. 

Abstract

This paper proposes Hidden Markov Models (HMMs) as an approach to the DNA basecalling problem. The authors model the state emission densities using Artificial Neural Networks, and provide a modified Baum-Welch re-estimation procedure to perform training. Moreover, the authors develop a method that exploits consensus sequences to label training data, thus minimizing the need for hand-labeling. The results demonstrate the potential of these models and suggest further research. The authors also perform a careful study of the basecalling errors and propose alternative HMM topologies that might further improve performance. The authors conclude by suggesting further research directions.