This study demonstrated that direct analysis in real time-high-resolution mass spectrometry (DART-HRMS) combined with supervised Kohonen Self-Organizing Maps (SOM) enables accomplishment of species-level identification of larva, pupa, and adult life stages of carrion flies.
Species determination of the various life stages of flies (Order: Diptera) is challenging, particularly for the immature forms, because analogous life stages of different species are difficult to differentiate based on morphological features alone. In the current study, DART-HRMS data for each life stage were acquired from analysis of ethanol suspensions representing Calliphoridae, Phoridae, and Sarcophagidae families, without additional sample preparation. After preprocessing, the data were subjected to a combination of minimum Redundancy Maximal Relevance (mRMR) and Sparse Discriminant Analysis (SDA) methods to select the most significant variables for creating accurate SOM models. The resulting data were divided into training and validation sets and then analyzed by the SOM method to define the proper discrimination models. The 5-fold venetian blind cross-validation misclassification error was below 7 percent for all life stages, and the validation samples were correctly identified in all cases. The multiclass SOM model also revealed which chemical components were the most significant markers for each species, with several of these being amino acids. The results show that processing of DART-HRMS data using artificial neural networks (ANNs) based on the Kohonen SOM approach enables rapid discrimination and identification of fly species even for the immature life stages. The ANNs can be continuously expanded to include a larger number of species and can be used to screen DART-HRMS data from unknowns to rapidly determine species identity. (publisher abstract modified)