U.S. flag

An official website of the United States government, Department of Justice.

NCJRS Virtual Library

The Virtual Library houses over 235,000 criminal justice resources, including all known OJP works.
Click here to search the NCJRS Virtual Library

ATM: A distributed, collaborative, scalable system for automated machine learning

NCJ Number
308341
Journal
IEEE Transactions on Big Data Volume: 2017 IEEE International Dated: 2017 Pages: 151-162
Author(s)
Thomas Swearingen; Will Drevo; Bennett Cyphers; Alfredo Cuesta-Infante; Arun Ross; Kalyan Veeramachaneni
Date Published
2017
Annotation

The authors present Auto-Tuned Models for automated machine learning; they describe the purpose of their research and demonstrate the effectiveness of their system compared to human-generated solutions.

Abstract

In this paper, the authors present Auto-Tuned Models, or ATM, a distributed, collaborative, scalable system for automated machine learning. Users of ATM can simply upload a dataset, choose a subset of modeling methods, and choose to use ATM's hybrid Bayesian and multi-armed bandit optimization system. The distributed system works in a load-balanced fashion to quickly deliver results in the form of ready-to-predict models, confusion matrices, cross-validation results, and training timings. By automating hyperparameter tuning and model selection, ATM returns the emphasis of the machine learning workflow to its most irreducible part: feature engineering. The authors demonstrate the usefulness of ATM on 420 datasets from OpenML and train over three million classifiers. Their initial results show ATM can beat human-generated solutions for 30 percent of the datasets, and can do so in 1/100th of the time. (Published Abstract Provided)