Anomaly Detection via Latent Variables Learned by Variational Autoencoders
MetadataShow full item record
Modeling normality is a popular approach to anomaly detection. Learning good models of normality and choosing appropriate ways of identifying differences from normality are critical tasks for effective anomaly detection. Variational autoencoders are state of the art modeling techniques that incorporate latent variables, hidden variables that are not directly observed but instead inferred from observed variables. Approaches to anomaly detection via variational autoencoders either adopt reconstruction error as a sole anomaly detection metric, ignoring the potential of latent variables in the anomaly detection task, or consider the latent space as whole. However, reconstruction error on its own creates an incomplete picture of the latent space representation, while individual latent dimensions often fail to represent useful features for anomaly detection, dragging down the overall performance. I focus on making subtle choices using latent variables discovered via variational autoencoders to help find anomalies, from two different perspectives. The first leverages the stochastic nature of the latent variables, as each point in the latent space is sampled from probability distributions that are parameterized during the learning process. I develop the concept of expected latent representation, which I use for anomaly detection by evaluating differences between the expected latent representation and the prior. Additionally, I extend the expected latent representation to reconstruction error, by adopting the reconstruction error of the expected latent representation as an anomaly detection measure. Results from evaluations on benchmark datasets reveal similar performance or incremental improvements, in comparison to unmodified variational autoencoders, and competitive results to comparable anomaly detection techniques. The second perspective focuses on finding latent dimensions that contain valuable features for anomaly detection, where I explore subsets of dimensions. Evaluations on benchmark datasets suggest that subsets and individual dimensions, can significantly outperform the entire latent space. I propose a heuristic method for carefully selecting subsets of latent dimensions in a supervised manner. Although dataset dependent, the relative performance is improved by as much as twofold. In the end, these two perspectives demonstrate that subtle choices in crafting models of normality and measuring differences from normality, can significantly improve the anomaly detection task in a variety of datasets.
URI for this recordhttp://hdl.handle.net/1974/28854
Request an alternative formatIf you require this document in an alternate, accessible format, please contact the Queen's Adaptive Technology Centre
Showing items related by title, author, creator and subject.
The development of sulfamates as latent directed metalation groups. Total synthesis of schumanniophytine. Divergent synthesis of substituted chromone 3- and 8-carboxamides. Macklin, Todd Kristopher (2007-12-04)N,N-Diethyl-O-sulfamate (OSO2NEt2) has been established as a new directed metalation group (DMG). Its similarity to the established O-carbamate DMG has prompted investigation into its potential as a partner in transition ...
Ross-White, Amanda; Tanon, A.; Ranji, S. (Gower Publishing, Surrey, UK, 2013)
An Evaluation of Ontario Provincial Land Use and Resource Management Policies and Their Intersection with First Nations with Respect to Manifest and Latent Content - Summary Table McLeod, Fraser; Viswanathan, Leela; Whitelaw, Graham; Macbeth, Jared; McCarthy, Daniel; Alexiuk, Erin (2016-01-25)This summary table entitled An Evaluation of Ontario Provincial Land Use and Resource Management Policies and Their Intersection with First Nations with Respect to Manifest and Latent Content represents a condensed version ...