Using Cluster Analysis, Cluster Validation, and Consensus Clustering to Identify Subtypes
Shen, Jess Jiangsheng
MetadataShow full item record
Pervasive Developmental Disorders (PDDs) are neurodevelopmental disorders characterized by impairments in social interaction, communication and behaviour [Str04]. Given the diversity and varying severity of PDDs, diagnostic tools attempt to identify homogeneous subtypes within PDDs. The diagnostic system Diagnostic and Statistical Manual of Mental Disorders - Fourth Edition (DSM-IV) divides PDDs into five subtypes. Several limitations have been identified with the categorical diagnostic criteria of the DSM-IV. The goal of this study is to identify putative subtypes in the multidimensional data collected from a group of patients with PDDs, by using cluster analysis. Cluster analysis is an unsupervised machine learning method. It offers a way to partition a dataset into subsets that share common patterns. We apply cluster analysis to data collected from 358 children with PDDs, and validate the resulting clusters. Notably, there are many cluster analysis algorithms to choose from, each making certain assumptions about the data and about how clusters should be formed. A way to arrive at a meaningful solution is to use consensus clustering to integrate results from several clustering attempts that form a cluster ensemble into a unified consensus answer, and can provide robust and accurate results [TJPA05]. In this study, using cluster analysis, cluster validation, and consensus clustering, we identify four clusters that are similar to – and further refine three of the five subtypes defined in the DSM-IV. This study thus confirms the existence of these three subtypes among patients with PDDs.
URI for this recordhttp://hdl.handle.net/1974/920
Request an alternative formatIf you require this document in an alternate, accessible format, please contact the Queen's Adaptive Technology Centre
Showing items related by title, author, creator and subject.
Misiak, Andrew (2015-09-29)The transition from industrial to post-industrial economies has left many port and industrial areas, like Toronto's Port Lands, derelict and underutilized. Creative clusters have been turned to as an effective policy ...
Design and Evaluation of Efficient Collective Communications on Modern Interconnects and Multi-core Clusters Qian, Ying (2010-01-11)Two driving forces behind high-performance clusters are the availability of modern interconnects and the advent of multi-core systems. As multi-core clusters become commonplace, where each core will run at least one process ...
Rashti, Mohammad Javad (2011-01-26)High Performance Computing (HPC) is the key to solving many scientific, financial, and engineering problems. Computer clusters are now the dominant architecture for HPC. The scale of clusters, both in terms of processor ...