Queen's University - Utility Bar

QSpace at Queen's University >
Graduate Theses, Dissertations and Projects >
Queen's Graduate Theses and Dissertations >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1974/5390

Title: Data Mining the Genetics of Leukemia
Authors: Morton, Geoffrey

Files in This Item:

File Description SizeFormat
Morton_Geoffrey_S_201001_MSc.pdf594.19 kBAdobe PDFView/Open
Keywords: Data Mining
Acute Lymphoblastic Leukemia
Issue Date: 2010
Series/Report no.: Canadian theses
Abstract: Acute Lymphoblastic Leukemia (ALL) is the most common cancer in children under the age of 15. At present, diagnosis, prognosis and treatment decisions are made based upon blood and bone marrow laboratory testing. With advances in microarray technology it is becoming more feasible to perform genetic assessment of individual patients as well. We used Singular Value Decomposition (SVD) on Illumina SNP, Affymetrix and cDNA gene-expression data and performed aggressive attribute se- lection using random forests to reduce the number of attributes to a manageable size. We then explored clustering and prediction of patient-specific properties such as disease sub-classification, and especially clinical outcome. We determined that integrating multiple types of data can provide more meaningful information than individual datasets, if combined properly. This method is able to capture the cor- relation between the attributes. The most striking result is an apparent connection between genetic background and patient mortality under existing treatment regimes. We find that we can cluster well using the mortality label of the patients. Also, using a Support Vector Machine (SVM) we can predict clinical outcome with high accu-racy. This thesis will discuss the data-mining methods used and their application to biomedical research, as well as our results and how this will affect the diagnosis and treatment of ALL in the future.
Description: Thesis (Master, Computing) -- Queen's University, 2010-01-12 18:40:44.2
URI: http://hdl.handle.net/1974/5390
Appears in Collections:Queen's Graduate Theses and Dissertations
School of Computing Graduate Theses

Items in QSpace are protected by copyright, with all rights reserved, unless otherwise indicated.


  DSpace Software Copyright © 2002-2008  The DSpace Foundation - TOP