Please use this identifier to cite or link to this item: http://hdl.handle.net/1860/375

Title: Identifying differentially expressed genes in DNA microarray data
Authors: Loo, Lit-Hsin
Keywords: DNA microarrays;Bioinformatics;Statistics;Computer engineering
Issue Date: 28-Dec-2004
Abstract: We have developed two new nonparametric statistical tests for identifying differentially expressed genes in DNA microarray data: the average difference score (ADS) and the mean difference score (MDS). The ADS generalizes the independently consistent expression (ICE) discriminator proposed by Bijlani and co-workers. The MDS extends the Welch t-test and the Fisher correlation score. The new tests replace the serial noise estimator used in existing tests with a parallel noise estimator. The result is better detection of changes in the variance of expression levels, which t-test type criteria tend to under-emphasize. We compare the performance of the new tests to that of several commonly used nonparametric tests, including the nonparametric Welch t-test, the Fisher correlation score, the Wilcoxon rank sum test, and ICE. We evaluate the tests with two standard feature selection performance criteria, feature selection accuracy and classification accuracy, and with a new criterion that we developed, the ensemble diversity. Using these criteria, we demonstrate that ADS and MDS outperform the other tests by exhibiting higher sensitivity and comparable specificity, which makes them more useful for identifying differentially expressed genes. To support this claim, we use synthetic data generated from normal and mixed normal models, and real biological data obtained from acute lymphoblastic leukemia and acute myeloid leukemia patients. ADS is able to flag several biologically important genes that are missed by the nonparametric Welch t-test.
URI: http://hdl.handle.net/1860/375
Appears in Collections:Drexel Theses and Dissertations
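
The abstract's remark that t-test type criteria under-emphasize variance changes can be illustrated with a minimal sketch. It computes only the textbook Welch t-statistic, not the ADS or MDS, whose definitions are not given in this record. The function name `welch_t` and the expression values are hypothetical and chosen for illustration.

```python
import math
import statistics


def welch_t(x, y):
    """Textbook Welch t-statistic for two samples with unequal variances."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    var_x, var_y = statistics.variance(x), statistics.variance(y)  # sample variances
    return (mean_x - mean_y) / math.sqrt(var_x / len(x) + var_y / len(y))


# Hypothetical expression levels of one gene (illustrative values only).
tight = [4.0, 5.0, 6.0]    # mean 5, sample variance 1
spread = [1.0, 5.0, 9.0]   # mean 5, sample variance 16
shifted = [7.0, 8.0, 9.0]  # mean 8, sample variance 1

# Same mean, very different variance: the statistic is exactly zero,
# so a variance-only change is invisible to this criterion.
print(welch_t(tight, spread))  # 0.0

# A shift in the mean with equal variance produces a clearly nonzero statistic.
print(welch_t(shifted, tight))
```

In this toy case the gene whose variance changes sixteenfold scores zero, which is the kind of change the abstract says the new tests are designed to detect better.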

Files in This Item:

File: Loo_LitHsin.pdf
Size: 1.68 MB
Format: Adobe PDF

Items in iDEA are protected by copyright, with all rights reserved, unless otherwise indicated.

 
