166836 Model-based identification of outliers in gene expression

Tuesday, November 6, 2007: 9:10 AM

Xiaogang Zhong , Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD
Giovanni Parmigiani , Department of Biostatistics, Johns Hopkins University, Baltimore, MD
Detecting genes whose expression is altered in tumor cells is a major challenge in cancer research. Microarray technologies, which can measure the expression of thousands of genes simultaneously, has been widely used for this purpose. Because of the variety of molecular mechanisms that can lead to cancer, genes of interest are expected to be differentially expressed in a small subset of the samples, which, statistically, behave like “outliers”.

We developed an expression-based molecular classification method (POE) to discover novel biological classes and identify genes associated with them. The key idea of this method is that it models the gene expression using the latent categories that a gene is turned “on” or “off” compared to the baseline genes, and therefore estimates the probabilities of being differentially expressed. With the probabilities calculated, POE provides a numeric measure of how the genes are expressed, therefore helps researchers to determine whether a gene is outlier or not.

We compare our method with the previous approaches using simulation studies and a public dataset for transcriptional response of patients after radiation therapy. POE is proved to have more competitive power of finding the outliers and better control of the error rate.

Learning Objectives:
* Develop and demonstrate a new statistical method for gene outliers detection. * Evaluate several previously developed approaches and POE via synthetic data and a skin cancer data set. * Discuss remaining questions and possible future approaches.

Keywords: Genetics, Biostatistics

Presenting author's disclosure statement:

Any relevant financial relationships? No
Any institutionally-contracted trials related to this submission?

I agree to comply with the American Public Health Association Conflict of Interest and Commercial Support Guidelines, and to disclose to the participants any off-label or experimental uses of a commercial product or service discussed in my presentation.