RESEARCH ARTICLE


Pattern-Based Gene-Set Recognition for Interpreting Genome-Wide Gene Expression Profiles



Xutao Deng*, 1, 2, Charles Wang1, 2
1 Transcriptional Genomics Core, Burns Allen Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
2 Department of Medicine, UCLA David Geffen School of Medicine, Los Angeles, CA, USA


Article Metrics

CrossRef Citations:
0
Total Statistics:

Full-Text HTML Views: 526
Abstract HTML Views: 1150
PDF Downloads: 652
Total Views/Downloads: 2328
Unique Statistics:

Full-Text HTML Views: 341
Abstract HTML Views: 768
PDF Downloads: 466
Total Views/Downloads: 1575



Creative Commons License
© 2008 Deng and Wang

open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

* Address correspondence to this author at the 8700 Beverly Blvd., Davis Bldg., G150-151, Los Angeles, CA90048, USA; Tel: 1-310-423-7361; Fax: 1-310-423-2303; E-mail: charles.wang@cshs.org, dengx@ucla.edu


Abstract

Background:

Accurate recognition of important gene sets from genome-wide gene expression profiles provides great insights into the underlying biological mechanisms that govern the gene expression dynamics. However, most gene set recognition algorithms rely solely on supervised sample phenotypic information, overlooking the unsupervised gene-gene expression correlations that are inherently informative in the gene expression profiles.

Results:

We developed a computational framework named PAGER (Pattern Acquisition and GEne-set Recognition) for identifying gene sets showing significant supervised and unsupervised patterns. We showcased the use of PAGER in several recent expression profiling studies including cadmium treated rat primary hepatocyte toxicogenomics study and adrenal gland periodical gene expression profiling. Our results indicate that PAGER achieved better performance in discovering truly important pathways from expression profiles which were undetected using current other existing tools. These results were further corroborated by literature and cytotoxicity experiments.

Conclusions:

PAGER integrated both supervised and unsupervised pattern metrics for gene set summarization. For each given gene set, PAGER provides a two-dimensional view showing its external activity and internal coherence pattern. PAGER employed statistical methods such as Relaxed Intersection-Union Tests, Stouffer’s method and Fisher’s method for integration of pattern significance. In addition, PAGER can be used for recognizing user-defined arbitrary gene set as demonstrated in one of our previous publications. PAGER is freely available for academic user at http://dengx.bol.ucla.edu/PAGER/PAGER.htm.