Chapter title |
Integrative Exploratory Analysis of Two or More Genomic Datasets
|
---|---|
Chapter number | 2 |
Book title |
Statistical Genomics
|
Published in |
Methods in molecular biology, January 2016
|
DOI | 10.1007/978-1-4939-3578-9_2 |
Pubmed ID | |
Book ISBNs |
978-1-4939-3576-5, 978-1-4939-3578-9
|
Authors |
Chen Meng MS, Aedin Culhane PhD, Chen Meng, Aedin Culhane, Meng, Chen, Culhane, Aedin |
Editors |
Ewy Mathé, Sean Davis |
Abstract |
Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on a same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing a more easily identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed to this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 1 | 50% |
Ireland | 1 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 2 | 100% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Unknown | 12 | 100% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 5 | 42% |
Student > Bachelor | 2 | 17% |
Unknown | 5 | 42% |
Readers by discipline | Count | As % |
---|---|---|
Biochemistry, Genetics and Molecular Biology | 3 | 25% |
Agricultural and Biological Sciences | 1 | 8% |
Medicine and Dentistry | 1 | 8% |
Neuroscience | 1 | 8% |
Engineering | 1 | 8% |
Other | 0 | 0% |
Unknown | 5 | 42% |