Joint and Individual Variation Explained (JIVE) for integrated analysis of multiple data types. Research in several fields now requires the analysis of data sets in which multiple high-dimensional types of data are available for a common set of objects. In particular, The Cancer Genome Atlas (TCGA) includes data from several diverse genomic technologies on the same cancerous tumor samples. In this paper we introduce Joint and Individual Variation Explained (JIVE), a general decomposition of variation for the integrated analysis of such data sets. The decomposition consists of three terms: a low-rank approximation capturing joint variation across data types, low-rank approximations for structured variation individual to each data type, and residual noise. JIVE quantifies the amount of joint variation between data types, reduces the dimensionality of the data and provides new directions for the visual exploration of joint and individual structures. The proposed method represents an extension of Principal Component Analysis and has clear advantages over popular two-block methods such as Canonical Correlation Analysis and Partial Least Squares. A JIVE analysis of gene expression and miRNA data on Glioblastoma Multiforme tumor samples reveals gene-miRNA associations and provides better characterization of tumor types. par Data and software are available at https://genome.unc.edu/jive/.

References in zbMATH (referenced in 17 articles )

Showing results 1 to 17 of 17.
Sorted by year (citations)

  1. Moran, Kelly R.; Dunson, David; Wheeler, Matthew W.; Herring, Amy H.: Bayesian joint modeling of chemical structure and dose response curves (2021)
  2. Oseledets, I. V.; Kharyuk, P. V.: Structuring data with block term decomposition: decomposition of joint tensors and variational block term decomposition as a parametrized mixture distribution model (2021)
  3. Perry, Ronan; Mischler, Gavin; Guo, Richard; Lee, Theodore; Chang, Alexander; Koul, Arman; Franz, Cameron; Richard, Hugo; Carmichael, Iain; Ablin, Pierre; Gramfort, Alexandre; Vogelstein, Joshua T.: mvlearn: multiview machine learning in Python (2021)
  4. Risk, Benjamin B.; Gaynanova, Irina: Simultaneous non-Gaussian component analysis (SING) for data integration in neuroimaging (2021)
  5. Roy, Arkaprava; Lavine, Isaac; Herring, Amy H.; Dunson, David B.: Perturbed factor analysis: accounting for group differences in exposure profiles (2021)
  6. Godichon-Baggioni, Antoine; Maugis-Rabusseau, Cathy; Rau, Andrea: Multiview cluster aggregation and splitting, with an application to multiomic breast cancer data (2020)
  7. Shu, Hai; Wang, Xiao; Zhu, Hongtu: D-CCA: a decomposition-based canonical correlation analysis for high-dimensional datasets (2020)
  8. Huang, Lei; Bai, Jiawei; Ivanescu, Andrada; Harris, Tamara; Maurer, Mathew; Green, Philip; Zipunnikov, Vadim: Multilevel matrix-variate analysis and its application to accelerometry-measured physical activity in clinical populations (2019)
  9. Marron, J. S.: Comments on “Data science, big data and statistics” (2019)
  10. Marron, J. S.: Discussion: “A spatial modeling approach for linguistic object data: analysing dialect sound variations across Great Britain” (2019)
  11. Wang, Miaoyan; Fischer, Jonathan; Song, Yun S.: Three-way clustering of multi-tissue multi-individual gene expression data using semi-nonnegative tensor decomposition (2019)
  12. Aoshima, Makoto; Shen, Dan; Shen, Haipeng; Yata, Kazuyoshi; Zhou, Yi-Hui; Marron, J. S.: A survey of high dimension low sample size asymptotics (2018)
  13. Feng, Qing; Jiang, Meilei; Hannig, Jan; Marron, J. S.: Angle-based joint and individual variation explained (2018)
  14. Jung, Sungkyu: Continuum directions for supervised dimension reduction (2018)
  15. Li, Gen; Gaynanova, Irina: A general framework for association analysis of heterogeneous data (2018)
  16. Li, Gen; Yang, Dan; Nobel, Andrew B.; Shen, Haipeng: Supervised singular value decomposition and its asymptotic properties (2016)
  17. Lock, Eric F.; Hoadley, Katherine A.; Marron, J. S.; Nobel, Andrew B.: Joint and individual variation explained (JIVE) for integrated analysis of multiple data types (2013)