In:
Bioinformatics, Oxford University Press (OUP), Vol. 33, No. 12 ( 2017-06-15), p. 1892-1894
Abstract:
Non-negative Matrix Factorization (NMF) algorithms associate gene expression with biological processes (e.g. time-course dynamics or disease subtypes). Compared with univariate associations, the relative weights of NMF solutions can obscure biomarkers. Therefore, we developed a novel patternMarkers statistic to extract genes for biological validation and enhanced visualization of NMF results. Finding novel and unbiased gene markers with patternMarkers requires whole-genome data. Therefore, we also developed Genome-Wide CoGAPS Analysis in Parallel Sets (GWCoGAPS), the first robust whole genome Bayesian NMF using the sparse, MCMC algorithm, CoGAPS. Additionally, a manual version of the GWCoGAPS algorithm contains analytic and visualization tools including patternMatcher, a Shiny web application. The decomposition in the manual pipeline can be replaced with any NMF algorithm, for further generalization of the software. Using these tools, we find granular brain-region and cell-type specific signatures with corresponding biomarkers in GTEx data, illustrating GWCoGAPS and patternMarkers ascertainment of data-driven biomarkers from whole-genome data. Availability and Implementation PatternMarkers & GWCoGAPS are in the CoGAPS Bioconductor package (3.5) under the GPL license. Supplementary information Supplementary data are available at Bioinformatics online.
Type of Medium:
Online Resource
ISSN:
1367-4803
,
1367-4811
DOI:
10.1093/bioinformatics/btx058
Language:
English
Publisher:
Oxford University Press (OUP)
Publication Date:
2017
detail.hit.zdb_id:
1468345-3
SSG:
12
Permalink