GLORIA — GEOMAR Library Ocean Research Information Access

1

Electronic Resource

Polarographic Characteristics of Carbohydrates. The Aldose Oximes and Semicarbazones (1962)

Haas, J. W. ; Storey, J. D. ; Lynch, C. C.

s.l. : American Chemical Society

Analytical Chemistry 34 (1962), pp. 145-147

Details

ISSN: 1520-6882

Source: ACS Legacy Archives

Topics: Chemistry and Pharmacology

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1021/ac60181a044

2

Design and Analysis of Bar-seq Experiments (2014)

Robinson, D. G., Chen, W., Storey, J. D., Gresham, D.

Genetics Society of America (GSA)

In: G3: Genes, Genomes, Genetics

Details

Publication Date: 2014-01-11

Description: High-throughput quantitative DNA sequencing enables the parallel phenotyping of pools of thousands of mutants. However, the appropriate analytical methods and experimental design that maximize the efficiency of these methods while maintaining statistical power are currently unknown. Here, we have used Bar-seq analysis of the Saccharomyces cerevisiae yeast deletion library to systematically test the effect of experimental design parameters and sequence read depth on experimental results. We present computational methods that efficiently and accurately estimate effect sizes and their statistical significance by adapting existing methods for RNA-seq analysis. Using simulated variation of experimental designs, we found that biological replicates are critical for statistical analysis of Bar-seq data, whereas technical replicates are of less value. By subsampling sequence reads, we found that when using four-fold biological replication, 6 million reads per condition achieved 96% power to detect a two-fold change (or more) at a 5% false discovery rate. Our guidelines for experimental design and computational analysis enable the study of the yeast deletion collection in up to 30 different conditions in a single sequencing lane. These findings are relevant to a variety of pooled genetic screening methods that use high-throughput quantitative DNA sequencing, including Tn-seq.

Electronic ISSN: 2160-1836

Topics: Biology

Published by Genetics Society of America (GSA)

3

Probabilistic models of genetic variation in structured populations applied to global human studies (2016)

Hao, W., Song, M., Storey, J. D.

Oxford University Press

In: Bioinformatics

Details

Publication Date: 2016-02-27

Description: Motivation: Modern population genetics studies typically involve genome-wide genotyping of individuals from a diverse network of ancestries. An important problem is how to formulate and estimate probabilistic models of observed genotypes that account for complex population structure. The most prominent work on this problem has focused on estimating a model of admixture proportions of ancestral populations for each individual. Here, we instead focus on modeling variation of the genotypes without requiring a higher-level admixture interpretation. Results: We formulate two general probabilistic models, and we propose computationally efficient algorithms to estimate them. First, we show how principal component analysis can be utilized to estimate a general model that includes the well-known Pritchard–Stephens–Donnelly admixture model as a special case. Noting some drawbacks of this approach, we introduce a new ‘logistic factor analysis’ framework that seeks to directly model the logit transformation of probabilities underlying observed genotypes in terms of latent variables that capture population structure. We demonstrate these advances on data from the Human Genome Diversity Panel and 1000 Genomes Project, where we are able to identify SNPs that are highly differentiated with respect to structure while making minimal modeling assumptions. Availability and Implementation: A Bioconductor R package called lfa is available at http://www.bioconductor.org/packages/release/bioc/html/lfa.html. Contact: jstorey@princeton.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Print ISSN: 1367-4803

Electronic ISSN: 1460-2059

Topics: Biology , Computer Science , Medicine

Published by Oxford University Press

4

Cell-type chromatin programming of gene expression [Statistics] (2014)

Marstrand, T. T., Storey, J. D.

National Academy of Sciences

In: PNAS - Proceedings of the National Academy of Sciences

Details

Publication Date: 2014-02-12

Description: A problem of substantial interest is to systematically map variation in chromatin structure to gene-expression regulation across conditions, environments, or differentiated cell types. We developed and applied a quantitative framework for determining the existence, strength, and type of relationship between high-resolution chromatin structure in terms of DNaseI hypersensitivity and genome-wide...

Print ISSN: 0027-8424

Electronic ISSN: 1091-6490

Topics: Biology , Medicine , Natural Sciences in General

Published by National Academy of Sciences

5

Human transcriptome array for high-throughput clinical studies [Medical Sciences] (2011)

Xu, W., Seok, J., Mindrinos, M. N., Schweitzer, A. C., Jiang, H., Wilhelmy, J., Clark, T. A., Kapur, K., Xing, Y., Faham, M., Storey, J. D., Moldawer, L. L., Maier, R. V., Tompkins, R. G., Wong, W. H., Davis, R. W., Xiao, W., the Inflammation and Host Response to Injury Large-Scale Collaborative Research Program

National Academy of Sciences

In: PNAS - Proceedings of the National Academy of Sciences

Details

Publication Date: 2011-03-02

Description: A 6.9 million-feature oligonucleotide array of the human transcriptome [Glue Grant human transcriptome (GG-H array)] has been developed for high-throughput and cost-effective analyses in clinical studies. This array allows comprehensive examination of gene expression and genome-wide identification of alternative splicing as well as detection of coding SNPs and noncoding transcripts. The performance of the array was examined and compared with mRNA sequencing (RNA-Seq) results over multiple independent replicates of liver and muscle samples. Compared with RNA-Seq of 46 million uniquely mappable reads per replicate, the GG-H array is highly reproducible in estimating gene and exon abundance. Although both platforms detect similar expression changes at the gene level, the GG-H array is more sensitive at the exon level. Deeper sequencing is required to adequately cover low-abundance transcripts. The array has been implemented in a multicenter clinical program and has generated high-quality, reproducible data. Considering the clinical trial requirements of cost, sample availability, and throughput, the GG-H array has a wide range of applications. An emerging approach for large-scale clinical genomic studies is to first use RNA-Seq at sufficient depth to discover transcriptome elements relevant to the disease process, followed by high-throughput and reliable screening of these elements on thousands of patient samples using custom-designed arrays.

Print ISSN: 0027-8424

Electronic ISSN: 1091-6490

Topics: Biology , Medicine , Natural Sciences in General

Published by National Academy of Sciences

6

The sva package for removing batch effects and other unwanted variation in high-throughput experiments (2012)

Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E., Storey, J. D.

Oxford University Press

In: Bioinformatics

Details

Publication Date: 2012-03-20

Description: Heterogeneity and latent variables are now widely recognized as major sources of bias and variability in high-throughput experiments. The most well-known source of latent variation in genomic experiments is batch effects, which arise when samples are processed on different days, in different groups or by different people. However, there are also a large number of other variables that may have a major impact on high-throughput measurements. Here we describe the sva package for identifying, estimating and removing unwanted sources of variation in high-throughput experiments. The sva package supports surrogate variable estimation with the sva function, direct adjustment for known batch effects with the ComBat function and adjustment for batch and latent variables in prediction problems with the fsva function. Availability: The R package sva is freely available from http://www.bioconductor.org. Contact: jleek@jhsph.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Print ISSN: 1367-4803

Electronic ISSN: 1460-2059

Topics: Biology , Computer Science , Medicine

Published by Oxford University Press

7

subSeq: Determining Appropriate Sequencing Depth Through Efficient Read Subsampling (2014)

Robinson, D. G., Storey, J. D.

Oxford University Press

In: Bioinformatics

Details

Publication Date: 2014-11-26

Description: Motivation: Next-generation sequencing experiments, such as RNA-Seq, play an increasingly important role in biological research. One complication is that the power and accuracy of such experiments depend substantially on the number of reads sequenced, so it is important and challenging to determine the optimal read depth for an experiment or to verify whether one has adequate depth in an existing experiment. Results: By randomly sampling lower depths from a sequencing experiment and determining where the saturation of power and accuracy occurs, one can determine what the most useful depth should be for future experiments, and furthermore, confirm whether an existing experiment had sufficient depth to justify its conclusions. We introduce the subSeq R package, which uses a novel efficient approach to perform this subsampling and to calculate informative metrics at each depth. Availability and Implementation: The subSeq R package is available at http://github.com/StoreyLab/subSeq/. Contact: dgrtwo@princeton.edu or jstorey@princeton.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Print ISSN: 1367-4803

Electronic ISSN: 1460-2059

Topics: Biology , Computer Science , Medicine

Published by Oxford University Press

8

A nested parallel experiment demonstrates differences in intensity-dependence between RNA-seq and microarrays (2015)

Robinson, D. G., Wang, J. Y., Storey, J. D.

Oxford University Press

In: Nucleic Acids Research

Details

Publication Date: 2015-11-17

Description: Understanding the differences between microarray and RNA-Seq technologies for measuring gene expression is necessary for informed design of experiments and choice of data analysis methods. Previous comparisons have come to sometimes contradictory conclusions, which we suggest result from a lack of attention to the intensity-dependent nature of variation generated by the technologies. To examine this trend, we carried out a parallel nested experiment performed simultaneously on the two technologies that systematically split variation into four stages (treatment, biological variation, library preparation and chip/lane noise), allowing a separation and comparison of the sources of variation in a well-controlled cellular system, Saccharomyces cerevisiae. With this novel dataset, we demonstrate that power and accuracy are more dependent on per-gene read depth in RNA-Seq than they are on fluorescence intensity in microarrays. However, we carried out quantitative PCR validations which indicate that microarrays may demonstrate greater systematic bias in low-intensity genes than RNA-Seq does.

Keywords: Microarray Technology, Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Electronic ISSN: 1362-4962

Topics: Biology

Published by Oxford University Press

9

Statistical significance of variables driving systematic variation in high-dimensional data (2015)

Chung, N. C., Storey, J. D.

Oxford University Press

In: Bioinformatics

Details

Publication Date: 2015-02-13

Description: Motivation: There are a number of well-established methods such as principal component analysis (PCA) for automatically capturing systematic variation due to latent variables in large-scale genomic data. PCA and related methods may directly provide a quantitative characterization of a complex biological variable that is otherwise difficult to precisely define or model. An unsolved problem in this context is how to systematically identify the genomic variables that are drivers of systematic variation captured by PCA. Principal components (PCs) (and other estimates of systematic variation) are directly constructed from the genomic variables themselves, making measures of statistical significance artificially inflated when using conventional methods due to over-fitting. Results: We introduce a new approach called the jackstraw that allows one to accurately identify genomic variables that are statistically significantly associated with any subset or linear combination of PCs. The proposed method can greatly simplify complex significance testing problems encountered in genomics and can be used to identify the genomic variables significantly associated with latent variables. Using simulation, we demonstrate that our method attains accurate measures of statistical significance over a range of relevant scenarios. We consider yeast cell-cycle gene expression data, and show that the proposed method can be used to straightforwardly identify genes that are cell-cycle regulated with an accurate measure of statistical significance. We also analyze gene expression data from post-trauma patients, allowing the gene expression data to provide a molecularly driven phenotype. Using our method, we find a greater enrichment for inflammatory-related gene sets compared to the original analysis that uses a clinically defined, although likely imprecise, phenotype. The proposed method provides a useful bridge between large-scale quantifications of systematic variation and gene-level significance analyses. Availability and implementation: An R software package, called jackstraw, is available in CRAN. Contact: jstorey@princeton.edu

Print ISSN: 1367-4803

Electronic ISSN: 1460-2059

Topics: Biology , Computer Science , Medicine

Published by Oxford University Press
