Multiple Testing Procedures with Applications to Genomics PDF Download

Are you looking for read ebook online? Search for your book and save it on your Kindle device, PC, phones or tablets. Download Multiple Testing Procedures with Applications to Genomics PDF full book. Access full book title Multiple Testing Procedures with Applications to Genomics by Sandrine Dudoit. Download full books in PDF and EPUB format.

Multiple Testing Procedures with Applications to Genomics

Multiple Testing Procedures with Applications to Genomics PDF Author: Sandrine Dudoit
Publisher: Springer
ISBN: 9780387517094
Category : Science
Languages : en
Pages : 0

Book Description
This book establishes the theoretical foundations of a general methodology for multiple hypothesis testing and discusses its software implementation in R and SAS. These are applied to a range of problems in biomedical and genomic research, including identification of differentially expressed and co-expressed genes in high-throughput gene expression experiments; tests of association between gene expression measures and biological annotation metadata; sequence analysis; and genetic mapping of complex traits using single nucleotide polymorphisms. The procedures are based on a test statistics joint null distribution and provide Type I error control in testing problems involving general data generating distributions, null hypotheses, and test statistics.

Multiple Testing Procedures with Applications to Genomics

Multiple Testing Procedures with Applications to Genomics PDF Author: Sandrine Dudoit
Publisher: Springer
ISBN: 9780387517094
Category : Science
Languages : en
Pages : 0

Book Description
This book establishes the theoretical foundations of a general methodology for multiple hypothesis testing and discusses its software implementation in R and SAS. These are applied to a range of problems in biomedical and genomic research, including identification of differentially expressed and co-expressed genes in high-throughput gene expression experiments; tests of association between gene expression measures and biological annotation metadata; sequence analysis; and genetic mapping of complex traits using single nucleotide polymorphisms. The procedures are based on a test statistics joint null distribution and provide Type I error control in testing problems involving general data generating distributions, null hypotheses, and test statistics.

Multiple Testing Procedures with Applications to Genomics

Multiple Testing Procedures with Applications to Genomics PDF Author: Sandrine Dudoit
Publisher: Springer Science & Business Media
ISBN: 0387493174
Category : Science
Languages : en
Pages : 611

Book Description
This book establishes the theoretical foundations of a general methodology for multiple hypothesis testing and discusses its software implementation in R and SAS. These are applied to a range of problems in biomedical and genomic research, including identification of differentially expressed and co-expressed genes in high-throughput gene expression experiments; tests of association between gene expression measures and biological annotation metadata; sequence analysis; and genetic mapping of complex traits using single nucleotide polymorphisms. The procedures are based on a test statistics joint null distribution and provide Type I error control in testing problems involving general data generating distributions, null hypotheses, and test statistics.

Multiple Hypothesis Testing

Multiple Hypothesis Testing PDF Author: Houston Nash Gilbert
Publisher:
ISBN:
Category :
Languages : en
Pages : 372

Book Description


Resampling-Based Multiple Testing

Resampling-Based Multiple Testing PDF Author: Peter H. Westfall
Publisher: John Wiley & Sons
ISBN: 9780471557616
Category : Mathematics
Languages : en
Pages : 382

Book Description
Combines recent developments in resampling technology (including the bootstrap) with new methods for multiple testing that are easy to use, convenient to report and widely applicable. Software from SAS Institute is available to execute many of the methods and programming is straightforward for other applications. Explains how to summarize results using adjusted p-values which do not necessitate cumbersome table look-ups. Demonstrates how to incorporate logical constraints among hypotheses, further improving power.

Multiple Testing Procedures Controlling False Discovery Rate with Applications to Genomic Data

Multiple Testing Procedures Controlling False Discovery Rate with Applications to Genomic Data PDF Author: Iris Mirales Gauran
Publisher:
ISBN:
Category :
Languages : en
Pages : 320

Book Description
In recent mutation studies, analyses based on protein domain positions are gaining popularity over traditional gene-centric approaches since the latter have limitations in considering the functional context that the position of the mutation provides. This presents a large-scale simultaneous inference problem, with hundreds of hypothesis tests to consider at the same time. The overarching objective of this thesis is to propose different multiple testing procedures which can address the problems posed by discrete genomic data. Specifically, we are interested in identifying significant mutation counts while controlling a given level of Type I error via False Discovery Rate (FDR) procedures. One main assumption is that the mutation counts follow a zero-inflated model in order to account for the true zeros in the count model and the excess zeros. The class of models considered is the Zero-inflated Generalized Poisson (ZIGP) distribution.

Large-scale Multiple Hypothesis Testing with Complex Data Structure

Large-scale Multiple Hypothesis Testing with Complex Data Structure PDF Author: Xiaoyu Dai
Publisher:
ISBN:
Category : Electronic dissertations
Languages : en
Pages : 104

Book Description
In the last decade, motivated by a variety of applications in medicine, bioinformatics, genomics, brain imaging, etc., a growing amount of statistical research has been devoted to large-scale multiple testing, where thousands or even greater numbers of tests are conducted simultaneously. However, due to the complexity of real data sets, the assumptions of many existing multiple testing procedures, e.g. that tests are independent and have continuous null distributions of p-values, may not hold. This poses limitations in their performances such as low detection power and inflated false discovery rate (FDR). In this dissertation, we study how to better proceed the multiple testing problems under complex data structures. In Chapter 2, we study the multiple testing with discrete test statistics. In Chapter 3, we study the discrete multiple testing with prior ordering information incorporated. In Chapter 4, we study the multiple testing under complex dependency structure. We propose novel procedures under each scenario, based on the marginal critical functions (MCFs) of randomized tests, the conditional random field (CRF) or the deep neural network (DNN). The theoretical properties of our procedures are carefully studied, and their performances are evaluated through various simulations and real applications with the analysis of genetic data from next-generation sequencing (NGS) experiments.

Some New Developments on Multiple Testing Procedures

Some New Developments on Multiple Testing Procedures PDF Author: Lilun Du
Publisher:
ISBN:
Category :
Languages : en
Pages : 134

Book Description
In the context of large-scale multiple testing, hypotheses are often accompanied with certain prior information. In chapter 2, we present a single-index modulated multiple testing procedure, which maintains control of the false discovery rate while incorporating prior information, by assuming the availability of a bivariate p-value for each hypothesis. To find the optimal rejection region for the bivariate p-value, we propose a criteria based on the ratio of probability density functions of the bivariate p-value under the true null and non-null. This criteria in the bivariate normal setting further motivates us to project the bivariate p-value to a single index p-value, for a wide range of directions. The true null distribution of the single index p-value is estimated via parametric and nonparametric approaches, leading to two procedures for estimating and controlling the false discovery rate. To derive the optimal projection direction, we propose a new approach based on power comparison, which is further shown to be consistent under some mild conditions. Multiple testing based on chi-squared test statistics is commonly used in many scientific fields such as genomics research and brain imaging studies. However, the challenges associated with designing a formal testing procedure when there exists a general dependence structure across the chi-squared test statistics have not been well addressed. In chapter 3, we propose a Factor Connected procedure to fill in this gap. We first adopt a latent factor structure to construct a testing framework for approximating the false discovery proportion (FDP) for a large number of highly correlated chi-squared test statistics with finite degrees of freedom k. The testing framework is then connected to simultaneously testing k linear constraints in a large dimensional linear factor model involved with some observable and unobservable common factors, resulting in a consistent estimator of FDP based on the associated unadjusted p-values.

Statistical Evidence

Statistical Evidence PDF Author: Richard Royall
Publisher: Routledge
ISBN: 1351414550
Category : Mathematics
Languages : en
Pages : 212

Book Description
Interpreting statistical data as evidence, Statistical Evidence: A Likelihood Paradigm focuses on the law of likelihood, fundamental to solving many of the problems associated with interpreting data in this way. Statistics has long neglected this principle, resulting in a seriously defective methodology. This book redresses the balance, explaining why science has clung to a defective methodology despite its well-known defects. After examining the strengths and weaknesses of the work of Neyman and Pearson and the Fisher paradigm, the author proposes an alternative paradigm which provides, in the law of likelihood, the explicit concept of evidence missing from the other paradigms. At the same time, this new paradigm retains the elements of objective measurement and control of the frequency of misleading results, features which made the old paradigms so important to science. The likelihood paradigm leads to statistical methods that have a compelling rationale and an elegant simplicity, no longer forcing the reader to choose between frequentist and Bayesian statistics.

Multiple Comparisons Using R

Multiple Comparisons Using R PDF Author: Frank Bretz
Publisher: CRC Press
ISBN: 1420010905
Category : Mathematics
Languages : en
Pages : 202

Book Description
Adopting a unifying theme based on maximum statistics, Multiple Comparisons Using R describes the common underlying theory of multiple comparison procedures through numerous examples. It also presents a detailed description of available software implementations in R. The R packages and source code for the analyses are available at http://CRAN.R-project.org After giving examples of multiplicity problems, the book covers general concepts and basic multiple comparisons procedures, including the Bonferroni method and Simes’ test. It then shows how to perform parametric multiple comparisons in standard linear models and general parametric models. It also introduces the multcomp package in R, which offers a convenient interface to perform multiple comparisons in a general context. Following this theoretical framework, the book explores applications involving the Dunnett test, Tukey’s all pairwise comparisons, and general multiple contrast tests for standard regression models, mixed-effects models, and parametric survival models. The last chapter reviews other multiple comparison procedures, such as resampling-based procedures, methods for group sequential or adaptive designs, and the combination of multiple comparison procedures with modeling techniques. Controlling multiplicity in experiments ensures better decision making and safeguards against false claims. A self-contained introduction to multiple comparison procedures, this book offers strategies for constructing the procedures and illustrates the framework for multiple hypotheses testing in general parametric models. It is suitable for readers with R experience but limited knowledge of multiple comparison procedures and vice versa. See Dr. Bretz discuss the book.

Bioinformatics and Computational Biology Solutions Using R and Bioconductor

Bioinformatics and Computational Biology Solutions Using R and Bioconductor PDF Author: Robert Gentleman
Publisher: Springer Science & Business Media
ISBN: 0387293620
Category : Computers
Languages : en
Pages : 478

Book Description
Full four-color book. Some of the editors created the Bioconductor project and Robert Gentleman is one of the two originators of R. All methods are illustrated with publicly available data, and a major section of the book is devoted to fully worked case studies. Code underlying all of the computations that are shown is made available on a companion website, and readers can reproduce every number, figure, and table on their own computers.