Header

UZH-Logo

Maintenance Infos

How to analyze many contingency tables simultaneously in genetic association studies


Dickhaus, Thorsten; Straßburger, Klaus; Schunk, Daniel; Morcillo-Suarez, Carlos; Illig, Thomas; Navarro, Arcadi (2012). How to analyze many contingency tables simultaneously in genetic association studies. Statistical Applications in Genetics and Molecular Biology, 11(4):Article 12.

Abstract

We study exact tests for (2 x 2) and (2 x 3) contingency tables, in particular exact chi-squared tests and exact tests of Fisher type. In practice, these tests are typically carried out without randomization, leading to reproducible results but not exhausting the significance level. We discuss that this can lead to methodological and practical issues in a multiple testing framework when many tables are simultaneously under consideration as in genetic association studies.Realized randomized p-values are proposed as a solution which is especially useful for data-adaptive (plug-in) procedures. These p-values allow to estimate the proportion of true null hypotheses much more accurately than their non-randomized counterparts. Moreover, we address the problem of positively correlated p-values for association by considering techniques to reduce multiplicity by estimating the "effective number of tests" from the correlation structure.An algorithm is provided that bundles all these aspects, efficient computer implementations are made available, a small-scale simulation study is presented and two real data examples are shown

Abstract

We study exact tests for (2 x 2) and (2 x 3) contingency tables, in particular exact chi-squared tests and exact tests of Fisher type. In practice, these tests are typically carried out without randomization, leading to reproducible results but not exhausting the significance level. We discuss that this can lead to methodological and practical issues in a multiple testing framework when many tables are simultaneously under consideration as in genetic association studies.Realized randomized p-values are proposed as a solution which is especially useful for data-adaptive (plug-in) procedures. These p-values allow to estimate the proportion of true null hypotheses much more accurately than their non-randomized counterparts. Moreover, we address the problem of positively correlated p-values for association by considering techniques to reduce multiplicity by estimating the "effective number of tests" from the correlation structure.An algorithm is provided that bundles all these aspects, efficient computer implementations are made available, a small-scale simulation study is presented and two real data examples are shown

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

21 downloads since deposited on 06 Nov 2018
21 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:National licences > 142-005
Dewey Decimal Classification:Unspecified
Language:English
Date:27 January 2012
Deposited On:06 Nov 2018 16:07
Last Modified:24 Sep 2019 23:41
Publisher:Bepress
ISSN:1544-6115
OA Status:Green
Publisher DOI:https://doi.org/10.1515/1544-6115.1776
Related URLs:https://www.swissbib.ch/Search/Results?lookfor=nationallicencegruyter101515154461151776 (Library Catalogue)
PubMed ID:22850061

Download

Download PDF  'How to analyze many contingency tables simultaneously in genetic association studies'.
Preview
Content: Published Version
Language: English
Filetype: PDF (Nationallizenz 142-005)
Size: 878kB
View at publisher