Navigation auf zora.uzh.ch

Search

ZORA (Zurich Open Repository and Archive)

DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics

Nowicka, Malgorzata; Robinson, Mark D (2016). DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics. F1000Research, 5:1356.

Abstract

There are many instances in genomics data analyses where measurements are made on a multivariate response. For example, alternative splicing can lead to multiple expressed isoforms from the same primary transcript. There are situations where differences (e.g. between normal and disease state) in the relative ratio of expressed isoforms may have significant phenotypic consequences or lead to prognostic capabilities. Similarly, knowledge of single nucleotide polymorphisms (SNPs) that affect splicing, so-called splicing quantitative trait loci (sQTL) will help to characterize the effects of genetic variation on gene expression. RNA sequencing (RNA-seq) has provided an attractive toolbox to carefully unravel alternative splicing outcomes and recently, fast and accurate methods for transcript quantification have become available. We propose a statistical framework based on the Dirichlet-multinomial distribution that can discover changes in isoform usage between conditions and SNPs that affect relative expression of transcripts using these quantifications. The Dirichlet-multinomial model naturally accounts for the differential gene expression without losing information about overall gene abundance and by joint modeling of isoform expression, it has the capability to account for their correlated nature. The main challenge in this approach is to get robust estimates of model parameters with limited numbers of replicates. We approach this by sharing information and show that our method improves on existing approaches in terms of standard statistical performance metrics. The framework is applicable to other multivariate scenarios, such as Poly-A-seq or where beta-binomial models have been applied (e.g., differential DNA methylation). Our method is available as a Bioconductor R package called DRIMSeq.

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:07 Faculty of Science > Institute of Molecular Life Sciences
Dewey Decimal Classification:570 Life sciences; biology
Scopus Subject Areas:Life Sciences > General Biochemistry, Genetics and Molecular Biology
Life Sciences > General Immunology and Microbiology
Life Sciences > General Pharmacology, Toxicology and Pharmaceutics
Language:English
Date:2016
Deposited On:07 Feb 2017 12:19
Last Modified:17 Aug 2024 03:33
Publisher:Faculty of 1000 Ltd.
ISSN:2046-1402
OA Status:Gold
Free access at:PubMed ID. An embargo period may apply.
Publisher DOI:https://doi.org/10.12688/f1000research.8900.2
PubMed ID:28105305
Download PDF  'DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics'.
Preview
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

87 downloads since deposited on 07 Feb 2017
3 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications