Navigation auf zora.uzh.ch

Search

ZORA (Zurich Open Repository and Archive)

Enabling population assignment from cancer genomes with SNP2pop

Huang, Qingyao; Baudis, Michael (2018). Enabling population assignment from cancer genomes with SNP2pop. bioRxiv 368647, Cold Spring Harbor Laboratory.

Abstract

For a variety of human malignancies, incidence, treatment efficacy and overall prognosis show considerable variation between different populations and ethnic groups. Disentangling the effects related to particular population backgrounds can help in both understanding cancer biology and in tailoring therapeutic interventions. Because self-reported or inferred patient data can be incomplete or misleading due to migration and genomic admixture, a data-driven ancestry estimation should be preferred. While algorithms to analyze ancestry structure from healthy individuals have been developed, an easy-to-use tool to assign population groups based on genotyping data from SNP profiles is still missing and benchmarking for the validity of population assignment strategy for aberrant cancer genomes was not tested.We benchmarked the consistency and accuracy of cross-platform population assignment. We also demonstrated its high accuracy to process unaltered as well as cancer genomes. Despite widespread and extensive somatic mutations of cancer profiling data, population assignment consistency between germline and highly mutated samples from cancer patients reached of 97% and 92% for assignment into 5 and 26 populations re-spectively. Comparison of our benchmarked results with self-reported meta-data estimated a matching rate between 88% to 92%. Despite a relatively high matching rate, the ethnicity labels indicated in meta-data are vague compared to the standardized output from our tool.We have developed a bioinformatics tool to assign the populations from genome profiling data and validated its performance in healthy as well as aberrant cancer genomes. It is ready-to-use for genotyping data from nine commercial SNP array platforms or sequencing data. This tool is effective to scrutinize the population structure in cancer genomes and provides better measure to integrate genotyping data from various platforms instead of self-reported information. It will facilitate research on interplay between ethnicity related genetic background and molecular patterns in cancer entities and disentangling possible hereditary contributions.The docker image of the tool is provided in DockerHub as “baudisgroup/snp2pop”.

Additional indexing

Item Type:Working Paper
Communities & Collections:07 Faculty of Science > Institute of Molecular Life Sciences
Dewey Decimal Classification:570 Life sciences; biology
Language:English
Date:14 July 2018
Deposited On:27 Aug 2021 14:52
Last Modified:27 May 2024 15:24
Series Name:bioRxiv
ISSN:2164-7844
OA Status:Green
Publisher DOI:https://doi.org/10.1101/368647
Official URL:https://www.biorxiv.org/content/10.1101/368647v1
Related URLs:https://www.nature.com/articles/s41598-020-61854-x
https://www.zora.uzh.ch/id/eprint/186543/
Download PDF  'Enabling population assignment from cancer genomes with SNP2pop'.
Preview
  • Content: Published Version
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

43 downloads since deposited on 27 Aug 2021
28 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications