Header

UZH-Logo

Maintenance Infos

PCorral - interactive mining of protein interactions from MEDLINE


Li, Chen; Jimeno-Yepes, Antonio; Arregui, Miguel; Kirsch, Harald; Rebholz-Schuhmann, Dietrich (2013). PCorral - interactive mining of protein interactions from MEDLINE. Database, 2013:bat030.

Abstract

The extraction of information from the scientific literature is a complex task-for researchers doing manual curation and for automatic text processing solutions. The identification of protein-protein interactions (PPIs) requires the extraction of protein named entities and their relations. Semi-automatic interactive support is one approach to combine both solutions for efficient working processes to generate reliable database content. In principle, the extraction of PPIs can be achieved with different methods that can be combined to deliver high precision and/or high recall results in different combinations at the same time. Interactive use can be achieved, if the analytical methods are fast enough to process the retrieved documents. PCorral provides interactive mining of PPIs from the scientific literature allowing curators to skim MEDLINE for PPIs at low overheads. The keyword query to PCorral steers the selection of documents, and the subsequent text analysis generates high recall and high precision results for the curator. The underlying components of PCorral process the documents on-the-fly and are available, as well, as web service from the Whatizit infrastructure. The human interface summarizes the identified PPI results, and the involved entities are linked to relevant resources and databases. Altogether, PCorral serves curator at both the beginning and the end of the curation workflow for information retrieval and information extraction. Database URL: http://www.ebi.ac.uk/Rebholz-srv/pcorral.

Abstract

The extraction of information from the scientific literature is a complex task-for researchers doing manual curation and for automatic text processing solutions. The identification of protein-protein interactions (PPIs) requires the extraction of protein named entities and their relations. Semi-automatic interactive support is one approach to combine both solutions for efficient working processes to generate reliable database content. In principle, the extraction of PPIs can be achieved with different methods that can be combined to deliver high precision and/or high recall results in different combinations at the same time. Interactive use can be achieved, if the analytical methods are fast enough to process the retrieved documents. PCorral provides interactive mining of PPIs from the scientific literature allowing curators to skim MEDLINE for PPIs at low overheads. The keyword query to PCorral steers the selection of documents, and the subsequent text analysis generates high recall and high precision results for the curator. The underlying components of PCorral process the documents on-the-fly and are available, as well, as web service from the Whatizit infrastructure. The human interface summarizes the identified PPI results, and the involved entities are linked to relevant resources and databases. Altogether, PCorral serves curator at both the beginning and the end of the curation workflow for information retrieval and information extraction. Database URL: http://www.ebi.ac.uk/Rebholz-srv/pcorral.

Statistics

Citations

Dimensions.ai Metrics
10 citations in Web of Science®
12 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

88 downloads since deposited on 23 Oct 2013
4 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Scopus Subject Areas:Physical Sciences > Information Systems
Life Sciences > General Biochemistry, Genetics and Molecular Biology
Life Sciences > General Agricultural and Biological Sciences
Uncontrolled Keywords:General Biochemistry, Genetics and Molecular Biology, General Agricultural and Biological Sciences, Information Systems
Language:English
Date:2013
Deposited On:23 Oct 2013 08:14
Last Modified:24 Jan 2022 01:48
Publisher:Oxford University Press
ISSN:1758-0463
Funders:Cambridge Overseas Trust, EMBL-EBI
OA Status:Gold
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.1093/database/bat030
Project Information:
  • : FunderFP7
  • : Grant ID296410
  • : Project TitleMANTRA - Multilingual Annotation of Named Entities and Terminology Resources Acquisition
  • : Funder
  • : Grant ID
  • : Project TitleCambridge Overseas Trust
  • : Funder
  • : Grant ID
  • : Project TitleEMBL-EBI
  • Content: Published Version