Journal Article

Bayesian inference of protein–protein interactions from biological literature

Rajesh Chowdhary, Jinfeng Zhang and Jun S. Liu

in Bioinformatics

Volume 25, issue 12, pages 1536-1542
Published in print June 2009 | ISSN: 1367-4803
Published online April 2009 | e-ISSN: 1460-2059 | DOI: http://dx.doi.org/10.1093/bioinformatics/btp245
Bayesian inference of protein–protein interactions from biological literature

Show Summary Details

Preview

Motivation: Protein–protein interaction (PPI) extraction from published biological articles has attracted much attention because of the importance of protein interactions in biological processes. Despite significant progress, mining PPIs from literatures still rely heavily on time- and resource-consuming manual annotations.

Results: In this study, we developed a novel methodology based on Bayesian networks (BNs) for extracting PPI triplets (a PPI triplet consists of two protein names and the corresponding interaction word) from unstructured text. The method achieved an overall accuracy of 87% on a cross-validation test using manually annotated dataset. We also showed, through extracting PPI triplets from a large number of PubMed abstracts, that our method was able to complement human annotations to extract large number of new PPIs from literature.

Availability: Programs/scripts we developed/used in the study are available at http://stat.fsu.edu/~jinfeng/datasets/Bio-SI-programs-Bayesian-chowdhary-zhang-liu.zip

Contact: jliu@stat.harvard.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Journal Article.  6555 words.  Illustrated.

Subjects: Bioinformatics and Computational Biology

Full text: subscription required

How to subscribe Recommend to my Librarian

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.