Journal Article

Parallelized prediction error estimation for evaluation of high-dimensional models

Christine Porzelius, Harald Binder and Martin Schumacher

in Bioinformatics

Volume 25, issue 6, pages 827-829
Published in print March 2009 | ISSN: 1367-4803
Published online January 2009 | e-ISSN: 1460-2059 | DOI:
Parallelized prediction error estimation for evaluation of high-dimensional models

Show Summary Details


Summary: There is a multitude of new techniques that promise to extract predictive information in bioinformatics applications. It has been recognized that a first step for validation of the resulting model fits should rely on proper use of resampling techniques. However, this advice is frequently not followed, potential reasons being difficulty of correct implementation and computational demand. This is addressed by the R package peperr, which is designed for reliable prediction error estimation through resampling, potentially accelerated by parallel execution on a compute cluster. Its interface allows easy connection to newly developed model fitting routines. Performance evaluation of the latter is furthermore guided by diagnostic plots, which helps to detect specific problems due to high-dimensional data structures.



Supplementary information: Supplementary data are available at Bioinformatics online.

Journal Article.  1672 words.  Illustrated.

Subjects: Bioinformatics and Computational Biology

Full text: subscription required

How to subscribe Recommend to my Librarian

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.