Journal Article

Temporal clustering by affinity propagation reveals transcriptional modules in <i>Arabidopsis thaliana</i>

Steven J. Kiddle, Oliver P. F. Windram, Stuart McHattie, Andrew Mead, Jim Beynon, Vicky Buchanan-Wollaston, Katherine J. Denby and Sach Mukherjee

in Bioinformatics

Volume 26, issue 3, pages 355-362
Published in print February 2010 | ISSN: 1367-4803
Published online December 2009 | e-ISSN: 1460-2059 | DOI: http://dx.doi.org/10.1093/bioinformatics/btp673
Temporal clustering by affinity propagation reveals transcriptional modules in Arabidopsis thaliana

Show Summary Details

Preview

Motivation: Identifying regulatory modules is an important task in the exploratory analysis of gene expression time series data. Clustering algorithms are often used for this purpose. However, gene regulatory events may induce complex temporal features in a gene expression profile, including time delays, inversions and transient correlations, which are not well accounted for by current clustering methods. As the cost of microarray experiments continues to fall, the temporal resolution of time course studies is increasing. This has led to a need to take account of detailed temporal features of this kind. Thus, while standard clustering methods are both widely used and much studied, their shared shortcomings with respect to such temporal features motivates the work presented here.

Results: Here, we introduce a temporal clustering approach for high-dimensional gene expression data which takes account of time delays, inversions and transient correlations. We do so by exploiting a recently introduced, message-passing-based algorithm called Affinity Propagation (AP). We take account of temporal features of interest following an approximate but efficient dynamic programming approach due to Qian et al. The resulting approach is demonstrably effective in its ability to discern non-obvious temporal features, yet efficient and robust enough for routine use as an exploratory tool. We show results on validated transcription factor–target pairs in yeast and on gene expression data from a study of Arabidopsis thaliana under pathogen infection. The latter reveals a number of biologically striking findings.

Availability: Matlab code for our method is available at http://www.wsbc.warwick.ac.uk/stevenkiddle/tcap.html.

Contact: s.j.kiddle@warwick.ac.uk; s.n.mukherjee@warwick.ac.uk

Journal Article.  5947 words.  Illustrated.

Subjects: Bioinformatics and Computational Biology

Full text: subscription required

How to subscribe Recommend to my Librarian

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.