Journal Article

Analysis of Sequence Patterns Surrounding the Translation Initiation Sites on Cyanobacterium Genome Using the Hidden Markov Model

Tetsushi Yada, Takashi Sazuka and Makoto Hirosawa

in DNA Research

Published on behalf of Kazusa DNA Research Institute

Volume 4, issue 1, pages 1-7
Published in print January 1997 | ISSN: 1340-2838
Published online January 1997 | e-ISSN: 1756-1663 | DOI:

Show Summary Details


Sequence patterns surrounding the translation initiation sites of Cyanobacterium were precisely analyzed by the hidden Markov model (HMM) based on the actual translation initiation sites. In a previous study, 72 actual protein coding regions and their translation initiation sites on the genome of Synechocystis sp. strain PCC6803 were determined by Sazuka et al. using protein two-dimensional electrophoresis and microsequening. In this work, we extracted the sequence patterns surrounding translation initiation sites as HMM using the computer program YEBIS. The constructed HMM could recognize all but one translation initiation site. The HMM contains an AG-rich region (5.7 bp on average), as the Shine-Dalgarno sequence exclusively contains purines, upstream of the translation initiation site (−9.7 position on average) and a CT-rich region (4.2 bp on average) just upstreamfrom the translation initiation site. In addition, we found that the second amino acid (+4,5,6) could be classified into two types, one of which had C as their second codon while another of which has a nucleotide distribution relatively similar to the distribution among amino acids in the 72 proteins. This fact corresponds well to our earlier finding that when the second nucleotide of the second amino acid of a translated protein was C, an initial methionine was processed and that otherwise the methionine was intact with high frequency.

Keywords: translation initiation site; hidden Markov model; rare initiation codon

Journal Article.  0 words. 

Subjects: Genetics and Genomics

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.