Journal Article

Sequencing and Analysis of Approximately 40 000 Soybean cDNA Clones from a Full-Length-Enriched cDNA Library

Taishi Umezawa, Tetsuya Sakurai, Yasushi Totoki, Atsushi Toyoda, Motoaki Seki, Atsushi Ishiwata, Kenji Akiyama, Atsushi Kurotani, Takuhiro Yoshida, Keiichi Mochida, Mie Kasuga, Daisuke Todaka, Kyonoshin Maruyama, Kazuo Nakashima, Akiko Enju, Saho Mizukado, Selina Ahmed, Kyoko Yoshiwara, Kyuya Harada, Yasutaka Tsubokura, Masaki Hayashi, Shusei Sato, Toyoaki Anai, Masao Ishimoto, Hideyuki Funatsuki, Masayoshi Teraishi, Mitsuru Osaki, Takuro Shinano, Ryo Akashi, Yoshiyuki Sakaki, Kazuko Yamaguchi-Shinozaki and Kazuo Shinozaki

in DNA Research

Published on behalf of Kazusa DNA Research Institute

Volume 15, issue 6, pages 333-346
Published in print December 2008 | ISSN: 1340-2838
Published online October 2008 | e-ISSN: 1756-1663 | DOI:


A large collection of full-length cDNAs is essential for the correct annotation of genomic sequences and for the functional analysis of genes and their products. We obtained a total of 39 936 soybean cDNA clones (GMFL01 and GMFL02 clone sets) from a full-length-enriched cDNA library constructed from soybean plants grown under various developmental and environmental conditions. Sequencing from the 5′ and 3′ ends of the clones generated 68 661 expressed sequence tags (ESTs). The EST sequences were clustered into 22 674 scaffolds, which included 2580 full-length sequences. In addition, we sequenced 4712 full-length cDNAs. After removing overlaps, we have so far obtained 6570 new full-length soybean cDNA sequences. Our data indicate that 87.7% of the soybean cDNA clones contain complete coding sequences in addition to 5′- and 3′-untranslated regions. Together, these data confirm that our collection of soybean full-length cDNAs covers a wide variety of genes. Comparative analysis of the derived soybean sequences against data from Arabidopsis, rice and other legumes revealed that our collection includes some specific genes, a large proportion of which could be annotated only as having unknown function. The large set of soybean full-length cDNA clones reported in this study will serve as a useful resource for gene discovery in soybean and will aid precise annotation of the soybean genome.

Keywords: EST; full-length cDNA; functional annotation; legume; soybean

Journal Article.  6638 words.  Illustrated.

Subjects: Genetics and Genomics
