Journal Article

Intrinsic Promoter Activities of Primary DNA Sequences in the Human Genome

Yuta Sakakibara, Takuma Irie, Yutaka Suzuki, Riu Yamashita, Hiroyuki Wakaguri, Akinori Kanai, Joe Chiba, Toshihisa Takagi, Junko Mizushima-Sugano, Shin-ichi Hashimoto, Kenta Nakai and Sumio Sugano

in DNA Research

Published on behalf of Kazusa DNA Research Institute

Volume 14, issue 2, pages 71-77
Published in print January 2007 | ISSN: 1340-2838
Published online May 2007 | e-ISSN: 1756-1663 | DOI: http://dx.doi.org/10.1093/dnares/dsm006

Abstract

To obtain an overview of the promoter activities intrinsic to primary DNA sequences in the human genome within a particular cell type, we carried out systematic quantitative luciferase assays of DNA fragments corresponding to putative promoters of 472 human genes that are expressed in HEK (human embryonic kidney epithelial) 293 cells. The promoter activities were distributed bimodally; putative promoters in the first group (with strong promoter activities) were designated P1 and those in the second group (with weak promoter activities) P2. The frequencies of TATA-boxes and CpG islands and the overall G + C content differed significantly between these two populations, indicating that there are two separate groups of promoters. Interestingly, a similar analysis of 251 randomly isolated genomic DNA fragments showed that P2-type promoters occasionally occur within the human genome. Furthermore, 35 DNA fragments corresponding to putative promoters of non-protein-coding transcripts (ncRNAs) shared similar features with P2 in both promoter activity and sequence composition. At least some of the ncRNAs, which have been identified in large numbers by full-length cDNA projects with no functional relevance inferred, may have originated from these sporadic promoter activities of primary DNA sequences inherent to the human genome.

Keywords: human genome; promoter; transcriptional start site

Journal Article.  4242 words.  Illustrated.

Subjects: Genetics and Genomics
