Courses home > Promoter analysis

Promoters play an essential role in controlling gene expression. It is at this site that the RNA polymerase binds for transcription. Understanding promoter strength and regulation will enhance our understanding of gene expression. Multiple functional sites are involved in the binding of the polymerase. Such elements as the TATA box, GC box, CAAT box serve as binding sites for transcription factors. By analyzing these individual elements within the promoter sites, as well as their combinatorial effects, our understanding of promoter strength and regulation will be enhanced, thus increasing our comprehension of gene expression. Transcription factors play a central role in gene regulation and there are many databases that are dedicated to them. These programs work with such databases to predict and identify characteristics of the queried putative promoter sequences.

[ MPromDB ]

Mammalian Promoter Database (MPromDb) is an integrated novel database for gene promoters with experimentally supported annotation of transcription start sites (TSS), cis-regulatory elements, CpG islands and ChIP-chip experimental results with intuitively visualized presentation.

[ eukaryotic promoter database ]

The Eukaryotic Promoter Database at the Swiss Institute for Bioinformatics (SIB-EPD) is an annotated non-redundant database of eukaryotic polymerase II promoters for which the transcription start sites (TSS) have been determined. The EPD also has a suite of signal search analysis programs.

[ TransFac ]

TRANSFAC is the database on eukaryotic transcription factors, their genomic binding sites and DNA-binding profiles.

[ JASPAR ]

JASPAR is a collection of transcription factor DNA-binding preferences, modelled as matrices.

[ ConSite ]

ConSite allows the analysis of either one, or a number or, promoter sequences to find conserved transcription factor binding sites.

[ MatchTM ]

MatchTM is designed for searching potential binding sites for transcription factors (TF binding sites) nucleotide sequences. MatchTM uses a library of mononucleotide weight matrices from TRANSFAC 6.0.

[ PROMO ]

PROMO is a virtual laboratory for the identification of putative transcription factor binding sites (TFBS) in DNA sequences from a species or groups of species of interest. TFBS defined in the TRANSFAC database are used to construct specific binding site weight matrices for TFBS prediction.

[ OTFBS ]

The Over-represented Transcription Factor Binding Site (OTFBS) Prediction Tool tries to detect over-represented motifs of known transcription factors in a set of upstream sequences of similarly regulated genes. These genes can be clustered together with microarray data, or just be genes from the same functional protein from a series of related species.

[ CONFAC ]

The CONFAC software finds the conserved Transcription Factor Binding Sites (TFBS) in the promoter regions of the genes of given Human genes and the corresponding Mouse homolog.

[ ModTools ]

The WeederH program in the suite of ModTools allows the discovery of transcription factor binding sites and regulatory regions in sequences from homologous genes.

[ Genomatix ]

Genomatix is a commercial company that has a number of software tools and databases related to promoter analysis. One of the more interesting things that Genomatix can do is to find "frameworks" of transcription factor binding sites that are commmon to a set of related sequences. PolII promoters usually consist of multiple binding sites for transcription factors which mediate the promoter function. Frameworks are sets of sequences that share a common orientation and distance organization between a set of related sequences.

sample promoters file

Sequence retrievalHomology searchingSequence alignmentPhylogenetic analysis
News
Jun, 2008; Bioinformatics meets Alzheimer's disease research. Read about the discovery of the CALHM1 P86L polymorphism. The study appeared in the June 27th issue of Cell. [More]
Mar, 2008; A free bioinformatics walk-in clinic will be available every Monday, 1-3pm at the Weill Cornell Medical Library, in the Computer Room on the lower level. [More]

[News Archives] [Mailing List]


Events
Aug 25-29, 2008: Stanford University, CA - 7th Annual International Conference on Computational Systems Bioinformatics. Hosted by: Life Sciences Society [More]
Sep 22-26, 2008: Goettingen, Germany - Fall Course on Computational Neuroscience at the Max Planck Institute for Dynamics and Self-Organization. This annual course comprises tutorial lectures and seminar style coverage of selected current topics. Registration deadline: Aug 8, 2008. [More]