| |||||||||
In genetics, transcription is the first process in gene expression. In transcription, DNA is copied to RNA by an enzyme called RNA polymerase (RNAP). Transcription to yield an mRNA is the first step of protein biosynthesis.
| Contents |
A (simple) model for a bacterial gene to be transcribed looks like this:
where the -35 and -10 region base pairs is called the promoter, and |T| stands for terminator. The DNA on the template strand between the +1 site and the terminator is transcribed to mRNA, which is then translated into protein.
Promoters can differ in strength, that is, how attractive they are for RNAP. The more similar they are to a consensus sequence, the stronger they are. The "ideal" promoter in E. coli looks like this:
The followings steps occur upon initiation:
RNAP prefers to start transcripts with ATP, and to a lesser extent GTP (purine nucleotide triphosphates). UTP, and CTP are disfavoured (pyrimidine nucleotide triphosphates).
Selective transcription is mainly responsible for the differential protein synthesis among various types of cells in the same organism.
The RNAP runs along the DNA, synthesizing mRNA in the process. In bacteria, the nascending mRNA is processed right away by ribosomes.
Two termination mechanisms are well known:
Other termination mechanisms include the fact that transcription will terminate if the RNAP comes across a region with repetitious base pairs (for example, TTTTTT).
Gene expression in eukaryotes is also controlled by complex interactions between cis-acting sites within the regulatory regions of the DNA, and trans-acting factors that include transcription factors and the basal transcription complex, but eukaryotes have evolved a much more complex system for regulation of transcription. For example, eukaryotes have three RNA polymerases, in contrast to prokaryotes, which only have one.
The C-terminus of all RNAPs is highly conserved and binds to two enzyme factors which sense a poly-adenylation sequence. These factors bind to the DNA and attach approximately 200 adenines to the 3' end of the mRNA.
The basal transcription complex includes the RNA polymerase and additional proteins that are necessary for correct initiation and elongation of RNA synthesis. Eukaryotes have evolved more complex regulatory mechanisms than prokaryotes. For instance, in eukaryotes the genetic material (DNA) is synthesized in the nucleus, separated from the site of translation, the cytoplasm, by the nuclear membrane. This allows temporal regulation of gene expression by sequestration of the RNA in the nucleus, and allows for selective transport of RNAs to the cytoplasm, where the translation machinery resides. Primary transcripts in eukaryotic cells are also synthesized as a larger precursor RNAs that must be processed by splicing out non-coding sequences (introns) and ligating non-contiguous coding sequences (exons) into the mature mRNA. Primary transcripts for some genes can be quite large. The primary transcripts of the neurexin genes, for instance, are as large as 1.7 megabases (1,700,000 bases), while the mature neurexin mRNAs are under 10 kilobases (10,000 bases), with as many as 24 exons and thousands of possible alternative splice variants that produce proteins with different activities.
The core promoter of eukaryotic genes, where the core transcription complex, including RNA polymerase, is usually a region within 50 bases upstream of the transcription intitiation site. Additionally, there can be an upstream control element usually present within 2000 bases upstream of the transcription initiation site. Some genes use enhancer elements that can be thousands of bases upstream or downstream of the transcription initiation site. Combinations of these upstream elements regulate and amplify the formation of the basal transcription complex. This UCE usually contains a TATA box, a highly conserved DNA sequence that reads
A similar sequence, thus not that highly conserved, is found in the INR element (initiator element, part of the complex core promoter).
Elongation in eukaryotes is identical to elongation in prokaryotes.
A major difference between prokaryotic and eukaryotic transcription is that the latter have splicing of the primary transcript, modifying the mRNA created during transcription.
Transcription can be measured and detected in a variety of ways:
RNA synthesis by RNA polymerase had been established in vitro by several laboratories by 1965, however the RNA synthesized by these enzymes had properties that suggested the existence of an additional factor needed to terminate trascription correctly.
By the late 1960s several papers that came out of the Harvard University Biological Laboratories established the basic mechanics of gene expression in bacteria.