10.4: Details of mRNA Processing in Eukaryotic nuclei
- Page ID
- 88957
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Eukaryotic mRNA primary transcripts undergo extensive processing, including splicing, capping and polyadenylation. The steps described here are considered in order of (sometimes overlapping!) occurrence. We begin with splicing—an mRNA phenomenon.
10.4.1. Spliceosomal Introns
The coding regions of bacterial genes are continuous. The discovery of eukaryotic split genes with introns and exons came as quite a surprise. It seemed incongruous for evolution to have stuck irrelevant DNA in the middle of coding DNA! For their discovery of split genes, Richard J. Roberts and Phillip A. Sharp shared the Nobel Prize for Physiology or Medicine in 1993. In fact, all but a few eukaryotic genes are split, and some have two or more introns separating bits of coding DNA, the exons. Figure 10.14 summarizes splicing to remove introns in pre-mRNAs and to splice exons (in the right order!) to make a mature, functional mRNA.

snRNPs are particles composed of RNA and ribonuclear proteins. They bind to specific sites in an mRNA and then direct a sequential series of cuts and ligations (the splicing) necessary to process the mRNAs. The process was reminiscent of the way in which movies were edited (see the photo of an early splicing “machine” at the top of this chapter), hence the term splicing to describe mRNA processing. Figure 10.15 below highlights the role of small ribonuclear proteins (snRNPs) in splicing.

When snRNPs (often pronounced “snurps”) bind to a pair of splice sites flanking a pre-mRNA intron, they form a spliceosome, which completes the splicing, including the removal of intermediate lariat structures of the intron. The last step is to ligate exons into a continuous mRNA, with all its codons intact and in the right order—an impressive trick for some pre-mRNAs with as many as fifty introns! Spliceosome action is summarized in Figure 10.16 and in the “mRNA Splicing” animation at one of the following links.

10.4.2 Specific Nuclear-Body Proteins Facilitate snRNP Assembly and Function
Recall the organization of nuclei facilitated by nuclear bodies. Nuclear speckles are associated regions of high mRNA transcription, processing, and splicing. (Check out the animation at Nuclear Bodies and Transcription for a 3D localization of markers of nuclear body activity.)
Cajal bodies (CBs) and Gems are nuclear bodies that are similar in size and have related functions in assembling spliceosomal snRNPs. Some splicing defects correlate with mutations in the coil protein associated with Cajal bodies; others correlate with mutations in SMN proteins normally associated with Gems. One hypothesis was that CBs and Gems interact in SnRNP and spliceosome assembly—but how? Consider the results of an experiment in which antibodies to coilin and the SMN protein were localized in undifferentiated and differentiated neuroblastoma cells (Figure 10.17).

Panels A and C above are undifferentiated cells in culture; panels B and D are cells that were stimulated to differentiate. In the fluorescence micrographs at the right, arrows point to fluorescent nuclear bodies. In panel C, the green, fluorescent antibodies to coilin (arrows) localize to CBs and the red fluorescent antibodies to SMN protein (arrowhead) binds to Gems—as expected. But in panel D, the two antibodies co-localize, suggesting that the CBs and Gems aggregate in the differentiated cells. This would explain the need for both coilin and SMN proteins to produce functional snRNPs.
The CBs and gems may aggregate in differentiated cells because of an observed increase in expression of the SMN protein, producing more Gems and driving the association with CBs. This and similar experiments demonstrate that different nuclear bodies do have specific functions. They are not random structural artifacts; rather, they have evolved to organize nuclear activities in time and space in ways that are essential to the cell.
10.4.3. Group I and Group II Self-Splicing Introns
While eukaryotic spliceosomal introns are spliced using snRNPs as described above, Group I or Group II introns are removed by different mechanisms.
Group I introns interrupt mRNA and tRNA genes in bacteria and in the genomes of mitochondria and chloroplasts. They are also occasionally found in bacteriophage genes, but rarely in nuclear genes (and then only in lower eukaryotes). Group I introns are self-splicing! Thus, they are themselves ribozymes. They do not require a spliceosome with its snRNPs or other proteins for splicing activity. Instead, they fold into a secondary stem-loop structure that positions catalytic nucleotides at appropriate splice sites to excise their own introns and to religate the exons.
Group II introns are found in chloroplast and mitochondrial rRNA, mRNA, tRNA and some bacterial mRNAs. These can be quite long. They form complex stem-loop tertiary structures and self-splice, at least in a test tube! However, Group II introns encode proteins required for their own splicing in vivo. Like spliceosomal introns, they form a lariat structure at an Aresidue branch site. All of this suggests that the mechanism of spliceosomal intron splicing evolved from that of Group II introns.
10.4.4. So Why Splicing?
The puzzle implied by this question is, of course, why do higher organisms have split genes in the first place? While the following discussion can apply to all splicing, it will reference mainly spliceosomal introns. Here are some answers to the question, “Why splicing?”
- Introns in nuclear genes are typically longer (much longer!) than exons. Because introns are noncoding (i.e., non-informational) making them large targets for mutation. In effect, most noncoding DNA—including introns—can buffer the ill effects of random mutations.
- You may recall that gene duplication on one chromosome and the loss of a copy from its homologue arise from unequal recombination (nonhomologous crossing over). It occurs when similar DNA sequences align during synapsis of meiosis. In an organism that inherits a chromosome with both gene copies, the duplicate can accumulate mutations if the other retains original function. The diverging gene then becomes part of a pool of selectable DNA, the grist of evolution. Descendants of organisms that inherit the duplicated genes have diversified the gene pool, again increasing the potential for evolution and species diversity.
- Unequal recombination can also occur between similar sequences (e.g., in introns) in the same or different genes, resulting in a sharing of exons between genes. After unequal recombination between the introns that flank an exon, one gene will acquire another exon while the other will lose it. Once again, if an organism retains a copy of the participating genes with original function, the organism can make the required protein and survive. Meanwhile, the gene with the extra exon may produce a similar protein, but one with a new structural domain and function.
- Just like a complete duplicate gene, one with a new exon that adds a new function to an old gene has been entered in the pool of selectable DNA. By creating proteins with different overall functions that nonetheless share at least one domain and one common function, the phenomenon of exon shuffling increases species diversity!
An example discussed earlier involves calcium-binding proteins that regulate many cellular processes. Structurally related calcium (\(\rm Ca^{++}\)) binding domains are common to many otherwise structurally and functionally unrelated proteins. Consider exon shuffling in the unequal crossover (nonhomologous recombination) shown in Figure 10.18.

In this example, regions of strong similarity exist in different (nonhomologous) introns in the same gene. These regions align during synapsis of meiosis. Unequal crossing over between the genes inserts exon C into one of the genes. The other gene loses the exon (not shown in the illustration).
In sum, introns are buffers against deleterious mutations, and they are equally valuable as potential targets for gene duplication and exon shuffling. This makes introns key players in creating genetic diversity, the hallmark of evolution.
10.4.5 5-Prime Capping
Using GTP as a precursor, a methyl guanosine CAP added 5’-to-5’ to an mRNA functions, in part, to help mRNAs leave the nucleus and associate with ribosomes. The CAP is added to an exposed 5’ end, even as transcription and splicing are still in progress. A capping enzyme places a methylated guanosine residue at the 5’ end of the mature mRNA. Figure 10.19 shows the 5’ CAP structure (the check marks are 5’-3’ linked nucleotides).

10.4.6 3-Prime Polyadenylation
A poly(A) polymerase catalyzes the addition of adenine monophosphates (AMPs) to the 3’ end of most eukaryotic mRNAs, even before any splicing is complete. Polyadenylation requires ATP and can add several hundred AMPs to the 3’ terminus of an mRNA. The enzyme binds to an -A-A-U-A-A- sequence near the 3’ end of an mRNA and starts adding the AMPs. Polyadenylation after transcription termination is illustrated in Figure 10.20.

The result of polyadenylation is a 3’ poly(A) tail whose functions include assisting in the transit of mRNAs from the nucleus and regulating the half-life of mRNAs in the cytoplasm. The poly(A) tail shortens each time a ribosome finishes translating the mRNA.
198 mRNA 5' Capping and 3' Polyadenylation
Bacterial mRNAs have been found with very short poly(A) tails. Would you expect poly(A) tails on bacterial mRNAs to function the same way they do in eukaryotic cells? Explain your expectation.