2.5: Nucleotides
- Page ID
- 16097
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Nucleotides, the building blocks of RNA and DNA, are themselves composed of a pentose sugar attached to a nitrogenous base on one side and a phosphate group on another. The sugar is either the 5-carbon sugar ribose or its close cousin, deoxyribose (the “deoxy” refers to a “missing” hydroxyl group on the 2-carbon, which has an H instead). The attached nitrogenous base can be a purine, which is a 6-member ring fused to a 5-member ring, or a pyrimidine, which is a single 6-membered ring. These bases are usually adenine (purine), guanine (purine), thymine (pyrimidine), and cytosine (pyrimidine) for DNA, with a substitution of uracil for thymine in RNA bases. However, there are also some unconventional and modified bases that show up in special situations, such as in tRNAs. In addition to being the monomer components of DNA and RNA, nucleotides have other important functions as well. The best known, adenosine triphosphate, or ATP, is the primary “instant” energy source for the cell by the energy released through hydrolysis of its terminal phosphate group.
DNA or RNA are built from nucleotides through linkages of the sugars, and the polymerization occurs by condensation reactions, but these bonds are not glycosidic bonds like with polysaccharides. Instead, bonds form between the 5’ phosphate group of one nucleotide and the 3’ hydroxyl group of another. These are phosphodiester bonds, and a quick glance at the structure (Figure \(\PageIndex{8}\)) explains the naming: an ester bond is a carbonoxygen linkage, and the phosphodiester bond is a C-O-P-O-C, so there are two esters with a phosphorus linking them. With the purine or pyrimidine base on the 1-carbon, this arrangement places the bases on the opposite side of the sugar from the polymerizing phosphodiester bonds. This forms a sugar-phosphate backbone to the DNA/RNA, which then has the bases projecting out from it.
The bases will then likely interact with the bases of other nucleotides, whether part of another nucleic acid strand or free-floating. Not only do they interact, but they interact with great specificity and consistency: adenines base-pair with thymines (or uracils) through two hydrogen bonds, while guanines interact with cytosine through three H-bonds. Note that while one extra hydrogen bond does not appear to be particularly significant, the attraction between G-C is 50% stronger than between A-T, and over long stretches of DNA, areas high in G-C content are significantly more difficult to unzip (separate strands) than areas high in A-T pairs. This specific base-pairing, known as Chargaff’s rules, is the basis for life: base-pairing is needed to make DNA double stranded, which gives an organism a built-in backup of genetic information and it is also the basis for transforming that information into proteins that form the bulk of a cell.
Nucleic acids, the long polymers of nucleotides, exist in either single or double stranded forms in vitro. However, in the cell, most RNA is single-stranded, and most DNA is double-stranded. This difference is important to their function: RNA is a temporary information transfer molecule for a particular gene, DNA is the permanent repository of all genetic information needed to make an organism. Therefore, RNA needs to be easily read, meaning that the bases need to be accessible, and not locked to a complementary strand. Its long-term stability is not particularly important because when it is made, usually many copies are made at the time, and it is only needed while the cell needs to make the protein it encodes. Conversely, the same strand of DNA is read over and over to make the RNA, and since there are only two copies of each chromosome (a chromosome is a single double-stranded DNA molecule) in a cell, the ability to maintain the integrity of the DNA is crucial. Because of base pairing, each strand of DNA contains all the information necessary to make a complete exact copy of its complementary strand.
Of course, the point of the genetic information in DNA is to encode the production of proteins that can then carry out the functions that define cellular life. Some of those functions, such as DNA replication, gene regulation, transcription, and translation, require the proteins to interact with a nucleic acid. Usually, part of the recognition process involves apposition of a positively charged region of the protein to the DNA (or RNA), which is a very negatively charged molecule, as expected from all the phosphates in the sugar-phosphate backbone. RNA, but not DNA (with some exceptions), can also interact with itself by complementary base-pairing. If a stretch of RNA sequence comes into contact with a stretch of RNA with a complementary sequence on the same molecule, then base-pairing can occur. Depending on the number of nucleotides between the complementary areas, secondary structures such as stem-and-loops and hairpins can form.