8.3: DNA Structure

Last updated
Save as PDF

Page ID: 88940

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\id}{\mathrm{id}}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\kernel}{\mathrm{null}\,}\)

\( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\)

\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\)

\( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vectorC}[1]{\textbf{#1}} \)

\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

By 1878 a substance in the pus of wounded soldiers derived from cell nuclei (called nuclein) was shown to be composed of five bases (the familiar ones of DNA and RNA). The four bases known to make up DNA (as part of nucleotides) were thought to be connected through the phosphate groups, in short repeating chains of four nucleotides. By the 1940s, we knew that DNA was a long polymer. Nevertheless, it was still considered too simple to account for genes. After the Hershey and Chase experiments, only a few holdouts would not accept DNA as the genetic material. When the scientific community finally accepted DNA as the stuff of genes, the next questions were the following:

What did DNA look like?
How did its structure account for its ability to encode and reproduce life? While the four phosphate-linked nucleotide composition of DNA was known for some time, it became mandatory to explain how such a “simple molecule” could inform the thousands of proteins necessary for life. The answer to this question lay (at least in part) in an understanding of the physical structure of DNA, made possible by the advent of X-ray crystallography.

8.3.1. X-Ray Crystallography and the Beginnings of Molecular Biology

If a substance can be crystallized, the crystal will diffract X-rays at angles revealing regular (repeating) structures of the crystal. William Astbury demonstrated that high molecular-weight DNA had just such a regular structure. His crystallographs suggested DNA to be a linear polymer of stacked bases (nucleotides), each separated from the next by 0.34 nm. Astbury is also remembered for coining the term “molecular biology” to describe his studies. The term now covers all aspects of biomolecular structure, as well as molecular functions (e.g., replication, transcription, translation, and gene regulation).

In an irony of history, the Russian biologist Nikolai Konstantinovich Koltsov had already intuited in 1927 that the basis for the genetic transfer of traits would be a “giant hereditary molecule” made up of “two mirror strands that would replicate in a semiconservative fashion using each strand as a template” (The Remarkable Nikolai Koltsov - scroll down to read the biography). This was a pretty fantastic inference if you think about it, since it was proposed long before Watson, Crick, and their colleagues worked out the structure of the DNA double-helix!

8.3.2. Wilkins, Franklin, Watson, and Crick—DNA Structure Revealed

Maurice Wilkins, an English biochemist, was the first to isolate highly pure DNA with a high molecular weight. Working in Wilkins’s laboratory, Rosalind Franklin was able to crystalize this DNA and to produce very high-resolution X-ray diffraction images of the DNA crystals. Franklin’s most famous (and definitive) crystallograph was “Photo 51” (Figure 8.5).

Screen Shot 2022-05-19 at 12.02.25 PM.png — Figure 8.5: *Photo 51*, the X-ray crystallograph of DNA taken by Rosalind Franklin, revealed three crucial molecular dimensions that ultimately led to the double-helical DNA model.

This image confirmed Astbury’s 0.34 nm repeated dimension and revealed two more numbers, 3.4 nm and 2 nm, reflecting additional repeat structures in the DNA crystal. When James Watson and Francis Crick got hold of these numbers, they used them along with other data to build DNA models out of nuts, bolts, and plumbing.

Their models eventually revealed DNA to be a pair of antiparallel, complementary strands of nucleic acid polymers—shades of Koltsov’s mirror-image macromolecules! Each strand is a string of nucleotides linked by phosphodiester linkages, the two strands held together in a double helix by complementary H-bond interactions. Let’s look at the evidence for these conclusions. As we do, refer to the two illustrations of the double helix in Figure 8.6.

Screen Shot 2022-05-19 at 12.04.14 PM.png — Figure 8.6: The three molecular dimensions from Franklin’s Photo 51 could be best explained by the proposition of a DNA double helix (left). Model-building by Watson and Crick revealed the antiparallel structure of the double helix (right).

Recalling that Astbury’s 0.34 nm dimension was the distance between successive nucleotides in a DNA strand, Watson and Crick surmised that the 3.4 nm repeat was a structurally meaningful tenfold multiple of Astbury’s number. When they began building their DNA models, they realized from the bond angles connecting the nucleotides that the strand was forming a helix, from which they concluded that the 3.4 nm repeat was the pitch of the helix (i.e., the length of one complete turn of the helix). This meant that there were ten bases per turn of the helix. They further reasoned that the 2.0 nm number might reflect the diameter of the helix. When their scale model of a single-stranded DNA helix predicted a helical diameter much less than 2.0 nm, they were able to model a double helix that more nearly met the requirement for a 2.0 nm diameter. In building their double helix, Watson and Crick realized that bases in opposing strands would come together to form H-bonds, holding the helices together. However, for their double helix to have a constant diameter of 2.0 nm, they also realized that the smaller pyrimidine bases, thymine (T) and cytosine (C), would have to H-bond to the larger purine bases, adenine (A) and guanine (G), and that neither A-G nor C-T would form pairs in the double helix.

Now to the question of how a “simple” DNA molecule could have the structural diversity needed to encode thousands of different polypeptides and proteins. In early studies, purified E. coli DNA was chemically hydrolyzed down to nucleotide monomers. The hydrolytic products contained nearly equal amounts of each base, reinforcing the notion that DNA was that simple molecule that could not encode genes. But Watson and Crick had private access to data from Erwin Chargaff, who had determined the base composition of DNA isolated from different species, including E. coli. He found that the base composition of DNA from different species was not always equimolar, meaning that for some species, the DNA was not composed of equal amounts of each of the four bases (see some of this data in Table 8.1).

Screen Shot 2022-05-19 at 12.07.55 PM.png

The mere fact that DNA from some species could have base compositions that deviated from equimolarity put to rest the argument that DNA had to be a very simple sequence. Finally, it was safe to accept the obvious, namely that DNA sequences could vary almost infinitely and could indeed be the stuff of genes.

Chargaff’s data also showed a unique pattern of base ratios. Although base compositions could vary between species, the A/T and G/C ratio was always 1, for every species; likewise, the ratio of (A+C)/(G+T) and (A+G)/(C+T). From this information, Watson and Crick inferred that A (a purine) would H-bond with T (a pyrimidine), and that G (a purine) would H-bond with C (a pyrimidine) in the double helix. When building their model with this new information, they also found H-bonding between the complementary bases would be maximal only if the two DNA strands were antiparallel, leading to the most stable structure of the double helix. Watson and Crick published their paper A Structure for Deoxyribose Nucleic Acid in 1953 in Nature (read an annotated version of this seminal article at The Classic 1953 Watson-Crick.

Their article is also famous for predicting a semiconservative mechanism of replication, something that had also been predicted by Koltsov twenty-six years earlier, albeit based on intuition—and much less evidence! Watson, Crick, and Wilkins shared a Nobel Prize in 1962 for their work on DNA structure. Unfortunately, Franklin died in 1958, and Nobel prizes were not awarded posthumously. She did eventually get a measure of delayed and well-deserved recognition, including a university in Chicago named in her honor! Nevertheless, there is still controversy about why Franklin did not get appropriate credit at the time for her role in the work.

169-2 Unraveling the Structure of DNA

8.3.3. Meselson and Stahl’s Experiment—Replication is Semiconservative

Confirmation of Watson and Crick’s suggestion of semiconservative replication came from Matthew Meselson and Franklin Stahl’s very elegant experiment, which tested the three possible models of replication (Figure 8.7)

Screen Shot 2022-05-19 at 4.05.11 PM.png — Figure 8.7: Three possible modes of DNA replication could be imagined.

In their experiment, E. coli cells were grown in a medium containing \({}^{15} \rm N\), a “heavy” nitrogen isotope. After many generations, the DNA in all of the cells had become labeled with the heavy isotope. At that point, the \({}^{15} \rm N\)-tagged cells were placed back into a medium containing the more common, “light” \({}^{14} \rm N\) isotope and were allowed to grow for exactly one generation. Figure 8.8 shows Meselson and Stahl’s predictions for their experiment. Meselson and Stahl knew that \({}^{14} \rm N\)-labeled and \({}^{15} \rm N\)-labeled DNA would form separate bands after centrifugation on CsCl (cesium chloride) density gradients.

Screen Shot 2022-05-19 at 4.07.09 PM.png — Figure 8.8: Meselson and Stahl predicted experimental outcomes based on possible modes of replication.

They tested their predictions by purifying and centrifuging the DNA from the \({}^{15} \rm N\)-labeled cells grown in the \({}^{14} \rm N\) medium for one generation. They found that this DNA formed a single band with a density between that of \({}^{15} \rm N\)-labeled DNA and \({}^{14} \rm N\)-labeled DNA, thereby eliminating the conservative model of DNA replication (possibility #1), as Watson and Crick had also predicted.

That left two possibilities: replication was either semiconservative (possibility #2) or dispersive (possibility #3). The dispersive model was eliminated when DNA isolated from cells that had been grown for a second generation on \({}^{14} \rm N\) were shown to contain two bands of DNA on the CsCl density gradients.

170 Replication is Semiconservative

Search

Text Color

Text Size

Margin Size

Font Type