8.2: Storing Genetic Information
- Page ID
- 35708
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)What you’ll learn to do: Explain how DNA stores genetic information
The unique structure of DNA is key to its ability to store and replicated genetic information:
In this outcome, you will learn to describe the double helix structure of DNA: its sugar-phosphate backbone ladder with nitrogenous base “rungs” of ladder.
- Diagram the structure of DNA
- Relate the structure of DNA to the storage of genetic information
Structure of DNA
The building blocks of DNA are nucleotides. The important components of each nucleotide are a nitrogenous base, deoxyribose (5-carbon sugar), and a phosphate group (see Figure 2). Each nucleotide is named depending on its nitrogenous base. The nitrogenous base can be a purine, such as adenine (A) and guanine (G), or a pyrimidine, such as cytosine (C) and thymine (T). Uracil (U) is also a pyrimidine (as seen in Figure 2), but it only occurs in RNA, which we will talk more about later.
The nucleotides combine with each other by covalent bonds known as phosphodiester bonds or linkages. The phosphate residue is attached to the hydroxyl group of the 5′ carbon of one sugar of one nucleotide and the hydroxyl group of the 3′ carbon of the sugar of the next nucleotide, thereby forming a 5′-3′ phosphodiester bond.
In the 1950s, Francis Crick and James Watson worked together to determine the structure of DNA at the University of Cambridge, England. Other scientists like Linus Pauling and Maurice Wilkins were also actively exploring this field. Pauling had discovered the secondary structure of proteins using X-ray crystallography. In Wilkins’ lab, researcher Rosalind Franklin was using X-ray diffraction methods to understand the structure of DNA. Watson and Crick were able to piece together the puzzle of the DNA molecule on the basis of Franklin’s data because Crick had also studied X-ray diffraction (Figure 3). In 1962, James Watson, Francis Crick, and Maurice Wilkins were awarded the Nobel Prize in Medicine. Unfortunately, by then Franklin had died, and Nobel prizes are not awarded posthumously.
Watson and Crick proposed that DNA is made up of two strands that are twisted around each other to form a right-handed helix. Base pairing takes place between a purine and pyrimidine; namely, A pairs with T and G pairs with C. Adenine and thymine are complementary base pairs, and cytosine and guanine are also complementary base pairs. The base pairs are stabilized by hydrogen bonds; adenine and thymine form two hydrogen bonds and cytosine and guanine form three hydrogen bonds. The two strands are anti-parallel in nature; that is, the 3′ end of one strand faces the 5′ end of the other strand. The sugar and phosphate of the nucleotides form the backbone of the structure, whereas the nitrogenous bases are stacked inside. Each base pair is separated from the other base pair by a distance of 0.34 nm, and each turn of the helix measures 3.4 nm. Therefore, ten base pairs are present per turn of the helix. The diameter of the DNA double helix is 2 nm, and it is uniform throughout. Only the pairing between a purine and pyrimidine can explain the uniform diameter. The twisting of the two strands around each other results in the formation of uniformly spaced major and minor grooves (Figure 4).
Genetic Information
The genetic information of an organism is stored in DNA molecules. How can one kind of molecule contain all the instructions for making complicated living beings like ourselves? What component or feature of DNA can contain this information? It has to come from the nitrogen bases, because, as you already know, the backbone of all DNA molecules is the same. But there are only four bases found in DNA: G, A, C, and T. The sequence of these four bases can provide all the instructions needed to build any living organism. It might be hard to imagine that 4 different “letters” can communicate so much information. But think about the English language, which can represent a huge amount of information using just 26 letters. Even more profound is the binary code used to write computer programs. This code contains only ones and zeros, and think of all the things your computer can do. The DNA alphabet can encode very complex instructions using just four letters, though the messages end up being really long. For example, the E. coli bacterium carries its genetic instructions in a DNA molecule that contains more than five million nucleotides. The human genome (all the DNA of an organism) consists of around three billion nucleotides divided up between 23 paired DNA molecules, or chromosomes.
The information stored in the order of bases is organized into genes: each gene contains information for making a functional product. The genetic information is first copied to another nucleic acid polymer, RNA (ribonucleic acid), preserving the order of the nucleotide bases. Genes that contain instructions for making proteins are converted to messenger RNA (mRNA). Some specialized genes contain instructions for making functional RNA molecules that don’t make proteins. These RNA molecules function by affecting cellular processes directly; for example some of these RNA molecules regulate the expression of mRNA. Other genes produce RNA molecules that are required for protein synthesis, transfer RNA (tRNA), and ribosomal RNA (rRNA).
In order for DNA to function effectively at storing information, two key processes are required. First, information stored in the DNA molecule must be copied, with minimal errors, every time a cell divides. This ensures that both daughter cells inherit the complete set of genetic information from the parent cell. Second, the information stored in the DNA molecule must be translated, or expressed. In order for the stored information to be useful, cells must be able to access the instructions for making specific proteins, so the correct proteins are made in the right place at the right time.
Both copying and reading the information stored in DNA relies on base pairing between two nucleic acid polymer strands. Recall that DNA structure is a double helix (see Figure 5).
The sugar deoxyribose with the phosphate group forms the scaffold or backbone of the molecule (highlighted in yellow in Figure 5). Bases point inward. Complementary bases form hydrogen bonds with each other within the double helix. See how the bigger bases (purines) pair with the smaller ones (pyrimidines). This keeps the width of the double helix constant. More specifically, A pairs with T and C pairs with G. As we discuss the function of DNA in subsequent sections, keep in mind that there is a chemical reason for specific pairing of bases.
To illustrate the connection between information in DNA and an observable characteristic of an organism, let’s consider a gene that provides the instructions for building the hormone insulin. Insulin is responsible for regulating blood sugar levels. The insulin gene contains instructions for assembling the protein insulin from individual amino acids. Changing the sequence of nucleotides in the DNA molecule can change the amino acids in the final protein, leading to protein malfunction. If insulin does not function correctly, it might be unable to bind to another protein (insulin receptor). On the organismal level of organization, this molecular event (change of DNA sequence) can lead to a disease state—in this case, diabetes.
The order of nucleotides in a gene (in DNA) is the key to how information is stored. For example, consider these two words: stable and tables. Both words are built from the same letters (subunits), but the different order of these subunits results in very different meanings. In DNA, the information is stored in units of 3 letters. Use the following key to decode the encrypted message. This should help you to see how information can be stored in the linear order of nucleotides in DNA.
ABC = a | DEF = d | GHI = e | JKL = f |
MNO = h | PQR = i | STU = m | VWX = n |
YZA = o | BCD = r | EFG = s | HIJ = t |
KLM = w | NOP = j | QRS = p | TUV = y |
Encrypted Message: HIJMNOPQREFG – PQREFG – MNOYZAKLM – DEFVWXABC – EFGHIJYZABCDGHIEFG – PQRVWXJKLYZABCDSTUABCHIJPQRYZAVWX
- Show Answer
-
This is how DNA stores information.
Where in the DNA is information stored?
- The shape of the DNA
- The sugar-phosphate backbone
- The sequence of bases
- The presence of two strands.
- Show Answer
-
Answer c. The sequence of the bases codes for the instructions for protein synthesis. The shape is DNA is not related to information storage. The sugar-phosphate backbone only acts as a scaffold. The presence of two strands is important for replication, but their information content is equivalent, as they are complementary to each other.
Which statement is correct?
- The sequence of DNA bases is arranged into chromosomes, most of which contain the instructions to build an amino acid.
- The sequence of DNA strands is arranged into chromosomes, most of which contain the instructions to build a protein.
- The sequence of DNA bases is arranged into genes, most of which contain the instructions to build a protein.
- The sequence of DNA phosphates is arranged into genes, most of which contain the instructions to build a cell.
- Show Answer
-
Answer c. The sequence of DNA bases is arranged into genes, most of which contain the instructions to build a protein. DNA stores information in the sequence of its bases. The information is grouped into genes. Protein is what is mainly coded.
Check Your Understanding
Answer the question(s) below to see how well you understand the topics covered in the previous section. This short quiz does not count toward your grade in the class, and you can retake it an unlimited number of times.
Use this quiz to check your understanding and decide whether to (1) study the previous section further or (2) move on to the next section.
Contributors and Attributions
- Introduction to Storing Genetic Information. Authored by: Shelli Carter and Lumen Learning. Provided by: Lumen Learning. License: CC BY: Attribution
- Nucleic Acid Structure (a derivative from the original work). Authored by: Stephen Snyder. License: CC BY: Attribution
- Biology. Provided by: OpenStax CNX. Located at: http://cnx.org/contents/185cbf87-c72e-48f5-b51e-f14f21b5eabd@10.8. License: CC BY: Attribution. License Terms: Download for free at http://cnx.org/contents/185cbf87-c72...f21b5eabd@10.8
- Information Content of DNA. Provided by: Open Learning Initiative. Located at: https://oli.cmu.edu/jcourse/workbook/activity/page?context=434a5f7d80020ca600a650db8880db33. Project: Introduction to Biology (Open + Free). License: CC BY-NC-SA: Attribution-NonCommercial-ShareAlike
- DNA Simple. Authored by: Forluvoft. Located at: https://commons.wikimedia.org/wiki/File:DNA_simple2.svg. License: Public Domain: No Known Copyright