3.02: Protein Structure and Function
Andrea Bierema
Learning Objectives
Students will be able to:
- Explain how proteins result in an organism’s traits.
- Explain the relationship between amino acids and proteins.
- Identify examples of proteins.
- Recognize that molecular structure determines molecular interactions and relates to the cellular functions of proteins.
- Describe how protein structure influences its function.
- Describe the relationship between mutation and evolution.
Overview
This chapter is titled “protein structure and function” because a protein’s structure heavily influences its function. The structure of a protein is determined by the chemical properties of its amino acids, and the sequence of those amino acids is encoded by a DNA sequence (a gene).
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-142
This figure illustrates the insulin protein: part of its DNA sequence, part of its amino acid sequence, a representation of the protein, what the protein does, and the trait it causes. Hover over each image to learn more.
Traits
A trait is a specific characteristic of an organism, such as eye color or blood type. Traits can be determined by genes or the environment, or more commonly by interactions between them. The genetic contribution (i.e., the DNA) to a trait is called the genotype. The outward expression of the genotype, including visible and physiological traits, is called the phenotype.
Proteins
Proteins are coded and regulated by genes. These proteins, along with the environment, cause an organism’s traits.
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Each cell in a living system may contain thousands of proteins, each with a unique function. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence (also referred to as a “peptide”).
Protein types and functions:
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-92
Monomers and Polymers
Monomers are molecules that can bind into long chains called “polymers.” In other words, a polymer (“poly” meaning “many”) is made of monomers (“mono” meaning “one”).
Amino acids are the monomers that make up polypeptides (polypeptides being the polymers). A polypeptide folds into a 3D structure called a protein. Scientists use the name “amino acid” because these molecules contain both an amino group and a carboxyl (acid) group in their basic structure. There are 20 common amino acids present in proteins. Nine of these are essential amino acids in humans because the human body cannot produce them, so we must obtain them from our diet. Below are two illustrations depicting the relationship between amino acids and polypeptides.
Protein Structure
As mentioned above, a protein’s shape is critical to its function. For example, an enzyme can bind to a specific substrate at an active site. If this active site is altered because of local changes or changes in overall protein structure, the enzyme may be unable to bind to the substrate. To understand how the protein gets its final shape or conformation, we need to understand the four levels of protein structure: primary, secondary, tertiary, and quaternary. See the image below and click on the information hotspots (labeled with an “i”) for explanations.
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-93
As seen in the image above, a strand of amino acids folds on itself, creating a unique shape in the tertiary structure of the protein. The chemical properties of the amino acids determine how this shape forms. For instance, each amino acid is negatively (-), positively (+), or neutrally (N) charged. Negatively charged amino acids bind with positively charged amino acids (neutrally charged amino acids are not affected). Also, the amino acid cysteine contains sulfur, and sulfur atoms readily bind with each other, creating a “disulfide bond.” Because of this, cysteines bind with other cysteines. See the table below for a list of all 20 amino acids and their charges. Other properties, such as an amino acid’s polarity, also influence a protein’s shape. Note that these bonds are not as strong as the peptide bonds that link amino acids together in the chain, but they are strong enough to hold the protein’s shape.
A list of the 20 amino acids common in all living things. The table includes the full name and abbreviations of each amino acid as well as their charge (positive, negative, or neutral). It is also noted which one can create a disulfide bond.
Amino Acid | 3-Letter Abbrev. | 1-Letter Abbrev. | Charge | Disulfide Bond Formation? |
---|---|---|---|---|
Alanine | Ala | A | Neutral | |
Arginine | Arg | R | Positive (+) | |
Asparagine | Asn | N | Neutral | |
Aspartate (Aspartic acid) | Asp | D | Negative (-) | |
Cysteine | Cys | C | Neutral | Yes |
Glutamine | Gln | Q | Neutral | |
Glutamate (Glutamic acid) | Glu | E | Negative (-) | |
Glycine | Gly | G | Neutral | |
Histidine | His | H | Positive (+) | |
Isoleucine | Ile | I | Neutral | |
Leucine | Leu | L | Neutral | |
Lysine | Lys | K | Positive (+) | |
Methionine | Met | M | Neutral | |
Phenylalanine | Phe | F | Neutral | |
Proline | Pro | P | Neutral | |
Serine | Ser | S | Neutral | |
Threonine | Thr | T | Neutral | |
Tryptophan | Trp | W | Neutral | |
Tyrosine | Tyr | Y | Neutral | |
Valine | Val | V | Neutral | |
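The pairing rules described above (opposite charges attract, and cysteines can form disulfide bonds with each other) can be written as a small lookup. The following is only a minimal sketch of this chapter's simplified model: the function name `possible_interaction` and the set names are made up for illustration, and real protein folding also depends on polarity and other properties that are not modeled here.

```python
# Charge groups copied from the table above (3-letter abbreviations).
POSITIVE = {"Arg", "His", "Lys"}
NEGATIVE = {"Asp", "Glu"}
# The only amino acid the table marks as able to form a disulfide bond.
DISULFIDE = {"Cys"}


def possible_interaction(first, second):
    """Return the kind of bond two amino acids may form, or None.

    Hypothetical helper for illustration; it encodes only the two
    rules given in this chapter.
    """
    if first in DISULFIDE and second in DISULFIDE:
        return "disulfide bond"
    opposite_charges = (
        (first in POSITIVE and second in NEGATIVE)
        or (first in NEGATIVE and second in POSITIVE)
    )
    if opposite_charges:
        return "charge attraction"
    # Neutral amino acids, or two amino acids with the same charge.
    return None


print(possible_interaction("Lys", "Asp"))  # positive + negative
print(possible_interaction("Cys", "Cys"))  # two cysteines
print(possible_interaction("Ala", "Val"))  # two neutral amino acids
```

Under this model, two positively charged amino acids (for example, lysine and arginine) return `None`, just as two neutral ones do.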
Exercise
Use the chart above to determine which amino acids may bond together to form the tertiary structure.
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-94
Example
Here is an example of a polypeptide model depicting how charges influence the tertiary structure. The first and second images are the same, except the second image has hotspots with additional information marked with a question mark (?). The key at the bottom of the image is necessary for interpreting the image.
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-95
Mutations
Mutations can impact protein synthesis and amino acid sequence. If these mutations are heritable, then they may influence the evolution of a species. Therefore, this chapter includes information on mutations and evolution.
What Are Mutations?
A mutation is a change in DNA, the hereditary material of life. An organism’s DNA codes for the production of proteins, which affect how the organism looks, how it behaves, and its physiology; in short, all aspects of its life. So, a change in an organism’s DNA can cause changes in all aspects of its life.
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-140
The gene encoding a protein ultimately determines that protein’s unique amino acid sequence. A change in the nucleotide sequence of the gene’s coding region may lead to a different amino acid being added to the growing polypeptide chain, causing a change in protein structure and function. In sickle cell anemia, the hemoglobin β chain has a single amino acid substitution: valine replaces glutamic acid. What is most remarkable to consider is that a hemoglobin molecule is composed of two alpha and two beta chains that each consist of about 150 amino acids. The molecule, therefore, has about 600 amino acids. The structural difference between a normal hemoglobin molecule and a sickle cell molecule, which dramatically decreases life expectancy, is a single amino acid out of the total 600. What is even more remarkable is that each of those 600 amino acids is encoded by three nucleotides, so a single base change (point mutation), 1 in 1800 bases, causes the substitution.
This change to one amino acid in the chain causes hemoglobin molecules to form long fibers that distort the biconcave, or disc-shaped, red blood cells and cause them to assume a crescent, or “sickle,” shape that clogs blood vessels. This can lead to a myriad of serious health problems, such as breathlessness, dizziness, headaches, and abdominal pain, for those affected by this disease.
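The arithmetic in the sickle cell example above can be checked with a short sketch. The chain count and the "about 150" figure come from the text and are approximations. The specific codons are not given in this chapter: the sketch assumes the commonly documented change from GAG (glutamic acid) to GTG (valine), and the helper `point_mutate` is a made-up name for illustration.

```python
# Approximate figures taken from the text above.
CHAINS = 4                # two alpha chains + two beta chains
RESIDUES_PER_CHAIN = 150  # "about 150" amino acids per chain
BASES_PER_CODON = 3       # three nucleotides encode one amino acid

total_residues = CHAINS * RESIDUES_PER_CHAIN
total_bases = total_residues * BASES_PER_CODON

# Minimal lookup holding only the two codons needed here.
# Assumption: GAG -> GTG is the base change (not stated in this chapter).
CODON_TO_AMINO_ACID = {"GAG": "Glu", "GTG": "Val"}


def point_mutate(codon, position, new_base):
    """Return a copy of codon with one base replaced (0-based position)."""
    return codon[:position] + new_base + codon[position + 1:]


normal_codon = "GAG"
sickle_codon = point_mutate(normal_codon, 1, "T")

print(f"{total_residues} amino acids, {total_bases} coding bases")
print(
    f"{normal_codon} ({CODON_TO_AMINO_ACID[normal_codon]}) -> "
    f"{sickle_codon} ({CODON_TO_AMINO_ACID[sickle_codon]})"
)
```

Changing the middle base of one codon swaps glutamic acid for valine, which is the single substitution among roughly 600 amino acids described above.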
The Causes of Mutations
Mutations happen for several reasons:
- DNA fails to copy accurately: Most of the mutations that we think matter to evolution are “naturally occurring.” For example, when a cell divides, it makes a copy of its DNA and sometimes that copy is not quite perfect. That small difference from the original DNA sequence is a mutation.
- External influences can create mutations: Mutations can also be caused by exposure to specific chemicals or radiation. These agents cause the DNA to break down. This is not necessarily unnatural—even in the most isolated and pristine environments, DNA breaks down. Nevertheless, when the cell repairs the DNA, it might not do a perfect job of the repair. So, the cell would end up with DNA slightly different than the original DNA and hence, a mutation.
Evolution
Biological evolution, simply put, is descent with modification. This definition encompasses small-scale evolution (changes in gene—or, more precisely and technically, allele—frequency in a population from one generation to the next) and large-scale evolution (the descent of different species from a common ancestor over many generations). Evolution is responsible for both the remarkable similarities we see across all life and the amazing diversity of that life, but how does it work?
For evolutionary mechanisms (such as natural selection) to act, there needs to be genetic variation, which arises from mutations, or changes in the DNA. Because DNA codes for proteins, mutations can alter the proteins that are produced and so create variation. Mutations can be beneficial, neutral, or harmful for the organism, but mutations do not “try” to supply what the organism “needs.” In this respect, mutations are random: whether a particular mutation happens or not is unrelated to how useful that mutation would be.
Because all cells in our body contain DNA, there are lots of places for mutations to occur; however, not all mutations matter for evolution. Somatic mutations occur in non-reproductive cells and won’t be passed onto offspring. The mutations that matter for evolution are germline mutations, which occur in reproductive cells (such as eggs and sperm) and can be passed on to offspring.
A single germline mutation can have a range of effects:
- No change occurs in phenotype: Some mutations don’t have any noticeable effect on the phenotype of an organism. This can happen in many situations: perhaps the mutation occurs in a stretch of DNA with no function, or perhaps the mutation occurs in a protein-coding region but ends up not affecting the amino acid sequence of the protein.
- Small change occurs in phenotype: A single mutation caused this cat’s ears to curl backward slightly.
- Big change occurs in phenotype: Some really important phenotypic changes, like DDT resistance in insects, are sometimes caused by single mutations. A single mutation can also have strong negative effects for the organism. Mutations that cause the death of an organism are called lethals—and it doesn’t get more negative than that.
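The first case in the list above, a mutation in a protein-coding region that does not affect the amino acid sequence, is possible because several codons can code for the same amino acid. The sketch below builds the standard genetic code and compares the amino acid before and after a single-codon change. The labels "silent," "missense," and "nonsense" are standard genetics terms that this chapter does not use, and the function name `classify_point_mutation` is hypothetical.

```python
from itertools import product

# Standard genetic code, listed in T, C, A, G order for each codon position.
# "*" marks a stop codon; amino acids use 1-letter abbreviations.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_point_mutation(original_codon, mutated_codon):
    """Classify a codon change by its effect on the encoded amino acid.

    Assumes original_codon codes for an amino acid (is not a stop codon).
    """
    before = CODON_TABLE[original_codon]
    after = CODON_TABLE[mutated_codon]
    if before == after:
        return "silent"    # same amino acid: protein sequence unchanged
    if after == "*":
        return "nonsense"  # new stop codon: protein is cut short
    return "missense"      # a different amino acid is inserted


print(classify_point_mutation("GAA", "GAG"))  # both code for glutamate (E)
print(classify_point_mutation("GAG", "GTG"))  # glutamate (E) to valine (V)
```

Whether a missense change produces a small or a big change in phenotype depends on where it falls in the protein, which this sketch does not attempt to predict.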
An interactive H5P element has been excluded from this version of the text. You can view it online here:
https://openbooks.lib.msu.edu/isb202/?p=104#h5p-96
There are some sorts of changes that a single mutation, or even a lot of mutations, could not cause. Neither mutations nor wishful thinking will make pigs have wings; only pop culture could have created the Teenage Mutant Ninja Turtles—mutations could not have done it.
See the “Evolution” chapter in this textbook for more information.
Attributions
This chapter is a modified derivative of the following articles:
“Biological Molecules” by OpenStax College, Biology, CC BY 4.0. Download the original article for free at https://openstax.org/books/biology-2e/pages/3-4-proteins
“Trait” by National Human Genome Research Institute, National Institutes of Health, Talking Glossary of Genetic Terms.
Understanding Evolution. 2020. University of California Museum of Paleontology. 16 July 2020 <http://evolution.berkeley.edu/>. Published with permission.