Skip to main content
Biology LibreTexts

2.6.1: H1. Protein Aggregation

  • Page ID
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    We have seen that protein aggregates complicate the lives of people who study protein folding in vitro and who try to express human proteins in prokaryotes like E. Coli in vivo. Instead of viewing these aggregates as junk, some now study them avidly. It turns out that these aggregates are not as non-specific as earlier believed. In addition, an understanding of how and when they form will give us clues into the etiology and treatment of some of the most debilitating and feared diseases. Much of this review is based on the following reference: Taubes, G. Misfolding the Way to Disease, G. Science, 271, 1493-1495 (1996)

    Clues Showing the Specificity of Aggregate Formation

    • 1970's: It was shown that chymotrypsinogen could not be folded in vitro without aggregates forming. An intermediate was presumed to have formed that if present in high concentration would aggregate irreversibly instead of fold to the native state. Refolding of tryptophanase showed that it aggregated only with itself, suggesting specificity.
    • 1981: King, at MIT, found a single amino acid folding mutant in a viral protein. Both the normal and mutant viral protein unfold at high temperature, but only the mutant would aggregate at high temperature, suggesting that aggregation could be programmed into or out of a gene
    • mid 80's: The biotech industry, struggling to express growth hormone, found that a single amino acid change in bovine growth hormone completely prevented aggregation without affecting correct folding.

    This knowledge of protein folding and aggregation was soon turn toward understanding several diseases in which protein aggregates were observed which either initiated or were associated with disease. These protein aggregates were termed "amyloid deposits" and seemed to be associated and perhaps causative of several neurodegenerative diseases. The name amyloid was first used by a German pathologist, Rudolf Virchow, who in 1853 described waxy tissue deposits associated with eosinophils (a type of immune cell). These deposits seemed to resemble starch (made of amylose and amylopectin) so he termed them amyloid. All known amyloid deposits are, however, composed of protein, not starch. It now appears that these disease may be caused by improper protein folding and subsequent aggregation. Except in certain rare inherited diseases, the amyloid deposits are composed of normal proteins (often called wild-type proteins) which seems to polymerize into fibrils. Sometimes, in inherited conditions, or when mutations appear in a specific protein, the amyloid protein deposits consist of the mutant protein. The proteins in these deposited fibers are composed predominantly of beta sheets which are perpendicular to the fiber axis, while the polymerized protein usually has little beta sheet structure. Examples are given below:

    1. Familial amyloidotic polyneuropathy (FAP) - Affects 1/10,00 to 1/100,000 people. The protein involved is called transthyrein, which normally exists in blood as a tetramer formed by association of 4 identical monomers. In mildly acid condition in vitro, the equilibrium between tetramer and monomer is shifted to monomer, which can aggregate into fibrils. This aggregation could be promoted by possible transition to a molten globule (discussed previously with lactalbumin) like state. This has secondary structure but loosely packed tertiary structure with more exposed hydrophobes. If the concentration is high enough the molten globules aggregate. In people with the disease, mutations in the protein destabilize the tetramer, pushing the equilibrium to the monomer, which presumably increases molten globule formation and aggregation. Specifically, Val30Met and Leu55Pro mutations promote dissociation of the tetramer and formation of aggregates. Conversely, Thr119Met inhibits tetramer dissociation. The aggregates deposit in heart, lungs, kidney, etc, leading to death.
    2. Light Chain Amyloidosis; Light Chain Deposition Disease - The light chain protein is a normal component of circulating antibody molecules. Mutants in the light chain cause a destabilization of the native state to state similar to a molten globule, which then aggregates.
    3. Lysozyme amyloidosis - This protein, with extensive alpha-helix structure, is usually involved in carbohydrate catabolism. Two mutants, Asp67His and another, Ile56Thr (normal amino acid/number in sequence/mutant amino acid) destabliize the protein structure (as indicated by a decrease in the Tm of about 10 degrees C) to a molten-globule form, which probably aggregates to fibrils characterized by extensive beta structure.
    4. Alzheimers-This disease involves a defect in a protein normally found in the membrane of neurons. The protein, called beta-amyloid precursor protein (BAPP), is a transmembrane protein. A slightly truncated, soluble form is also found secreted from cells and is found in extracellular fluid (such as cerebrospinal fluid and blood). The normal function of these BAPP proteins is not yet clear. An endoprotease cleaves a small 40-42 amino acid fragment from this protein, forming the amyloid beta (Ab) protein. It is this protein or a mutant form of it which aggregates to form beta-sheet containing fibrils in Alzheimer's disease. Several mutations in different proteins have been linked to Alzheimer's, but they all seem to increase production or deposition or both of the amyloid beta protein. These deposited plaques are extracellular, and have been shown to cause neuronal damage. The are found in areas of the brain required for memory and cognition. The BAPP gene is found on chromosome 21, the same chromosome which is present in an extra copy (trisomy 21) in Downs Syndrome, whose symptoms include presenile dementia and amyloid plaques. Aggregate formation appears to be driven by increased expression of BAPP and hence amyloid beta protein. In addition, some mutants may serve to destabilize the amyloid beta protein, increasing its aggregation.


    1. Transmissible spongiform encephalopathies (TSEs) - Including scrapie in sheep, bovine spongiform encephalopathy (mad cow disease), and in humans Creutzfeld-Jacob Disease (CJD), Fatal Familial Insomnia (FFI),Gerstman-Straussler-Scheinker Syndrome, and Kuru (associated with cannibalism). In these fatal diseases, the brain, on autopsy, resembles a sponge with holes. In contrast to the diseases above, these diseases can be transmitted from one animal to another, but typically not between species. (However, consider the controversy with mad cow disease.) Also, the infectious agent can self-replicate in vivo. The logical conclusion is that a virus (slow-acting) is the causative agent. However, the infectious agent survives radiation, heat, chemical agents, and enzymes designed to kill viruses and their associated nucleic acids. Mathematical analyses suggested that the infectious agent in such diseases could be nothing more than a protein. Stanley B. Prusiner in the 80's isolated just such a protein which he named a prion, for proteinaceous infectious agent. Since then he and others have amassed substantial evidence to support his contention. In October 1997 he was awarded the Nobel Prize in Medicine.

    This page titled 2.6.1: H1. Protein Aggregation is shared under a CC BY-NC-SA license and was authored, remixed, and/or curated by Henry Jakubowski.

    • Was this article helpful?