Biology LibreTexts

8.11: Protein Cleavage


    Because of their large size, intact proteins can be difficult to study using analytical techniques, such as mass spectrometry. Consequently, it is often desirable to break a large polypeptide down into smaller pieces. Proteases are enzymes that typically break peptide bonds by binding to specific amino acid sequences in a protein and catalyzing their hydrolysis.

    Chemical reagents can also be used to cut larger proteins into smaller peptides. Cyanogen bromide, for example, cleaves peptide bonds on the C-terminal side of methionine residues. Common proteases, several of which are found in the digestive system, are shown below with their cleavage sites, along with cyanogen bromide.

    • Subtilisin - C-terminal side of large uncharged side chains
    • Chymotrypsin - C-terminal side of aromatics (Phe, Tyr, Trp)
    • Trypsin - C-terminal side of lysine and arginine (not when followed by proline)
    • Carboxypeptidase - N-terminal side of the C-terminal amino acid
    • Elastase - C-terminal side of small amino acids (Gly, Ala)
    • Cyanogen bromide (chemical) - C-terminal side of Met
    Figure 8.45 - Protease cleavage sites on a polypeptide
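
    The cleavage rules listed above can be turned into a simple in-silico digest. The following is a minimal sketch, not a reference implementation: the function name `digest` and the rule table `CLEAVE_AFTER` are hypothetical, the rules are the simplified ones from the list, and the digest is assumed to be complete (no missed cleavages). Subtilisin and carboxypeptidase are left out because their rules do not reduce to "cut after residue X".

```python
# Simplified rules from the list above: each agent cuts the peptide bond on the
# C-terminal side of (that is, directly after) the residues in its set.
# One-letter amino acid codes are used.
CLEAVE_AFTER = {
    "trypsin": set("KR"),
    "chymotrypsin": set("FYW"),
    "elastase": set("GA"),
    "cnbr": set("M"),  # cyanogen bromide, a chemical reagent rather than an enzyme
}


def digest(sequence, agent):
    """Return the peptides produced by a complete digest of `sequence`."""
    sites = CLEAVE_AFTER[agent]
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        is_last = i == len(sequence) - 1
        if residue not in sites or is_last:
            continue
        # Trypsin does not cut when the next residue is proline.
        if agent == "trypsin" and sequence[i + 1] == "P":
            continue
        peptides.append(sequence[start:i + 1])
        start = i + 1
    peptides.append(sequence[start:])  # whatever remains after the last cut
    return peptides
```

    For the made-up sequence `MKWVTFRPLLAG`, `digest(seq, "trypsin")` cuts after the lysine but not after the arginine, because that arginine is followed by proline. Digesting the same protein with two different agents gives overlapping peptide sets, which is what makes it possible to order the pieces later.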

    Determining mass and protein sequence

    Mass spectrometry, as its name suggests, is a method that can be used to determine the masses of molecules. Once limited to analyzing small molecules, it has since been adapted and improved to allow the analysis of biologically important molecules like proteins and nucleic acids. Mass spectrometers use an electrical field to accelerate an ionized molecule toward a detector. The time taken by an ionized molecule to move from its point of ionization to the detector will depend on both its mass and its charge and is termed its time of flight (TOF).


    Figure 8.46 - A desktop MALDI-TOF system

    MALDI-TOF (Matrix-assisted Laser Desorption Ionization - Time of Flight) is an analytical technique allowing one to determine the molecular masses of biologically relevant molecules with great precision. It is commonly used in proteomics and determination of masses of large biomolecules, including nucleic acids. The development of MALDI, which permits the production of ionic forms of relatively large molecules, was crucial to the successful use of mass spectrometry of biomolecules. Figure 8.46 shows a compact MALDI-TOF system.

    The MALDI-TOF process involves three basic steps. First, the material to be analyzed is embedded in a solid support material (the matrix) that can be volatilized in a vacuum chamber by a laser beam. Second, a laser focused on the matrix volatilizes the sample, causing the molecules within it to vaporize and, in the process, to form ions by either gaining or losing protons. Third, the ions thus created are accelerated by an electric field towards a detector. Their speed towards the detector is a function of their mass-to-charge ratio (m/z): every ion gains kinetic energy in proportion to its charge, so its speed is inversely proportional to the square root of m/z. An ion with a mass of 100 and a charge of +1 will therefore move about 1.4 times (√2) as fast as an ion with a mass of 200 and a charge of +1, and at the same speed as an ion with a mass of 200 and a charge of +2. Thus, by precisely measuring the time it takes for an ion to travel from ionization (time zero of the laser treatment) to the detector, the mass-to-charge ratio of every ion in a sample can be readily determined.
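
    The relationship between flight time and m/z can be made concrete with a short calculation. The sketch below assumes an idealized linear TOF analyzer in which an ion of charge z is accelerated through a voltage V, gains kinetic energy zeV = ½mv², and then drifts a length L at constant speed. The 20 kV voltage and 1 m drift length are illustrative assumptions, not values from the text, and the function names are hypothetical. Real instruments also have to account for initial ion velocities and the time spent in the acceleration region.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, in coulombs
DALTON = 1.66053906660e-27   # unified atomic mass unit, in kilograms


def flight_time(mass_da, charge, voltage=20_000.0, drift_length=1.0):
    """Flight time in seconds for an ion in an ideal linear TOF analyzer."""
    kinetic_energy = charge * E_CHARGE * voltage             # z*e*V, in joules
    velocity = math.sqrt(2 * kinetic_energy / (mass_da * DALTON))
    return drift_length / velocity


def mz_from_time(t, voltage=20_000.0, drift_length=1.0):
    """Invert flight_time: recover m/z (Da per elementary charge) from t."""
    return 2 * E_CHARGE * voltage * (t / drift_length) ** 2 / DALTON
```

    With these definitions, `flight_time(200, 1) / flight_time(100, 1)` comes out as √2, and `flight_time(200, 2)` equals `flight_time(100, 1)`, since only the ratio m/z enters the formula. This is also why a TOF measurement alone cannot distinguish a singly charged ion from a doubly charged ion of twice the mass.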

    Ionization may destabilize larger molecules, which then fragment into smaller ones in the MALDI-TOF detection chamber. The masses of the sub-fragments of a larger molecule allow one to determine its identity if this is not previously known. This fragmentation can be intentionally enhanced by having the accelerated ions collide with an inert gas, like argon.

    Fragmentation of a molecule may also be carried out prior to analysis, for example by cleaving a protein into smaller peptides with enzymes or chemical agents. The amino acid sequence of a protein may be determined with MALDI-TOF by analyzing the precise molecular masses of the many short peptide fragments obtained from it. When a single amino acid is lost from a larger peptide, for example, this can be detected as the difference in mass between the fragments with and without that amino acid, since each amino acid has a characteristic molecular mass (leucine and isoleucine, which are isomers with identical masses, are the exception). By peptide mass fingerprinting and analysis of smaller fragments of individual peptides, the entire sequence of a polypeptide can thus be determined.
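
    The idea of reading a sequence from mass differences can be sketched in a few lines. The example below assumes a clean "ladder" of fragment masses in which each fragment differs from the next by exactly one residue. The table holds standard monoisotopic residue masses (amino acid minus water). The function name `read_ladder` and the 0.01 Da tolerance are hypothetical choices for illustration, and real spectra require handling of noise, ion types, and missing peaks.

```python
# Monoisotopic residue masses in Da (amino acid minus one water).
# Leu and Ile are isomers with the same mass, so they share one entry.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L/I": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def read_ladder(fragment_masses, tolerance=0.01):
    """Name the residue separating each consecutive pair of fragment masses.

    Differences that match no residue within `tolerance` are reported as "?".
    """
    ordered = sorted(fragment_masses)
    letters = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        delta = heavier - lighter
        # Pick the residue whose mass is closest to the observed difference.
        name, mass = min(RESIDUE_MASS.items(), key=lambda kv: abs(kv[1] - delta))
        letters.append(name if abs(mass - delta) <= tolerance else "?")
    return letters
```

    Note that glutamine and lysine differ by only about 0.036 Da, so telling them apart depends on the mass accuracy of the instrument; with a looser tolerance the two become indistinguishable by mass alone.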

    This page titled 8.11: Protein Cleavage is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Kevin Ahern, Indira Rajagopal, & Taralyn Tan via source content that was edited to the style and standards of the LibreTexts platform.