# 2.3.1: C1. Main Chain Conformations

$$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$$

$$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$$

$$\newcommand{\id}{\mathrm{id}}$$ $$\newcommand{\Span}{\mathrm{span}}$$

( \newcommand{\kernel}{\mathrm{null}\,}\) $$\newcommand{\range}{\mathrm{range}\,}$$

$$\newcommand{\RealPart}{\mathrm{Re}}$$ $$\newcommand{\ImaginaryPart}{\mathrm{Im}}$$

$$\newcommand{\Argument}{\mathrm{Arg}}$$ $$\newcommand{\norm}[1]{\| #1 \|}$$

$$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$$

$$\newcommand{\Span}{\mathrm{span}}$$

$$\newcommand{\id}{\mathrm{id}}$$

$$\newcommand{\Span}{\mathrm{span}}$$

$$\newcommand{\kernel}{\mathrm{null}\,}$$

$$\newcommand{\range}{\mathrm{range}\,}$$

$$\newcommand{\RealPart}{\mathrm{Re}}$$

$$\newcommand{\ImaginaryPart}{\mathrm{Im}}$$

$$\newcommand{\Argument}{\mathrm{Arg}}$$

$$\newcommand{\norm}[1]{\| #1 \|}$$

$$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$$

$$\newcommand{\Span}{\mathrm{span}}$$ $$\newcommand{\AA}{\unicode[.8,0]{x212B}}$$

$$\newcommand{\vectorA}[1]{\vec{#1}} % arrow$$

$$\newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow$$

$$\newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$$

$$\newcommand{\vectorC}[1]{\textbf{#1}}$$

$$\newcommand{\vectorD}[1]{\overrightarrow{#1}}$$

$$\newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}$$

$$\newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}}$$

$$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$$

$$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$$

$$\newcommand{\avec}{\mathbf a}$$ $$\newcommand{\bvec}{\mathbf b}$$ $$\newcommand{\cvec}{\mathbf c}$$ $$\newcommand{\dvec}{\mathbf d}$$ $$\newcommand{\dtil}{\widetilde{\mathbf d}}$$ $$\newcommand{\evec}{\mathbf e}$$ $$\newcommand{\fvec}{\mathbf f}$$ $$\newcommand{\nvec}{\mathbf n}$$ $$\newcommand{\pvec}{\mathbf p}$$ $$\newcommand{\qvec}{\mathbf q}$$ $$\newcommand{\svec}{\mathbf s}$$ $$\newcommand{\tvec}{\mathbf t}$$ $$\newcommand{\uvec}{\mathbf u}$$ $$\newcommand{\vvec}{\mathbf v}$$ $$\newcommand{\wvec}{\mathbf w}$$ $$\newcommand{\xvec}{\mathbf x}$$ $$\newcommand{\yvec}{\mathbf y}$$ $$\newcommand{\zvec}{\mathbf z}$$ $$\newcommand{\rvec}{\mathbf r}$$ $$\newcommand{\mvec}{\mathbf m}$$ $$\newcommand{\zerovec}{\mathbf 0}$$ $$\newcommand{\onevec}{\mathbf 1}$$ $$\newcommand{\real}{\mathbb R}$$ $$\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}$$ $$\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}$$ $$\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}$$ $$\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}$$ $$\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$$ $$\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$$ $$\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$$ $$\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$$ $$\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}$$ $$\newcommand{\laspan}[1]{\text{Span}\{#1\}}$$ $$\newcommand{\bcal}{\cal B}$$ $$\newcommand{\ccal}{\cal C}$$ $$\newcommand{\scal}{\cal S}$$ $$\newcommand{\wcal}{\cal W}$$ $$\newcommand{\ecal}{\cal E}$$ $$\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}$$ $$\newcommand{\gray}[1]{\color{gray}{#1}}$$ $$\newcommand{\lgray}[1]{\color{lightgray}{#1}}$$ $$\newcommand{\rank}{\operatorname{rank}}$$ $$\newcommand{\row}{\text{Row}}$$ $$\newcommand{\col}{\text{Col}}$$ $$\renewcommand{\row}{\text{Row}}$$ $$\newcommand{\nul}{\text{Nul}}$$ $$\newcommand{\var}{\text{Var}}$$ $$\newcommand{\corr}{\text{corr}}$$ $$\newcommand{\len}[1]{\left|#1\right|}$$ $$\newcommand{\bbar}{\overline{\bvec}}$$ $$\newcommand{\bhat}{\widehat{\bvec}}$$ $$\newcommand{\bperp}{\bvec^\perp}$$ $$\newcommand{\xhat}{\widehat{\xvec}}$$ $$\newcommand{\vhat}{\widehat{\vvec}}$$ $$\newcommand{\uhat}{\widehat{\uvec}}$$ $$\newcommand{\what}{\widehat{\wvec}}$$ $$\newcommand{\Sighat}{\widehat{\Sigma}}$$ $$\newcommand{\lt}{<}$$ $$\newcommand{\gt}{>}$$ $$\newcommand{\amp}{&}$$ $$\definecolor{fillinmathshade}{gray}{0.9}$$

In contrast to micelles and bilayers, which are composed of aggregates of single and double chain amphiphiles, proteins are covalent polymers of 20 different amino acids, which fold, to a first approximation, in a thermodynamically spontaneous process into a single unique conformation, theoretically at a global energy minimum. This chapter section will investigate the possible conformations available to proteins, just as we studied the conformations of free fatty acids and acyl chains in lipid aggregates. The next chapter section will discuss the actual processes of folding and of unfolding (denaturation), both in vitro and in vivo. Then we will discuss the thermodynamics and intermolecular forces which stabilize the folded (or native) shape and the unfolded (or denatured state) of proteins, in a fashion similar to how we discussed micelle and bilayer stability.

Just as saturated fatty acid chains have preferred conformations (all ttt), peptide chains also have preferred conformations. The complexity is much greater, however. With fatty acid chains, we dealt only with torsion or dihedral angles around the methylene carbons. For proteins, we must consider the covalent links which attach the amino acids together, as well as the rotations possible in 20 different amino acids. The peptide bonds connect the carbonyl C of the i th amino acid to the alpha amine N of the i th+1 amino acid. The resulting bond is an amide link. X-ray analysis shows that the the C-N bond has double bond character. This can be accounted for by delocalizing the nonbonding electron pair of the N to the carbonyl C forming a double bond, with the pi bonded electrons of the carbonyl C-O bond moving to the O. These resonance structures lead to a planar arrangement of the peptide carbonyl C and amide N and the two other atoms connected to each, since the hybridization of the C and N has sp2 character, with 120° bond angles. This greatly simplifies the number of conformations which a protein can adopt since these 6 atoms can be considered to reside and move in a plane. The alpha C serves as the corner attachment point of two different planes, each which can rotate independently of the other plane. The two planes can twist around the alpha carbon. The rotation angles for the two planes are called phi (f) and psi(y) are analogous to the torsion angles in the acyl chains of fatty acid. They can vary from -180° to +180°. The R group substituent attached to the alpha C can also rotate around the alpha C and the beta C of the side chain. This angle is defined as chi. Other rotations also occur within the side chain. We will concentrate on phi (f) and psi(y) angles in this section.

Figure: Extended Polypeptide Showing Planes and phi/psi Angles

Another important feature of the peptide bond is that the alpha Cs at opposite ends of the rectangle are usually trans to each other (on opposite sides of the C-N bond in the peptide bond. This trans arrangement of the alpha Cs is sterically favored by a factor of 1000/1 for all peptide bonds except X-Pro. Pro, which is a cyclic amino acid, is sterically restricted.

Figure: trans arrangement of the alpha Cs

The figure above, which also shows the X-Pro bond, clearly shows that both the trans and cis forms of the X-Pro bonds are hindered to a similar extent. In X-Pro bonds in proteins, the trans/cis ratio found in proteins is 4/1. The diagram above shows rans peptide bonds, and how they could be converted to cis through rotation around the C-N bond.

A protein can now be thought of as a series of linked sequences of rigid, planar peptide units which can rotate around phi/psi angles. When the chain is fully extended (as shown in the links above), phi/psi are 180°.

When phi (f) and psi(y) equal 0o, the two peptide bonds flanking the alpha Cs are in the same plane. This conformation is prohibited since the O of the C=O on one plane and the H of the H-N on the other are overlapping - i.e. they approach closer than their van der Waals radii.

This simple example shows that all conformational space is not accessible for protein folding.

• Animation showing phi (f) angle rotation at psi (y)= 0.
• Animations showing psi (y) phi (f) = 0.
• Quick time movie (Alpha carbons are red. Move slider to approx. half way point to see "extended chain, analogous to the zig-zag conformation of the acyl chains in saturated fatty acids)

## Ramachandran plot

Ramachandran was the first to calculate which phi/psi angles are allowed. He modeled the angles permitted to a tripeptide, assuming the atoms were hard spheres. The angles allowed depended in part on the limiting distance chosen for interatomic contacts. (i.e. the usual H -- H distance is 2.0 angstroms, and 3.0 for C--C bonds.) The plot below show the allowed regions in red. Only 3 small regions of conformational space are available. If you allow a closer approach by 0.1 angstrom, more conformation space is available, but only one new area is available (shown in yellow in the plot below).

Figure: Ramachandran plot

A Ramachandran plot of Ala-Ala-Ala is nearly identical to the plot for Phe-Phe-Phe (which is unbranched at the beta carbon (the first methylene C in the side chain). The plot for Thr-Thr-Thr, which has a branch at the beta C (with OH and CH3 attached) shows somewhat less room than the other plots. Pro-Pro-Pro is most restricted for obvious reasons. For a longer chain than a tripeptide, there are more restrictions than for (Ala)3, since the chain can't assume a conformation when it passes through itself. The plots for actual proteins have many points which do fall in forbidden regions. However, these points would be allowed if the peptide bonds is twisted a few degrees. Gly bonds also fall outside the allowed regions. This is understandable, since the side chain of Gly is H, and it is used in protein where sharp turns of the chain is necessary. Right hand alpha helices fall at -57,-47 while left hand alpha helices fall at +57,+47. (Notice these are not mirror images of each other. The mirror image of a right-handed alpha helix would be a left handed helix made of D-amino acids.) Parallel beta sheets are at -119, -113, while antiparallel sheets falls at -139, +135. Other types of helices also are found. The 310 helix , a sharper helix with 3 amino acids/turn, falls at -49,-26. All of these examples of secondary structure fall in allowed regions. Modern Ramachandran plots do not model the atoms as hard spheres but instead consider the potential energy of the atoms using the Lennard-Jones potential (6-12 potential) for van der Waals interactions. We discussed this potential function in the molecular modeling lab.

Figure: Ramachandran plots showing phi/psi angles for Gly, Ala, Tyr, and Pro in actual proteins

• Ramachandran Plots from Protopedia
• Query of Ramachandran Plots for any selected amino acid
• Ramachandran Plot Server
• from ExPASy. (concentrate on first part on backbone conformation and Ramachandran plots) (great site)
• Side Chain Conformations: How are side chain atoms and bond angles designated? A tutorial from ExPASy