Skip to main content
Biology LibreTexts

7: Phenotypic Variation and the Resemblance Between Relatives

  • Page ID
    96511
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    The distinction between genotype and phenotype is one of the most useful ideas in biology.1 The genotype of an individual (the genome), for most purposes, is decided when the gametes fuse to form a zygote (individual). The phenotype of an individual represents any measurable aspect of an organism.

    Behaviorism_1.gif
    Figure \(\PageIndex{1}\): European aspen P. tremula.

    Your height, the amount of RNA transcribed from a given gene, what you ate last Tuesday: all of these are phenotypes. Nearly any phenotype we can choose to measure about an organism represents the outcome of the information encoded by their genome played out through an incredibly complicated developmental, physiological and/or behavioural processes that in turn interact with a myriad of environmental and stochastic factors. Honestly it boggles the mind how organisms work as well as they do, let alone that I managed to eat lunch last Tuesday.

    There are many different ways to think about studying the path from genotype through to phenotype. The one we will take here is to think about how phenotypic variation among individuals in a population arises as a result of genetic variation in the population. One simple way to measure this genotype-phenotype relationship is to

    calculate the phenotypic mean for each genotype at a locus. For example, Wang et al. (2018) explored the genetic basis of budset time in European aspen (Populus tremula); the effect of one specific SNP on that phenotype is shown in in Figure \(\PageIndex{2}\). Budset timing is a key trait underlying local adaptation to varying growing season length. The associated SNP falls in a gene (PtFT2) that is known to play a strong role in flowering time regulation in other plants.

    Behaviorism_1.gif
    Figure \(\PageIndex{2}\): The effect of a flowering time gene (PtFT2) SNP on budset time in European aspen. Each dot gives the genotype-phenotype combination for an individual. The horizontal lines give the budset mean for each genotype and the vertical lines show the inter-quartile range. The dotted line gives the linear regression of phenotype on genotype. Thanks to Pär Ingvarsson for sharing these data from Wang et al. (2018).

    One way for us to assess the relationship between genotype and phenotype is to find the best fitting linear line through the data, i.e. fit a linear regression of phenotypes for our individuals on their genotypes at a particular SNP (l):

    \[X ∼ μ + alGl\]

    In the equation above, X is a vector of the phenotypes of a set of individuals and Gl is our vector of genotypes at locus l, with Gi,l taking the value 0, 1, or 2 depending on whether our individual i is homozygote, heterozygote, or the alternate homozygote at our locus of interest. Here μ is our phenotypic mean. The slope of this regression line (al) has the interpretation of being the average effect of substituting a copy of allele 2 for a copy of allele 1. In our aspen example the slope is −13.6, i.e. swapping a single T for a G allele moves the budset forward by 13.6 days, such that the GG homozygote is predicted to set buds 27.2 days earlier than the T T homozygote.

    As a measure of the significance of this genotype-phenotype re- lationship, we can calculate the p-value of our regression. To try to identify loci that are associated with our trait genome-wide, we can conduct this regression at each SNP we genotype in the genome. One common way to display the results of such an analysis (called
    a genome-wide association study or GWAS for short) is to plot the minus logarithm of the p-value for each SNP along genome (a so-called Manhattan plot). Here’s one from Wang et al. (2018) for their aspen budset phenotype

    Behaviorism_1.gif
    Figure \(\PageIndex{3}\): Manhattan plot of the p-value of the linear association be- tween genotype and budset in aspen. Each dot represents the test at a single SNP, plotted at its physical coordinate in the genome. Different chromosomes are plotted in alternat- ing colours. The SNPs surrounding the PtFT2 gene are shown in red. From Wang et al. (2018), licensed under CC BY 4.0.

    The SNP with the most significant p-value is SNP in PtFT2. Note that other SNPs in the surrounding region also light up as showing a significant association with budset timing. This is because loci that are in linkage disequilibrium with a functional locus may in turn show an association, not because they directly affect the phenotype, but simply because the genotypes at the two loci are themselves non-randomly associated. Below is a zoomed in version (Figure 2 in Wang et al. (2018)) with SNPs coloured by the strength of their LD with the putatively functional SNP. Note how SNPs in strong LD with the functional allele (redder points) have more significant p-values.

    Variation in some traits seems to have a relatively simple genetic basis. In our aspen example there is one clear large-effect locus, which explains 62% of the variation in budset. Note that even in this case, where we have an allele with a very strong effect on a phenotype, this is not an allele for budset, nor is PtFT2 a gene for budset. it is an allele that is associated with budset in the sampled environments and populations. In a different set of environments, this allele’s effects may be far smaller, and a different set of alleles may contribute to phenotype variation. PtFT2, the gene our focal SNP falls close to, is just one of many genes and molecular pathways involved in budset. A mutant screen for budset may uncover many genes with larger effects; this gene is just a locus that happens to be polymorphic in this particular set of genotyped individuals.

    Behaviorism_1.gif
    Figure \(\PageIndex{4}\): The Manhattan plot zoomed in on the top-hit (red SNPs from Figure 7.3). SNPs are now coloured by their D′ value with the most significant SNP. D′ is the LD covariance between a pair of loci (D, eqn (3.15)) normalized by the largest value D can take given the allele frequencies. Figure from Wang et al. (2018), licensed under CC BY 4.0.

    While phenotypic variation for some phenotypes has a relatively simple genetic basis, many phenotypes are likely much more genetically complex, involving the functional effect of many alleles at hundreds or thousands of polymorphic loci. For example hundreds of small effect loci affecting human height have been mapped in European populations to date. Such genetically complex traits are called polygenic traits.

    In this chapter, we will use our understanding of the sharing of alleles between relatives to understand the phenotypic resemblance between relatives in quantitative phenotypes. This will allow us to understand the contribution of genetic variation to phenotypic variation. In the next chapter, we will then use these results to understand the evolutionary change in quantitative phenotypes in response to selection.

    A simple additive model of a trait

    Let’s imagine that the genetic component of the variation in our trait is controlled by L autosomal loci that act in an additive manner. The frequency of allele 1 at locus l is pl, with each copy of allele 1 at this locus increasing your trait value by al above the population mean. The phenotype of an individual, let’s call her i, is Xi. Her genotype at SNP l is Gi,l. Here Gi,l = 0, 1, or 2, representing the number of copies of allele 1 she has at this SNP. Her expected phenotype, given her genotype at all L SNPs, is then

    \[E(Xi|Gi,1,··· ,Gi,L) = μ + XA,i = μ +Gi,lall\]

    where μ is the mean phenotype in our population, and XA,i is the deviation away from the mean phenotype due to her genotype. Now in reality the phenotype is a function of the expression of those alleles in a particular environment. Therefore, we can think of this expected phenotype as being an average across a set of environments that occur in the population.

    When we measure our individual’s observed phenotype we see

    \[Xi =μ+XA,i +XE,i\]

    where XE is the deviation from the mean phenotype due to the envi- ronment. This XE includes the systematic effects of the environment our individual finds herself in and all of the noise during development, growth, and the various random insults that life throws at our indi- vidual. If a reasonable number of loci contribute to variation in our trait then we can approximate the distribution of XA,i by a normal distribution due to the central limit theorem (see Figure 7.5). 2 Thus if we can approximate the distribution of the effect of environmental variation on our trait (XE,i) also by a normal distribution, which is reasonable as there are many small environmental effects, then the distribution of phenotypes within the population (Xi) will be normally distributed (see Figure \(\PageIndex{5}\)).

    Behaviorism_1.gif
    Figure \(\PageIndex{5}\): The convergence of the phenotypic distribution to a normal distribution. Each of the three his- tograms shows the distribution of the phenotype in a large sample, for increasingly large numbers of loci (L = 1, 4, and 10, with the pro- portion of variance explained held at VA = 1). I have simulated each individual’s phenotype following equations 7.2 and 7.3. Specifically, we’ve simulated each individual’s biallelic genotype at L loci, assuming Hardy-Weinberg proportions and that the allele is at 50% frequency. We assume that all of the alleles have equal effects and combine them ad- ditively together. We then add an environmental contribution, which is normally distributed with mean zero and variance 0.05. Note that in the left two pictures you can see peaks corresponding to different genotypes due to our low environmental noise (in practice we can rarely see such peaks for real quantitative pheno- types).

    Note that as this is an additive model; we can decompose eqn. 7.3 into the effects of the two alleles at each locus and rewrite it as

    \[\]

    that our individual received from her mother (maternal alleles) and father (paternal alleles) respectively. This will come in handy in just a moment when we start thinking about the phenotypic covariance of relatives.

    Now obviously this model seems silly at first sight as alleles don’t only act in an additive manner, as they interact with alleles at the same loci (dominance) and at different loci (epistasis). Later we’ll relax this assumption, however, we’ll find that if we are interested
    in evolutionary change over short time-scales it is actually only the “additive component” of genetic variation that will (usually) concern us. We will define this more formally later on, but for the moment we can offer the intuition that parents only get to pass on a single allele at each locus on to the next generation. As such, it is the effect of these transmitted alleles, averaged over possible matings, that is an individual’s average contribution to the next generation (i.e. the additive effect of the alleles that their genotype consists of).

    Additive genetic variance and heritability

    As we are talking about an additive genetic model, we’ll talk about the additive genetic variance (VA), the phenotypic variance due to the additive effects of segregating genetic variation. This is a subset of the total genetic variance if we allow for non-additive effects.

    The variance of our phenotype across individuals (VP ) we can write as

    \[VP =Var(X)=Var(XA)+Var(XE)=VA +VE\]

    In writing the phenotypic variance as a sum of the additive and envi- ronmental contributions, we are assuming that there is no covariance between XG,i and XE,i i.e. there is no covariance between genotype and environment.

    Our additive genetic variance can be written as

    \[VA = Var(XA) =V ar(Gi,lal)\]

    where V ar(Gi,lal) is the contribution of locus l to the additive vari- ance among individuals. Assuming random mating, and that our loci are in linkage equilibrium, we can write our additive genetic variance as

    \[\]

    where the 2pl(1 − pl) term follows from the binomial sampling of two alleles per individual at each locus.

    Exercise \(\PageIndex{1}\)

    You have two biallelic SNPs contributing to variance in human height. At the first SNP you have an allele with an additive effect of 5cm which is found at a frequency of 1/10,000. At
    the second SNP you have an allele with an additive effect of −0.5cm segregating at 50% frequency. Which SNP contributes more to the additive genetic variance? Explain the intuition of your answer.

    Above in eqn(7.4) we decomposed the additive genetic component of XA,i as XM,i + XP,i the additive contributions of the maternal and paternal derived alleles in the ith individual. Similarly we can decompose the additive genetic variance VA as

    \[VA = V ar(XA) = V ar(XM,i) + V ar(XP,i)\]

    \[V ar(XM,i) = V ar(XM,i) = VA/2\]

    assuming that our individuals are mating at random and that mater- nal and paternal alleles are equal in their effect in offspring.

    An example of the additive basis of variation using polygenic scores.

    Now we don’t usually get to see the individual loci contributing to highly polygenic traits. Instead, we only get to see the distribution
    of the trait in the population. However, with the advent of GWAS in human genetics we can see some of the underlying genetics using the many trait-associated loci identified to date. Using the estimated ef- fect sizes at each locus, each one of which is tiny, we can calculate the weighted sum over an individual’s genotype as in equation 7.2. This weighted sum is called the individual’s polygenic score. To illustrate how polygenic scores work, we can take a set of 1700 SNPs6. The ef- fects of these SNPs are tiny; the median, absolute additive effect size is 0.07cm. Figure 7.6 shows the distribution of a thousand individuals’ polygenic scores calculated using these 1700 SNPs (simulated geno- types using the UKBB frequencies). The standard deviation of these polygenic scores ∼ 2cm. The individuals with higher polygenic scores for height are predicted to be taller than the individuals with lower polygenic scores.

    Behaviorism_1.gif
    Figure \(\PageIndex{6}\): Left) The distribution of the number of height-increasing alleles that individuals carry at 1700 SNPs associated with height in the UK Biobank, for a sample of 1000 individuals. right) The distribution of the polygenic scores for these 1000 individuals. Plotted on top is a normal distribution with the same mean and variance. The empirical variance of these polygenic scores is 0.13, the additive genetic variance calculated by equation (7.7) is 0.135, so the two are in good agreement.

    The narrow sense heritability We would like a way to think about what proportion of the variation in our phenotype across individuals is due to genetic differences as opposed to environmental differences. Such a quantity will be key in helping us think about the evolution of phenotypes. For example, if variation in our phenotype had no genetic basis, then no matter how much selection changes the mean phenotype within a generation the trait will not change over generations.

    We’ll call the proportion of the variance that is genetic the heritability, and denote it by h2. We can then write heritability as

    \[\]

    Remember that we are thinking about a trait where all of the alleles act in a perfectly additive manner. In this case our heritability h2 is referred to as the narrow sense heritability, the proportion of the variance explained by the additive effect of our loci. When we allow dominance and epistasis into our model, we’ll also have to define the broad sense heritability (the total proportion of the phenotypic vari- ance attributable to genetic variation).

    The narrow sense heritability of a trait is a useful quantity; indeed we’ll see shortly that it is exactly what we need to understand the evolutionary response to selection on a quantitative phenotype. We can calculate the narrow sense heritability by using the resemblance between relatives. For example, if the phenotypic differences between individuals in our population were solely determined by environmental differences experienced by these different individuals, we should not expect relatives to resemble each other any more than random individuals drawn from the population. Now the obvious caveat here is that relatives also share an environment, so they may resemble each other due to shared environmental effects.

    Note that the heritability is a property of a sample from the population in a particular set of environments at a particular time. Changes in the environment may change the phenotypic variance. Changes in the environment may also change how our genetic alleles are expressed through development and so change VA. Thus estimates of heritability are not transferable across environments or populations.

    The covariance between relatives

    People have long been fascinated by the resemblance between relatives, particularly twins (see Figure \(\PageIndex{7}\)). Families hold a special place in quantitative genetics, as remarkably we can use the resemblance between relatives to directly estimate the heritability and covariance of traits. To see this we can calculate the covariance in phenotype be- tween pairs of individuals (1 and 2) who have phenotypes X1 and X2 respectively.7 To think about imagine plotting the phenotypes of, say, sisters against each other. The x and y coordinates of each point will be the, say, heights of the pair of siblings. Do tall women tend to have tall sisters, do short women tend to have short sisters? How much do their phenotypes covary? If some of the variation in our phenotype is genetic we expect identical twins to resemble each other more than full siblings, who in turn will resemble each other more than half-sibs and so on out (see Figure \(\PageIndex{8}\)). Under our simple additive model of phenotypes we can write the covariance as

    \[Cov(X1,X2) = Cov(X1M +X1P +X1E, X2M +X2P +X2E)\]

    We can expand this out in terms of the covariance between the various components in these sums.

    Behaviorism_1.gif
    Figure \(\PageIndex{7}\): The Cholmondeley Ladies. Unknown British Painter, circa 1600. Inscription on bottom left of the painting “Two Ladies of the Chol- mondeley Family, Who were born the same day, Married the same day, And brought to Bed the same day.” The ladies are thought to be twin sisters, but there’s a clue that they’re not identical twins. Can you spot it?
    Behaviorism_1.gif
    Figure \(\PageIndex{8}\): Covariance of phenotypes between pairs of individuals of a given relatedness. Each point gives the phenotypes of a different pair of indi- viduals. The additive genetic variance is held constant at VA = 1, such that the expected covariances (2F1,2VA) should be 1, 0.5, 0.25, and 0.125 re- spectively in good agreement with the empirical covariances reported in the title of each graph. The data were simulated as described in the caption of Figure 7.5. The dashed red line shows x = y and the solid blue line shows the best fitting linear regression line.

    To make our task easier, we will make two commonly made assumptions:

    1. We can ignore the covariance of the environments between individuals (i.e. Cov(X1E , X2E ) = 0)
    2. We can ignore the covariance between the environment of one individual and the genetic variation in another individual (i.e. Cov(X1E , (X2M + X2P )) = 0).

    The failure of these assumptions to hold can undermine our esti- mates of heritability, but we’ll return to that later. Moving forward with these assumptions, we can simplify our original expression above and write our phenotypic covariance between our pair of individuals as

    \[Cov(X1, X2) = Cov(X1M , X2M )+Cov(X1M , X2P )+Cov(X1P , X2M )+Cov(X1P , X2P )\]

    This equation is saying that, under our simple additive model, we can see the covariance in phenotypes between individuals as the covariance between the maternal and paternal allelic effects in our individuals. We can use our results about the sharing of alleles between relatives to obtain these covariance terms. But before we write down the general case, let’s quickly work through some examples.

    The covariance between identical twins

    Let’s first consider the case of a pair of identical twins, monzygotic (MZ) twins, from two unrelated parents. Our pair of twins share their maternal and paternal allele identical by descent (X1M = X2M and X1P = X2P ). As their maternal and paternal alleles are not correlated draws from the population, i.e. have no probability of being IBD as we’ve said the parents are unrelated, the covariance between their effects on the phenotype is zero (i.e. Cov(X1P , X2M ) = Cov(X1M , X2P ) = 0). In that case, eqn. 7.12 is

    \[Cov(X1, X2) = Cov(X1M , X2M )+Cov(X1P , X2P ) = V ar(X1M )+V ar(X1P ) = VA\]

    where the middle step follows from the fact the maternal (or sim- ilarly the paternal) allele in a pair of twins is the same allele so Cov(X1M , X2M ) = Cov(X1M , X1M ) = V ar(X1M ), as the covariance of random variable with itself is just its variance, and then the addi- tive variance of the maternal allele contribution is V ar(X1M ) = VA/2 following from eqn(7.9).

    To calculate the narrow sense heritability we could then in principal divide the covariance of our pairs of MZ twins (MZ1 and MZ2) by the trait variance to give

    \[\]

    where ρMZ is the correlation of pairs of MZ twins (see Appendix eqn (A.43) for more on correlations). For example, we could estimate the heritability of a measure of body from the MZ correlation in Figure \(\PageIndex{9}\). In general, this simple estimator isn’t great as the correlation of identical twins includes the effects of the shared family environment of the twins (i.e. Cov(X1E,X2E)).

    Moreover, it can be inflated by non-additive effects as identical twins don’t just share alleles, they share their entire genotypes, and thus resemble each other in phenotype also because of shared dominance effects (we’ll discuss non-additive effects in Section 7.1.1). Better twin-based estimates of heritability are commonly used based on the comparison of MZ vs twins that bypass some of these issues.

    Behaviorism_1.gif
    Figure \(\PageIndex{9}\): A measure of body fat in pairs of monozygotic (MZ) and dizygotic (DZ) twins. Our sample correlations are ρˆMZ = 0.72 and ρˆDZ = 0.10. Data from Faith et al. (1999)

    The covariance in phenotype between parent and child Children re- semble their biological parents because children inherit their genome from their parents (putting aside shared environments for the moment). If a mother and father are unrelated individuals, i.e. they are two random draws from the population, then this mother and her child share one allele IBD at each locus (i.e. r1 = 1 and r0 = r2 = 0). Let’s assume that our mother (ind 1) transmits her paternal allele to the child (ind 2), in which case XP1 = XM2, and so Cov(XP1,XM2) = V ar(XP 1) = 12 VA, and all the other covariances in eqn. 7.12 are zero. We’d also arrive at this result if instead we had thought of the mother transmitting her own maternal allele. Thus Cov(X1, X2) = 12 VA, we can leverage this form of the covariance to directly estimate h2 by regression.

    We can estimate the narrow sense heritability through the regression of child’s phenotype on the parental mid-point phenotype. The parental mid-point phenotype is simply the average of the mum and dad’s phenotype. See Figure \(\PageIndex{10}\) for an example from song sparrows.

    Behaviorism_1.gif
    Figure \(\PageIndex{10}\): Parent-midpoint off- spring regression for beak depth and tarsus length in song sparrows. The phenotypes have been standardized to have mean 0 and variance 1. The red line shows the best fitting slope, whose slope is reported on the graph. Note that Smith and Zach (1979) regressed the average offspring phenotype for each family on parental mid-point (Xavg.kid ∼ Xmid), as they had multiple offspring per family. However, this doesn’t change the slope of the regression from the form given by eqn (7.15). The grey line is the x = y line. Data from Smith and Zach (1979)

    We denote the child’s phenotype by Xkid and mid-point phenotype by Xmid, so that if we take the regression Xkid ∼ Xmid this regres- sion has slope β = Cov(Xkid, Xmid)/V ar(Xmid). The covariance of Cov(Xkid, Xmid) = 21 VA, and V ar(Xmid) = 12 VP , as by taking the aver- age of the parents we have halved the variance, such that the slope of the regression is

    \[\]

    Behaviorism_1.gif
    Figure \(\PageIndex{11}\): Song sparrow (Melospiza melodia). “He is the most incurable optimist of my acquaintance”.

    i.e. the regression of the child’s phenotype on the parental midpoint phenotype is an estimate of the narrow sense heritability. If much of the phenotypic variation is due to the (additive) differences in genotypes among individuals (h2 ≈ 1), then children will closely resemble their parents. Conversely if much of the variation is environmental (h2 ≈ 0), and there is no shared environment between parent and child, children will not resemble their parents.

    Applying this heritability estimate to the Song sparrow sample we find h2 = 0.43 and h2 = 0.3 for beak depth and tarsus length respectively from Figure \(\PageIndex{10}\). So in Smith and Zach (1979) analysis, for example, 30% of the variance in tarsus length is atttributal to the additive effect of genetic differences among individuals. Smith and Zach (1979) also regressed the average offspring phenotype agains their fathers or mothers against their offspring, giving
    a slope of βdad,avg.kid and βmum,kid. For tarsus length, for example, they found βdad,avg.kid = 0.19 and βmum,avg.kid = 0.17. Following a similar argument to that in eqn (7.15) we find that these slopes are βdad,kid = VA/2/VP = h2/2, and the same for mums. Thus the regression of offspring’s phenotype on a particular parent is an estimate of half the narrow-sense heritability, in line with the reduced slopes found by Smith and Zach (1979), this halfing of the slope is due to the fact that a single parent’s phenotype is a noisier estimate of the parental mid-point and so less informative about the child s phenotype. These parent specific estimates of heritability are particularly useful as they allow us to investigate sex-specific inheritance and sexual dimorphism (we’ll explore this in a later section).

    Behaviorism_1.gif
    Figure \(\PageIndex{12}\): Regression of child’s phenotype of the parental mid-point phenotype. The three panels show decreasing levels of environmental variance (VE) holding the additive genetic variance constant (VA = 1). In these figures, we simulate 100 loci, as described in the caption of Figure 7.5.We simulate the genotypes and phenotypes of the two parents, and then simulate the child’s genotype following mendelian transmission. The red line shows x = y and the blue line shows the best fitting linear regression line.

    Estimating heritability by these various parent-offspring regression have the issue of not controlling for environmental correlations between parent and offspring, which can inflate our estimates of hertability (as we will mistake environmentally mediated resemblance for genetics). Raising the organisms in the lab could remove much of the potential for shared environment between parent and offspring, but it also removes much of the environmental variation and we (as evolutionary geneticists) are usually not primarily interested in knowing the heritability in the lab bur rather in the field. In some organisms, notably plants, we can begin to sidestep these issues by raising offspring in a common set of randomized field conditions (a so called “common garden”). Another option is cross-foster animals, for example Smith and Dhondt (1980) returned to the song sparrow population and swapped eggs between parents nests. They found that the covariance between biological parents and children was still high despite these children being raised in a different nest, but that there was no significant covariance between foster parents and their non-biological children (see Figure \(\PageIndex{13}\) for beak depth). This suggests that family environment is not confounding the estimate of heritability in this song sparrow sample. However, such manipulations are often impossible in many systems, and issues of shared environmental covariance due to maternal resources from egg (or seed) are still present.

    Behaviorism_1.gif
    Figure \(\PageIndex{13}\): Foster Parent-midpoint offspring regression for beak depth and tarsus length in song sparrows. The red line shows the best fitting slope, whose slope is reported on the graph. The slope is not significant. The grey line is the x = y line. Data from Smith and Dhondt (1980)

    Despite its issues, this measure of heritability provides useful intuition and is directly relevant to our discussion of the response to selection in the next chapter. That’s because our regression allows us to attempt to predict the phenotype of the child given the phenotypes of the parents; how well we can do this depends on the slope. See Figure \(\PageIndex{12}\) for examples. If the slope is close to zero then the parental phenotypes hold little information about the phenotype of the child, while if the slope is close to one then the parental mid-point is a good predictor of the child’s phenotype. As we will see, natural selection will only efficiently drive evolution if children resemble their parents.

    Thinking abour our prediction of child’s phenotpye more formally, the expected phenotype of the child given the parental phenotypes is

    \[E(Xkid|Xmum, Xdad) = μ + βmid,kid(Xmid − μ) = μ + h2(Xmid − μ)\]

    which follows from the definition of linear regression. So to find the child’s predicted phenotype, we simply take the mean phenotype and add on the difference between our parental mid-point and the population mean, multiplied by our narrow sense heritability.

    The covariance between general pairs of relatives under an additive model The above examples above make clear that to understand
    the covariance between phenotypes of relatives, we simply need to think about the alleles they share IBD. Consider a pair of relatives (1 and 2) with a probability r0, r1, and r2 of sharing zero, one, or two alleles IBD respectively. When they share zero alleles Cov((X1M + X1P ), (X2M + X2P )) = 0, when they share one allele Cov((X1M + X1P),(X2M +X2P)) = Var(X1M) = 12VA, and when they share two alleles Cov((X1M + X1P ), (X2M + X2P )) = VA. Therefore, the general covariance between two relatives is

    \[\]

    where F1,2 is our coefficient of kinship, i.e. the probability that two alleles sampled at random from our pair individuals 1 and 2 are IBD (see eqn (2.5)). So under a simple additive model of the genetic basis of a phenotype, to measure the narrow sense heritability we need to measure the covariance between pairs of relatives (assuming that we can remove the effect of shared environmental noise). From the covariance between relatives we can calculate VA, and we can then divide this by the total phenotypic variance to get h2.

    Exercise \(\PageIndex{2}\)

    A) In polygynous red-winged blackbird populations (i.e. males mate with several females), paternal half-sibs can be identified. Suppose that the covariance of tarsus lengths among half-sibs is 0.25 cm2 and that the total phenotypic variance is 4 cm2. Use these data to estimate h2 for tarsus length in this population.

    B) Why might paternal half-sibs be preferable for measuring heritability than maternal half-sibs?

    Behaviorism_1.gif
    Figure \(\PageIndex{14}\): Red-winged blackbird and tricoloured blackbirds (it Agelaius phoeniceus and Agelaius tricolor).

    Estimating additive genetic variance across a variety of different relationships (The animal model). In many natural populations we may have access to individuals with a range of different relationships to each other (e.g. through monitoring of the paternity of individuals), but relatively few pairs of individuals for a specific relationship (e.g. sibs). We can try and use this information on various relatives as fully as possible in a mixed model framework. Building from equation 7.3, we can write an individual’s phenotype Xi as

    \[Xi = μ + XA,i + XE,i\]

    where XE,i ∼ N(0,VE) and XA,i is normally distributed across individuals with covariance matrix VAA, where the the entries for a pair of individuals i and j are Aij = 2Fi,j and Aii = 1. Given the matrix A we can estimate VA. We can also add fixed effects into this model to account for generation effects, additional mixed effects could also be included to account for shared environments between particular individuals (e.g. a shared nest). This approach is sometimes called the “animal model”, and is widely used to in modern quantitative gentics to estimate genetic variances and heritabilities.

    Multiple traits

    Traits often covary with each other, both due to environmentally in- duced effects (e.g. due to the effects of diet on multiple traits) and due to the expression of underlying genetic covariance between traits. Genetic covariance, in turn, can reflect pleiotropy, a mechanistic effect of an allele on multiple traits (e.g. variants that affect skin pigmenta- tion often affect hair color), the genetic linkage of loci independently affecting multiple traits, or the effects of assortative mating.

    Consider two traits X1,i and X2,i in an individual i. These traits could be, say, the individual’s leg length and nose length. As before,

    \[X1,i = μ1 + X1,A,i + X1,E,i

    X2,i = μ2 + X2,A,i + X2,E,i\]

    As before we can talk about the total phenotypic variance (V1,V2), environmental variance (V1,E and V2,E), and the additive genetic variance for trait one and two (VA,1, VA,2). But now we also have to consider the total covariance between trait one and trait two, V1,2 = Cov(X1,X2), as well as the environmentally induced covari- ance (VE,1,2 = Cov(X1,E,X2,E)) and the additive genetic covariance (VA,1,2 = Cov(X1,A,X2,A)). To better understand the covariance aris- ing due to pleiotropy, let’s think about a set of L SNPs contributing to our two traits. If the additive effect of an allele at the ith SNP is αi,1 and αi,2 on traits 1 and 2, then the additive covariance between our traits is

    \[\]

    assuming our loci are not in linkage disequilibrium. Thus a genetic correlation arises due to pleiotropy, because loci that tend to affect trait 1 also systematically affect trait 2. For example, alleles associated with later age at menarche (AAM), age at first menstrual cycle, in European women also tend to be positively associated with height (see Figure \(\PageIndex{15}\)), thereby creating a genetic correlation between AAM and height.

    Behaviorism_1.gif
    Figure \(\PageIndex{15}\): The additive effect sizes of loci associated with female age at menarche (AAM) and their effect size on height in a European population. Data from Pickrell et al. (2016).

    We can ‘store’ our variance and covariance values in matrices, a way of gathering these terms that will be useful when we discuss selection:

    \[\]

    Behaviorism_1.gif
    Figure \(\PageIndex{16}\): Grey treefrog (Hyla versicolor)

    \[\]

    Here we’ve shown the matrices for two traits, but we can generalize this to an arbitrary number of traits.

    We can estimate these quantities, in a similar way as before, by studying the covariance in different traits between relatives:

    \[\]

    Behaviorism_1.gif
    Figure \(\PageIndex{17}\): Phenotypic and genetic correlations in male grey treeforg (Hyla versicolor) calls. On the left each male is shown as a dot, recording their inter-call period and the number of pulses in each call. One the right each dot corresponds to a father with the mean of sons for both phenotypes. Data from Welch et al. (2014)

    An example of phenotype and genetic covariance are shown on the left and right of Figure \(\PageIndex{17}\) respectively. Gray treefrogs (Hyla versicolor) chorus to attract mates. Their call is made up of a trill, a note rapidly pulsed a number of times, that is then repeated after some period. Female frogs prefer males who make a lot of calls and where each of those calls have a large number of pulses. However, doing both is be very energetic, and so there is potentially a tradeoff between these two aspects of a male frog’s call. Indeed Welch et al. (2014) found in lab-reared male frogs that the pulse number and the time period between calls were positively correlated, left side of Figure \(\PageIndex{17}\), i.e. individuals were investing their energy in making either few highly pulsed calls or many calls with few pulses. This phenotypic covariance reflects underlying a genetic covariance between theses two frog call characteristics (right side Figure \(\PageIndex{17}\)). Fathers whose sons have calls with highly pulsed calls also have sons whose calls are more spaced apart.

    Behaviorism_1.gif
    Figure \(\PageIndex{18}\): Relationship of standardized forehead patch size between fathers and sons and daughters in pied fly-catchers. Data from Potti and Canal.

    One useful summary of a genetic covariance is the genetic correlation between two phenotypes

    \[\]

    where VA,1 and VA,2 are the additive genetic variance for trait 1 and 2 respectively. Here, rg tells us to what extent the additive genetic variance in two traits is correlated.

    Another important application of genetic covariances is in the study of sexually antagonistic selection and the evolution of sexual dimorphism; here we’ll calculate the genetic covariance between male and female phenotypes. For example, below is the relationship between the forehead patch size for pied fly-catcher fathers and their sons and daughters. The phenotype has been standardized to have mean 0 and variance 1 in each group. The phenotypic covariance of the sample of fathers and sons is 0.35, while the phenotypic covariance of fathers and daughter is 0.23.

    Behaviorism_1.gif
    Figure \(\PageIndex{19}\): Ficedula hypoleuca, pied fly-catcher.
    Exercise \(\PageIndex{3}\)

    Assume we can ignore the effect of the shared environment in our pied fly-catcher example.
    A) What is the additive genetic covariance between male and female patch size?

    B) What is the additive genetic correlation of male and female patch size? You can assume that the additive genetic variance is the same in males and females.

    Non-additive variation.

    Up to now we’ve assumed that our alleles contribute to our pheno- type in an additive fashion. However, that does not have to be the case as there may be non-additivity among the alleles present at a locus (dominance) or among alleles at different loci (epistasis). We can accommodate these complications into our models. We do this
    by partitioning our total genetic variance into independent variance components. We’ll see that dominant and epistatically interacting loci can contribute to the additive genetic variance. In constructing these variance components models we’ll assume that we know the alleles contributing to variation in our trait and their effects, but in reality we rarely know these. However, as we’ll see we don’t need to know these details and we can partition our variance and estimate additive variance and other forms of non-additive variation using the resemblance between various types of relatives.

    Dominance. To understand the effect of dominance, let’s consider how the allele that a parent transmits influences their offspring’s phenotype. A parent transmits one of their two alleles at a locus to their offspring. Assuming that individuals mate at random, this allele is paired with another allele drawn at random from the population. For example, assume your mother transmitted an allele 1 to you: with probability p it would be paired with another allele 1, and you would be a homozygote; and with probability q it’s paired with a 2 allele and you’re a heterozygote.

    Now consider an autosomal biallelic locus l, with frequency p for allele 1, and genotypes 0, 1, and 2 corresponding to how many copies of allele 1 individuals carry. We’ll denote the mean phenotype of an individual with genotype 0, 1, and 2 as Xl,0, Xl,1, Xl,2 respectively. This mean is taking an average phenotype over all the environments and genetic backgrounds the alleles are present on. We’ll mean center (MC) these phenotypic values, setting X′l,0 = Xl,0 − μ, and likewise for the other genotypes.

    We can think about the average (marginal) MC phenotype of an individual who received an allele 1 from their parent as the average of the MC phenotype for heterozgotes and 11 homozygotes, weighted by the probability that the individual has these genotypes, i.e. the probability they receive an additional allele 1 or an allele 2 from their other parent:

    \[\]

    Similarly, if your parent transmitted an 2 allele to you, your average MC phenotype would be

    \[\]

    Let’s now consider the average phenotype of an offspring by how many copies of the allele 1 they carry

    Behaviorism_1.gif

    i.e. the mean phenotype of each genotypes’ offspring averaged over all possible matings to other individuals in the population (assuming individuals mate at random). These are the additive MC genetic values (breeding values) of our genotypes. Here we are simply adding up the additive contributions of the alleles present in each genotype and ignoring any non-additive effects of genotype.

    To illustrate this, in Figure \(\PageIndex{20}\) we plot two different cases of dominance relationships; in the top row an additive polymorphism and in the second row a fully dominant allele. The additive genetic values of the genotypes are shown as red dots. Note that the additive values of the genotypes line up with the observed MC phenotypic means in the top row, when our alleles interact in a completely additive manner. Our additive genetic values always fall along a linear line (the red line in our figure). The additive values are falling along the best fitting line of linear regression for our population, when phenotype is regressed against the additive genotype (0, 1, 2 copies of allele 1) across all individuals in our population. Note in the dominant case the additive genetic values differ from the observed phenotypic means, and are closer to the observed values for the genotypes that are most common in the population.

    Behaviorism_1.gif
    Figure \(\PageIndex{20}\): The average mean- centered (MC) phenotypes plotted against the number of allele 1 carried (from 0 for 22 to 2 for 11). Top Row: Additive relationship between genotype and phenotype. Bottom Row: Allele 1 is dominant over allele 2, such that the heterozygote has the same phenotype as the 11 genotype. The area of each circle is proportion to the fraction of the population in each genotypic class (p2, 2pq, and q2). One the left column p = 0.1 and the right column is p = 0.9. The additive genetic values of the genotypes are shown as red dots. The regression between phenotype and additive genotype is shown as a red line. The black vertical arrows show the difference between the average MC phenotype and additive genetic value for each genotype.

    The difference in the additive effect of the two alleles al,2 − al,1 can be interpreted as an average effect of swapping an allele 1 for an allele 2; we’ll call this difference αl = al,2 − al,1. Our αl is also the slope of the regression of phenotype against genotype (the red line in Figure \(\PageIndex{20}\). Note that the slope of our regression of phenotype on genotype (αl) does not depend on the population allele frequency for our completely additive locus (top row of 7.20). In contrast, when there is dominance, the slope between genotype and phenotype (αl) is a function of allele frequency (bottom row of 7.20). When a dominant allele (1) is rare there is a strong slope of phenotype on genotype, bottom left Figure 7.20. This strong slope is because replacing a single copy of the 2 allele with a 1 allele in an individual has a big effect on average phenotype, as it will most likely move an individual from be- ing a 22 homozygote to being a 12 heterozygote. In contrast, when the dominant allele (1) is common in the population, replacing a 2 allele by a 1 allele in an individual on average has little phenotypic effect, leading to a weak slope (bottom right Figure \(\PageIndex{20}\)). This small effect is because as we are mainly turning heterozygotes into homozygotes (11), who have the same mean phenotype as each other.

    Behaviorism_1.gif
    Figure \(\PageIndex{21}\): Atlantic salmon (Salmosalar).

    As as an example of how dominance and population allele frequencies can change the additive effect of an allele, let’s consider the genetics of the age of sexual maturity in Atlantic Salmon. A single allele of large effect segregates in Atlantic Salmon that influences the sexual maturation rate in salmon (Ayllon et al., 2015; Barson et al., 2015), and hence the timing of their return from the sea to spawn (sea age). The allele falls close to the autosomal gene VGLL3 (Cousminer et al., 2013, variation at this gene in humans also influences the timing of puberty). The left side of Figure \(\PageIndex{22}\) shows the age at sexual maturity in males. The L allele associated with slower sexual maturity is recessive in males. While the LL homozygotes mature on average a whole year later, the additive effect of the allele is weak while the L allele is rare in the population. The right panel shows the effect of the L allele in females. Note how the allele is much more dominant in females, and has a much more pronounced additive effect. The dominance of an allele is not a fixed property of the allele but rather a statement of the relationship of genotype to phenotype, such that the dominance relationship between alleles may vary across phenotypes and contexts (e.g. sexes).

    Behaviorism_1.gif
    Figure \(\PageIndex{22}\): The average age at sexual maturity for each genotype, broken down by sex. The area of each circle is proportional to the fraction of the population in each genotypic class. The regression between pheno- type and additive genotype is shown as a red line. Data from Barson

    et al. (2015).

    The variance in the population phenotype due to these additive breeding values at locus l, assuming HW proportions, is

    \[\]

    The total additive variance for the whole genotype can be found by summing the individual additive genetic variances over loci

    \[\]

    Having assigned the additive genetic variance to be the variance ex- plained by the additive contribution of the alleles at a locus, we define the dominance variance as the population variance among genotypes at a locus due to their deviation from additivity. We can calculate how much each genotypic mean deviates away from its additive prediction at locus l (the length of the arrows in Figure \(\PageIndex{20}\)). For example, the heterozygote deviates

    \[\]

    away from its additive genetic value, with similar expressions for each of the homozygotes (dl,0 and dl,2). We can then write the dominance variance at our locus as the genotype-frequency weighted sum of our squared dominance deviations

    \[\]

    Writing our total dominance variance as the sum across loci

    \[\]

    Having now partitioned all of the genetic variance into additive and dominant terms, we can write our total genetic variance as

    \[\]

    We can do this because by construction the covariance between our additive and dominant deviations for the genotypes is zero. We can define the narrow sense heritability as before h2 = VA/VP = VA/(VG + VE ), which is the proportion of phenotypic variance due to additive genetic variance. We can also define the total proportion of the phenotypic variance due to genetic differences among individuals, as the broad-sense heritability H2 = VG/(VG + VE).

    Behaviorism_1.gif
    Phenotypic covariance be- tween some pairs of relatives, include the dominance variation. ∗ Assuming this is the only relationship the pair of individuals share (above that expected from randomly sampling individuals from the population).

    The additive and dominance variance can be estimated by the re- semblance among relatives. When dominance is present in the loci influencing our trait (VD > 0), we need to modify our phenotype covariance among relatives to account for this non-additivity. Specifi- cally, our equation for the covariance among a general pair of relatives (eqn. 7.17 for additive variation) becomes

    \[\]

    where r2 is the probability that the pair of individuals share 2 alleles identical by descent, making the same assumptions (other than additivity) that we made in deriving eqn. 7.17. In table 7.1 we show the phenotypic covariance for some common pairs of relatives. Importantly, in the presence of dominance variance, the regression of offspring phenotype on parental midpoint still has a slope VA/VP , as a parent and offspring share precisely one of their autosomal alleles IBD but never their genotype IBD (assuming no inbreeding).

    Full sibs and parent-offspring have the same covariance if there is no dominance variance (as they have the same kinship coefficient F1,2). However, when dominance effects are present (VD > 0), full-sibs resemble each other more than parent-offspring pairs. That’s because full-sibs can share both alleles (i.e. the full genotype at a locus) identical by descent. We can attempt to estimate VD by comparing different sets of relationships. For example, non-identical twins (full sibs born at same time) should have 1/2 the phenotypic covariance of identical twins if VD = 0. Therefore, we can attempt to estimate VD by look- ing at whether identical twins have more than twice the phenotypic covariance than non-identical twins.

    The most important aspect of this discussion for thinking about evolutionary genetics is that the parent-offspring covariance is still only a function of VA. This is because our parent (e.g. the mother) transmits only a single allele, at each locus, to its offspring. The other allele the offspring receives is random (assuming random mating), as
    it comes from the other unrelated parent (the father). Therefore, the average effect on the child’s phenotype of an allele the child receives from their mother is averaged over all possible random alleles the child could receive from their father (weighted by their frequency in the population). Thus we only care about the additive effect of the allele, as parents transmit only alleles (not genotypes) to their offspring. This means that the short-term response to selection, as described by the breeder’s equation, depends only on VA and the additive effect of alleles. Therefore, if we can estimate the narrow-sense heritability we can predict the short-term response.

    While our VA predicts the short term response to selection, if alleles display dominance, our value of VA will change as alleles at our loci change in frequency. For, example as dominant alleles become common in the population their contribution to VA decreases, we can see this in Figure 7.20 our rare dominant allele (bottom left) contributes to the additive variance far more than when it is at high frequency (bottom right). So if selection favours higher values of our trait, the response to selection will push the dominant allele to higher frequency decreasing VA. Therefore, if there is dominance our value of VA will not be constant across generations.

    Up to this point we have only considered dominance and not epistasis. However, we can include epistasis in a similar manner (for ex- ample among pairs of loci). This gets a little tricky to think about, so we will only briefly explain it. We can first estimate the additive effect of the alleles by considering the effect of the alleles averaging over their possible genetic backgrounds (including the other interact- ing alleles they are possibly paired with), just as before. We can then calculate the additive genetic variance from this. We can estimate the dominance variance, by calculating the residual variance among genotypes at a locus unexplained by the additive effect of the loci. We can then estimate the epistatic variance by estimating the residual vari- ance left unexplained among the two locus genotypes after accounting for the additive and dominant deviations calculated from each locus separately. In practice these high variance components are hard to estimate, and usually small as much of our variance is assigned to the additive effect. Again we would find that we mostly care about VA for predicting short-term evolution, but that the contribution of loci to the additive genetic variance will depend on the epistatic relationships among loci.

    Exercise \(\PageIndex{4}\)

    How could you use 1/2 sibs vs. full-sibs to estimate VD? Why might this be difficult in practice? Why are identical vs. non- identical twins better suited for this?

    Summary

    • A key concern of quantitative genetics is how phenotypic variation within populations is partitioned into environmental and genetic components of the variance.
    • The additive genetic variance is The proportion of phenotypic variance is the narrow sense heritability h2 = VA/VP . These quantities are both measurements of the contribution of the current standing genetic variation in a particular set of environments and should not be thought of as fixed quantities of the population or trait.
    • We can estimate the additive genetic variance and the heritability by using the resemblance of relatives, if we can experimentally remove or statistically partition out the effect if the shared environment among relatives.
    • The genetic basis of variation in traits can genetically covary due to pleiotropy, assortative mating, and linkage. We can estimate the genetic covariance between traits by using the covariance in different traits among relatives.
    • Alleles with dominance and epistatic effects can and do contribute to VA to the extent to which transmitting an additional copy of the allele to an offspring changes their expected phenotype. These alleles and combinations of alleles also contribute to higher order genetic variance components, the dominance and epistatic covariance.
    • The magnitude of the additive, dominance, and epistatic genetic variance can change as allele frequency change and recombination changes the context in which alleles are expressed.
    Exercise \(\PageIndex{4}\)

    The additive genetic variance for leg length on mice is 10mm2.

    What is the expected covariance of mice who are first cousins?

    Exercise \(\PageIndex{5}\)

    Can you construct a case where VA = 0 and VD > 0? You need just describe it qualitatively; you don’t need to work out the math. (tricker question).


    This page titled 7: Phenotypic Variation and the Resemblance Between Relatives is shared under a CC BY 3.0 license and was authored, remixed, and/or curated by Graham Coop via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.