5.3: Correlation Between Relatives
- Page ID
- 143500
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\dsum}{\displaystyle\sum\limits} \)
\( \newcommand{\dint}{\displaystyle\int\limits} \)
\( \newcommand{\dlim}{\displaystyle\lim\limits} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\(\newcommand{\longvect}{\overrightarrow}\)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)- \( \sigma^{2}_{D} \), the dominance effects from some alleles being dominant over others
- \( \sigma^{2}_{E} \), the epistatic effects that result from interactions among alleles at different loci.
Think back to "heritability" in the sense of "the reason the members of a family resemble eachother." However, dominance isn't inherited from parents to offspring -- it's a result of cell biology. Neither is epistasis. What part of genotypic variance is responsible for the fact that complex traits are heritable?
Alleles!
Thus, narrow sense heritability is the proportion of total genotypic variance due to shared alleles:
\[h^{2} = \frac{\sigma^{2}_{A}}{\sigma^{2}_{p}} \]
And given this, it makes sense that we would estimate narrow-sense heritability by comparing individuals that share alleles: parents and offspring.
Estimating Narrow-Sense Heritability using Parent-Offspring Regression
Narrow-sense heritability can be estimated by comparing related individuals (siblings, parents and offspring, cousins, etc) and taking into account the proportion of alleles shared by the individuals. This analysis is based on several strong assumptions:
- The particular character has diploid, Mendelian inheritance.
- There is no linkage among loci controlling the character of interest, or the population is in linkage equilibrium.
- The population is random mated.
- Parents are not inbred.
- There is no environmental correlation between the performance of parents and progeny (to avoid violating this last assumption, randomize parents and progeny within replications; i.e., do not test them in the same plot).
To do so, we employ a method called linear regression.
Several analytical procedures are commonly used to sort out the sources of variation in the sample, to determine the relationship among factors contributing to the variability, and to estimate the heritability of the character.
Regression — this procedure examines the strength of the relationship between factors or the influence one factor has on another. The linear regression procedure fits a straight line to a scatterplot of data points. The general equation for the regression line is:
\[y = a + bx\]
where:
- y = response or dependent variable
- x = predictor or independent variable
- a = y-intercept of the line
- b = regression coefficient, the slope of the line

The regression line always passes through the point (
). Relative to the total spread of the data, if most of the datapoints lie on or very near the line, there is a strong relationship between the predictor and response variables—x has a strong influence on y. In contrast, the fewer the points that fall on or near the line, the less influence x has on y. A cautionary note: although the x variable may have strong influence on the y variable, x may not be the cause of the y response, nor the sole factor influencing y.
Regression can be used to assess the relative effect of environment on phenotypic value or to obtain information about gene action. The relative scatter about the regression line of a plot of genotype (the predictor variable, x) against phenotypic value (the response variable, y), provides information about gene action. For example, regression analysis of the following two examples suggests that the gene action in example 1 (left panel) is additive (no dominance), whereas there is a complete dominance (by the A2 allele) in the case of example 2 (right panel) (Fehr, 1987).
When the heterozygous genotype has a value midway between the two homozygotes and thus all three genotypic values fall on the linear regression line the only gene action contributing to the phenotype is additive.
Regression analysis of phenotypic values of progeny (y) against parents (x) provides useful information about the degree of similarity of progeny to the parents.
Computing narrow-sense heritability requires that first compute b, the regression coefficient, using the following formula:
\[b = \frac{\Sigma(X - \bar{X})(Y - \bar{Y})}{\Sigma (X - \bar{X})^{2}}\]
where:
- b = regression coefficient
- X = population 1 values
- Y = population 2 values
- \( \bar{X} \) = the mean of the population 1 values
- \( \bar{Y} \) = the mean of the population 2 values
Populations 1 and 2 are groups of individuals with some genetic relationship -- parents and offspring, siblings, etc. Thus, X is the independent variable, and Y is the response or dependent variable.
Regression Coefficient and Narrow-Sense Heritability
What does the regression coefficient, b, tell us?
If b = 1, then
- gene action is completely additive,
- negligible environmental effect,
- and negligible experimental error.
The smaller the value of b, the less closely the population 2 resemble their population 1, indicating
- greater environmental influence on the character,
- greater dominance and/or epistatic effects, and/or
- greater experimental error.
In general, then,
\[ h^2 = b / a \]
where
- \( h^2 \) is the narrow-sense heritability of a trait
- \( b \) is the regression coefficient, and
- \( a \) is the proportion of alleles shared between population 1 and population 2.
Examples are given below:
Parents and Offspring
An offspring has half of a parents' alleles. Thus,
\[ h^2 = b / 0.5 = 2 * b \]
Half-siblings
Half-siblings share one parent and has half of that parent's alleles -- so half-siblings on average share 0.25 of eachother's alleles, and
\[h^{2} = b / 0.25 = 4 * b \]
Monozygotic twins
Monozygotic twins are genetically identical -- they not only share alleles but dominance and epistatic relationships as well. As such, comparing monozygotic twins measures the broad sense heritability of a trait:
\[H^{2} = b \]
References
Barbour, M.G, J.H. Burk, F.S. Gilliam, W.D. Pitts and M.W. Schwartz. 1999. Terrestrial Plant Ecology. 3rd edition. Benjamin Cummings, San Francisco, CA.
Clausen, J., D.D. Keck, and W. Hiesey. 1940. Experimental studies on the nature of species. I. Effects of varied environments on western North American plants. Carnegie Inst. Wash. Publ. 520.
Clausen, J., D. D. Keck, and W. M. Hiesey. 1948. Experimental studies on the nature of species. III. environmental responses of climactic races of Achillea. Carnegie Inst. Wash. Publi. 581.
Conner, J.K. and D.L. Hartl. 2004. A Primer of Ecological Genetics. Sinauer Associates, Sunderland, MA.
Falconer, D.S., and T.F.C. Mackay. 1996. Introduction to Quantitative Genetics. 4th edition. Longman Publ. Group, San Francisco, CA.
Hallauer, A. R. and J. B. Miranda. 1988. Quantitative Genetics in Maize Breeding. 2nd Edition. Iowa State University Press, Ames, IA.
Hill, W.G. 2005. A century of corn selection. Science 307: 683-684.
Pierce, B. A. 2008. Genetics: A Conceptual Approach. 3rd edition. W.H. Freeman, New York.
Rausher, M.D.. 2005. Example of Clausen, Keck, and Hiesey Experiment, Lecture 1-Lineages, Populations, and Genetic Variation. Online lecture notes from course on Principles of Evolution.
Department of Crop Science, University of Illinois at Urbana-Champaign. Values obtained for protein in the strains selected for oil and the values for oil obtained for the strains selected for protein each generation (1896-2004). 2007

