13.4: ML and Bayesian Tests for State-Dependent Diversification

Last updated
Save as PDF

Page ID: 21659

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

Now that we can calculate the likelihood for state-dependent diversification models, formulating ML and Bayesian tests follows the same pattern we have encountered before. For ML, some comparisons are nested and so you can use likelihood ratio tests. For example, we can compare the full BiSSe model (Maddison et al. 2007), with parameters q₀₁, q₁₀, λ₀, λ₁, μ₀, μ₁ with a restricted model with parameters q₀₁, q₁₀, λ_all, μ_all. Since the restricted model is a special case of the full model where λ₀ = λ₁ = λ_all and μ₀ = μ₁ = μ_all, we can compare the two using a likelihood ratio test, as described earlier in the book. Alternatively, we can compare a series of BiSSe-type models by comparing their AIC_c scores.

Figure 13.4. Data from Goldberg and Igic (2012) showing presence (red) and absence (black) of self-incomatibility among Solanaceae. Branches colored using stochastic character mapping under a model with distinct forwards and backwards rates; these reconstructions are biased if characters affect diversification rates. Image by the author, can be reused under a CC-BY-4.0 license.

For example, I will apply this approach to the example of self-incompitability. I will use data from Goldberg and Igic (2012), who provide a phylogenetic tree and data for 356 species of Solanaceae. All species were classified as having any form of self incompatibility, even if the state is variable among populations. The data, along with a stochastic character map of state changes, are shown in Figure 13.4. Applying the BiSSe models to these data and assuming that q₀₁ ≠ q₁₀, we obtain the following results:

Model	Number of parameters	Parameter estimates	lnL	AIC
Character-independent model	4	λ = 0.65, μ = 0.57	-945.96	1899.9
		q₀₁ = 0.16, q₁₀ = 0.09
Speciation rate depends on character	5	μ = 0.57	-945.57	1901.1
		λ₀ = 0.69, λ₁ = 0.63
		q₀₁ = 0.17, q₁₀ = 0.08
Extinction rate depends on character	5	λ = 0.65	-943.93	1897.9
		μ₀ = 0.45, μ₁ = 0.67
		q₀₁ = 0.22, q₁₀ = 0.06
Full character-dependent model	6	λ₀ = 0.49, λ₁ = 0.79	-941.94	1895.9
		μ₀ = 0.20, μ₁ = 0.84
		q₀₁ = 0.29, q₁₀ = 0.05

From this, we conclude that models where the character influences diversification fit best, with the full model receiving the most support. We can't discount the possibility that the character only influences extinction and not speciation, since that model is within 2 AIC units of the best model.

Alternatively, we can carry out a Bayesian test for state-dependent diversification.

Bayesian test for state-dependent diversification

Like other models in the book, this requires setting up an MCMC algorithm that samples the posterior distributions of our model parameters (FitzJohn 2012). In this case:

Sample a set of starting parameter values, q₀₁, q₁₀, λ₀, λ₁, μ₀, μ₁, from their prior distributions. For example, one could set prior distribution for all parameters as exponential with a mean and variance of λ_{prior_i} (note that, as usual, the choice for this parameter should depend on the units of tree branch lengths you are using). We then select starting values for all parameters from the prior.
Given the current parameter values, select new proposed parameter values using the proposal density Q(p′|p). For all parameter values, we can use a uniform proposal density with width w_p, so that Q(p′|p) U(p − w_p/2, p + w_p/2). We can either choose all parameter values simultaneously, or one at a time (the latter is typically more effective).
Calculate three ratios:
- The prior odds ratio. This is the ratio of the probability of drawing the parameter values p and p′ from the prior. Since we have exponential priors for all parameters, we can calculate this ratio as: \[ R_{prior} = \frac{\lambda_{prior_i} e^{-\lambda_{prior_i} p'}}{\lambda_{prior_i} e^{-\lambda_{prior_i} p}}=e^{\lambda_{prior_i} (p-p')} \label{13.11}\[
- The proposal density ratio. This is the ratio of probability of proposals going from p to p′ and the reverse. We have already declared a symmetrical proposal density, so that Q(p′|p)=Q(p|p′) and R_proposal = 1.
- The likelihood ratio. This is the ratio of probabilities of the data given the two different parameter values. We can calculate these probabilities from the approach described in the previous section.
Find R_accept as the product of the prior odds, proposal density ratio, and the likelihood ratio. In this case, the proposal density ratio is 1, so (eq. 13.12):
R_accept = R_prior ⋅ R_likelihood