|
![]() |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Help Documentation
Current document |
|
Nucleotide Models: Overview
SIMMAP implements the following nucleotide substitution models and their sub-variants,
|
|
Configuring the nucleotide substitution model
SIMMAP allows users to perform a molecular character history analysis using fixed model parameters - an empirical Bayesian analysis - or using samples taken from the posterior distribution - a hierarchical Bayesian analysis. To configure the model select Set Model in the Models main menu. This will open the Model window which has three views (each are shown below). The tabs allow configuring different aspects of the model: substitution rate parameters, character frequencies, and rate variation. Substitution parameters Three susbtitution types (nst) are available: 1, 2, and 6. This does not mean that you can not configure the model to all possible subvariants of the basic GTR model (203 exist) as will be described.When setting Nst = 1 this corresponds, depending on how the character frequencies are dealt with, to the Jukes-Cantor (1969) [equal frequency] or Felsenstein's (1981) [unequal frequency] model. When Nst = 2 is selected the user can opt to Fix Kappa, using the text field to the right, or Use Kappa from file, or Estimate Kappa (from GTR). These settings refer to Kimura's 2-parameter (K2P) model, or the Hasegawa-Kishino-Yano (HKY85) / Felsenstein's 1984 (F84) model. Which is implemented depends on the choice of how nucleotide frequencies are treated (equal frequency = K2P; unequal frequency = HKY85/F84). When Nst = 6 is selected the user can opt to Fix GTR rates or GTR rates from file. If you opt to fix the rates then values can be entered in the appropriate text fields (e.g., AG; row A, column G). This allows the user to enter, say the maximum likelihood estimates, values directly without using a parameter file. This is equivalant to an empirical Bayesian analysis. On the other hand if the values are read from a file the user can customize the model to any of the possible GTR sub-variant models. The parameter file will require a column for each of the rates but these can be set to a value of 1 for specific rates that should be equal. If these are samples from the posterior then this is equivalant to a hierarchical Bayesian analysis. For more information on using or how to use model parameters to perform a particular type of analysis see the help documents on molecular model parameters. Character frequencies Nucleotide frequencies can be dealt with in four ways: Equal frequencies, Empirical frequencies, Estimated frequencies (if present), or Fix frequencies. Equal frequencies, as it says, sets each nucleotide frequency to being equal (0.25) as in the Jukes-Cantor (1969) or Kimura 2-parameter (K2P) models. Empirical frequencies are calculated from the data matrix. Estmated frequencies are read from the parameter file if present in the file. These can be samples from the posterior (e.g., from MrBayes) or maximum likelihood estimates (e.g., from PAUP*) or any program desired. For information on setting model parameters from a file and linking parameters and trees see the help page on molecular substitution model parameters. If using point estimates (e.g., maximum likelihood estimates from PAUP*) the values can be entered manually using the Fix frequencies option or using a file containing the parameter values. Rate variation SIMMAP implements four different rate variation models: (1) a Discrete gamma distribution, (2) an Invariable sites model, (3) a Discrete gamma + Invariable sites model, and (4) a Site-specific rates model. The interface for setting these options is shown to the right. This view can accessed by selecting the pop-up button at the top of the Model window.The Discrete gamma has three options available depending on whether a parameter file has been opened.
The Site-specific rates has three options available depending on whether a parameter file has been opened. (Note: Currently, SIMMAP supports site-specific rates only for protein coding data, i.e., codon positions.)
|
|
Morphology Models: Overview
SIMMAP implements the Mk class of models for morphological or standard characters (Lewis 2001). When k=2 SIMMAP uses a symmetrical Beta prior in sampling character histories. If k > 2 then the bias parameter is 1/k. The morphology model also includes a prior on the overall evolutionary rate. SIMMAP uses a gamma prior. Both the bias and overall rate priors can be implemented as sampling from a distribution or from fixed values. For more details on morphology priors and configuring them see the help documents on morphology priors. |
|
|
|
|
Page Last Updated: 6 August 2008 |