Mutation Rates from Genome Resequencing | ||||||||
Changed: | ||||||||
< < | Motivation: You have re-sequenced several genomes after a mutation accumulation or adaptive evolution experiment. How do you infer the rates of different types of mutation rates from these data? What are the 95% confidence intervals on these values? | |||||||
> > | Motivation: You have re-sequenced several genomes after a mutation accumulation or adaptive evolution experiment. How do you infer the rates of different types of mutations from these data? What are the 95% confidence intervals on these values? | |||||||
Changed: | ||||||||
< < | Case 1: Single-base substitutions | |||||||
> > | Case 1: Mutations with many identical sites | |||||||
Changed: | ||||||||
< < | Assumptions: The number of mutations is small compared to the number of sites. | |||||||
> > | Assumptions: | |||||||
Added: | ||||||||
> > |
| |||||||
Changed: | ||||||||
< < | If you restrict your data to one genome per experimental population, then you can calculate the 95% confidence limits by assuming this is a Poisson process (poisson.test in R). | |||||||
> > | Example: Single-base substitutions | |||||||
Changed: | ||||||||
< < | If you take multiple genomes from one experimental population, this is a type of pseudo-replication (they may have a shared evolutionary history). This makes calculating the 95% confidence intervals more complicated. | |||||||
> > | Calculation: | |||||||
Added: | ||||||||
> > |
| |||||||
Case 2: One-time mutations | ||||||||
Changed: | ||||||||
< < | Assumptions: A mutation can only happen once per genome. | |||||||
> > | Assumptions: | |||||||
Added: | ||||||||
> > |
| |||||||
Changed: | ||||||||
< < | Example: Deletion of a chromosomal region. Once deleted, it can never be deleted again. | |||||||
> > | Example: Deletion of an unstable chromosomal region. Once deleted, it can never be deleted again. | |||||||
Changed: | ||||||||
< < | This is a type of "survival analysis". You can calculate the fraction of genomes that have and do not have your mutation. Then consider this a binomial process, to calculate a 95% confidence interval Then, convert this to a per-generation rate by dividing by the number of mutations. | |||||||
> > | Calculation: | |||||||
Added: | ||||||||
> > |
| |||||||
Added: | ||||||||
> > | Exact binomial test
data: n - m and n
number of successes = 7, number of trials = 12, p-value = 0.7744
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.2766697 0.8483478
sample estimates:
probability of success
0.5833333
Issues: Pseudo-replicationIssues: Different mutation rates in different lineages |