SMILE

Stochastic Models for the Inference of Life Evolution

Splitting trees with neutral Poissonian mutations II: Largest and oldest families

Champagnat, N., Lambert, A.

Stochastic Processes and their Applications

2013

We consider a supercritical branching population, where individuals have i.i.d. lifetime durations (which are not necessarily exponentially distributed) and give birth (singly) at constant rate. We assume that individuals independently experience neutral mutations, at constant rate \$\backslashtheta\$ during their lifetimes, under the infinite-alleles assumption: each mutation instantaneously confers a brand new type, called allele or haplotype, to its carrier. The type carried by a mother at the time when she gives birth is transmitted to the newborn. We are interested in the sizes and ages at time \$t\$ of the clonal families carrying the most abundant alleles or the oldest ones, as \$t\backslashto\backslashinfty\$, on the survival event. Intuitively, the results must depend on how the mutation rate \$\backslashtheta\$ and the Malthusian parameter \$\backslashalpha>0\$ compare. Hereafter, \$N\backslashequiv N\_t\$ is the population size at time \$t\$, constants \$a,c\$ are scaling constants, whereas \$k,k'\$ are explicit positive constants which depend on the parameters of the model. When \$\backslashalpha>\backslashtheta\$, the most abundant families are also the oldest ones, they have size \$cN^\1-\backslashtheta/\backslashalpha\\$ and age \$t-a\$. When \$\backslashalpha<\backslashtheta\$, the oldest families have age \$(\backslashalpha /\backslashtheta)t+a\$ and tight sizes; the most abundant families have sizes \$k\backslashlog(N)-k'\backslashlog\backslashlog(N)+c\$ and all have age \$(\backslashtheta-\backslashalpha)^\-1\\backslashlog(t)\$. When \$\backslashalpha=\backslashtheta\$, the oldest families have age \$kt-k'\backslashlog(t)+a\$ and tight sizes; the most abundant families have sizes \$(k\backslashlog(N)-k'\backslashlog\backslashlog(N)+c)^2\$ and all have age \$t/2\$. Those informal results can be stated rigorously in expectation. Relying heavily on the theory of coalescent point processes, we are also able, when \$\backslashalpha\backslashleq\backslashtheta\$, to show convergence in distribution of the joint, properly scaled ages and sizes of the most abundant/oldest families and to specify the limits as some explicit Cox processes.

Bibtex

@article{champagnat_splitting_2013,
Author = {Champagnat, Nicolas and Lambert, Amaury},
Title = {Splitting trees with neutral {Poissonian} mutations
{II}: {Largest} and oldest families},
Journal = {Stochastic Processes and their Applications},
Volume = {123},
Number = {4},
Pages = {1368--1414},
abstract = {We consider a supercritical branching population,
where individuals have i.i.d. lifetime durations (which
are not necessarily exponentially distributed) and give
birth (singly) at constant rate. We assume that
individuals independently experience neutral mutations,
at constant rate \$\backslash{}theta\$ during their
lifetimes, under the infinite-alleles assumption: each
mutation instantaneously confers a brand new type,
called allele or haplotype, to its carrier. The type
carried by a mother at the time when she gives birth is
transmitted to the newborn. We are interested in the
sizes and ages at time \$t\$ of the clonal families
carrying the most abundant alleles or the oldest ones,
as \$t\backslash{}to\backslash{}infty\$, on the
survival event. Intuitively, the results must depend on
how the mutation rate \$\backslash{}theta\$ and the
Malthusian parameter \$\backslash{}alpha>0\$ compare.
Hereafter, \$N\backslash{}equiv N\_t\$ is the
population size at time \$t\$, constants \$a,c\$ are
scaling constants, whereas \$k,k'\$ are explicit
positive constants which depend on the parameters of
the model. When
\$\backslash{}alpha>\backslash{}theta\$, the most
abundant families are also the oldest ones, they have
size \$cN^\{1-\backslash{}theta/\backslash{}alpha\}\$
and age \$t-a\$. When
\$\backslash{}alpha<\backslash{}theta\$, the oldest
families have age \$(\backslash{}alpha
/\backslash{}theta)t+a\$ and tight sizes; the most
abundant families have sizes
\$k\backslash{}log(N)-k'\backslash{}log\backslash{}log(N)+c\$
and all have age
\$(\backslash{}theta-\backslash{}alpha)^\{-1\}\backslash{}log(t)\$.
When \$\backslash{}alpha=\backslash{}theta\$, the
oldest families have age \$kt-k'\backslash{}log(t)+a\$
and tight sizes; the most abundant families have sizes
\$(k\backslash{}log(N)-k'\backslash{}log\backslash{}log(N)+c)^2\$
and all have age \$t/2\$. Those informal results can be
stated rigorously in expectation. Relying heavily on
the theory of coalescent point processes, we are also
able, when
\$\backslash{}alpha\backslash{}leq\backslash{}theta\$,
to show convergence in distribution of the joint,
properly scaled ages and sizes of the most
abundant/oldest families and to specify the limits as
some explicit Cox processes.},
doi = {10.1016/j.spa.2012.11.013},
issn = {03044149},
language = {en},
month = apr,
shorttitle = {Splitting trees with neutral {Poissonian} mutations
{II}},
url = {http://linkinghub.elsevier.com/retrieve/pii/S0304414912002530},
urldate = {2014-08-31},
year = 2013
}

Link to the article

Accéder à l'article grâce à son DOI.