Journal article, Molecular Biology and Evolution, Year: 2012

Inferring Population Histories Using Genome-Wide Allele Frequency Data

Abstract

The recent development of high-throughput genotyping technologies has revolutionized the collection of data in a wide range of both model and nonmodel species. These data generally contain a large amount of information about the demographic history of populations. In this study, we introduce a new method to estimate divergence times on a diffusion time scale from large single-nucleotide polymorphism (SNP) data sets, conditional on a population history that is represented as a tree. We further assume that all the observed polymorphisms originate from the most ancestral (root) population; that is, we neglect mutations that occur after the split of the most ancestral population. This method relies on a hierarchical Bayesian model, based on Kimura's time-dependent diffusion approximation of genetic drift. We implemented a Metropolis-Hastings within Gibbs sampler to estimate the posterior distribution of the parameters of interest in this model, which we refer to as the Kimura model. Evaluating the Kimura model on simulated population histories, we found that it provides accurate estimates of divergence time. Assessing model fit using the deviance information criterion (DIC) proved effective for retrieving the correct tree topology among a set of competing histories. We show that this procedure is robust to low-to-moderate gene flow, as well as to ascertainment bias, provided that the most distantly related populations are represented in the discovery panel. As an illustrative example, we finally analyzed published human data consisting of genotypes for 452,198 SNPs from individuals belonging to four populations worldwide. Our results suggest that the Kimura model may help characterize the demographic history of differentiated populations, using genome-wide allele frequency data.
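The notion of a diffusion time scale used in the abstract can be illustrated with a small simulation. The sketch below is not the authors' Kimura model or their sampler. It assumes the common convention tau = t / (2N) for t generations in a diploid population of size N, and it compares the standard pure-drift approximation of the allele frequency variance, x0 (1 - x0) (1 - exp(-tau)), with a plain Wright-Fisher simulation. All function names and parameter values are hypothetical and chosen only for illustration.

```python
import math
import random


def drift_variance(x0, tau):
    """Approximate variance of an allele frequency after drift of length tau.

    Standard pure-drift result: Var = x0 * (1 - x0) * (1 - exp(-tau)),
    with tau = t / (2N) (assumed convention, diploid population of size N).
    """
    return x0 * (1.0 - x0) * (1.0 - math.exp(-tau))


def wright_fisher(x0, n_copies, generations, rng):
    """Return the allele frequency after `generations` of pure drift.

    n_copies is the number of gene copies (2N for a diploid population).
    """
    x = x0
    for _ in range(generations):
        # Binomial resampling of n_copies gene copies at frequency x.
        count = sum(1 for _ in range(n_copies) if rng.random() < x)
        x = count / n_copies
    return x


def simulated_variance(x0, n_copies, generations, replicates, seed=0):
    """Sample variance of the final frequency over independent replicates."""
    rng = random.Random(seed)
    finals = [wright_fisher(x0, n_copies, generations, rng)
              for _ in range(replicates)]
    mean = sum(finals) / replicates
    return sum((f - mean) ** 2 for f in finals) / (replicates - 1)


if __name__ == "__main__":
    n_copies, generations = 50, 10          # hypothetical values
    tau = generations / n_copies            # diffusion time, t / (2N)
    print("tau:", tau)
    print("approximate variance:", drift_variance(0.5, tau))
    print("simulated variance:", simulated_variance(0.5, n_copies, generations, 2000))
```

Because the amount of drift depends on t and N only through their ratio, allele frequency data of this kind carry information about tau rather than about the number of generations or the population size separately. This is why divergence times are reported on the diffusion time scale.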
Main file
mss257.pdf (677.05 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-00783026, version 1 (14-06-2021)

License

Attribution


Cite

Mathieu Gautier, Renaud Vitalis. Inferring Population Histories Using Genome-Wide Allele Frequency Data. Molecular Biology and Evolution, 2012, 30 (3), pp.654-668. ⟨10.1093/molbev/mss257⟩. ⟨hal-00783026⟩