- Research article
- Open Access
Exploring the relationship between lifestyles, diets and genetic adaptations in humans
© Valente et al.; licensee BioMed Central. 2015
- Received: 4 September 2014
- Accepted: 30 April 2015
- Published: 28 May 2015
One of the most important dietary shifts underwent by human populations began to occur in the Neolithic, during which new modes of subsistence emerged and new nutrients were introduced in diets. This change might have worked as a selective pressure over the metabolic pathways involved in the breakdown of substances extracted from food. Here we applied a candidate gene approach to investigate whether in populations with different modes of subsistence, diet-related genetic adaptations could be identified in the genes AGXT, PLRP2, MTRR, NAT2 and CYP3A5.
At CYP3A5, strong signatures of positive selection were detected, though not connected to any dietary variable, but instead to an environmental factor associated with the Tropic of Cancer. Suggestive signals of adaptions that could indeed be connected with differences in dietary habits of populations were only found for PLRP2 and NAT2. Contrarily, the demographic history of human populations seemed enough to explain patterns of diversity at AGXT and MTRR, once both conformed the evolutionary expectations under selective neutrality.
Accumulated evidence indicates that CYP3A5 has been under adaptive evolution during the history of human populations. PLRP2 and NAT2 also appear to have been modelled by some selective constrains, although clear support for that did not resist to a genome wide perspective. It is still necessary to clarify which were the biological mechanisms and the environmental factors involved as well as their interactions, to understand the nature and strength of the selective pressures that contributed to shape current patterns of genetic diversity at those loci.
- Diet adaptations
- Signals of natural selection
- Africa Sub-Saharan
The most remarkable dietary change over the recent history of human populations was that associated with the change from food collection to food production , which occurred independently and in different times in separate parts of the world marking the beginning of the Neolithic, a transition that in some regions dates back to 12,000 years ago. The domestication of plants and animals prompted the conditions that would brought about new modes of subsistence as well as new food habits as a consequence of the shift in the availability and exploitation of dietary resources [1, 2]. Genetic adaptations to dietary specializations are thought to have represented advantageous evolutionary solutions in humans, however it is still unclear the extent to which dietary factors have created selective pressures acting on genes that play roles in food-related metabolic pathways. Recent studies have revealed genomic signatures of adaptations likely driven by diet-related pressures [1, 3, 4]. In addition, candidate genes approaches had already provided tight evidence for genetic adaptations to differences in nutrient consumption such as at the lactase and amylase genes [5-10].
Other metabolic-related genes have been hypothesized to constitute dietary adaptations, among which are included: AGXT, coding for alanine:glyoxylate aminotransferase, the enzyme responsible for the transamination of glyoxylate into glycine [11-13]; PLRP2, coding for pancreatic lipase-related protein 2, involved in galactolipids hydrolysis, [14-17]; MTRR, encoding for methionine synthase reductase, an enzyme acting in the complex folate pathway [15, 18]; NAT2 coding for N-acetyltransferase 2, a phase-II enzyme involved in the detoxification of a wide number of xenobiotics [15, 19-23]; and CYP3A5, coding for cytochrome P-450 3A5, a member of the CYP3A enzymes that are involved in the oxidative metabolism of many endogenous substrates and xenobiotics, which is implied in sodium homeostasis [24-27].
Genetic variation in AGXT was tentatively linked with meat content in diets, PLRP2 with richness in cereals , both MTRR and NAT2 with availability of folate in foods and CYP3A5 with health conditions that are influenced by dietary salt intake [24, 27]. However, for these 5 genes results so far obtained were either contradictory (e.g. AGXT), or not yet replicated (e.g. MTRR and PLRP2), or not clear enough to ascertain whether they can indeed represent genetic adaptations to any dietary variable. This prompted us to address the issue applying of genetic adaptation within those genes.
Thus, assuming that current modes of subsistence are still good surrogates of main diets in which populations have traditionally relied, the aim of this study was to gain further insights into the relationship between diet-related variables in populations and patterns of diversity at variations in above mentioned five genes.
Functional variants within AGXT, PLRP2, MTRR, NAT2 and CYP3A5 were examined in six sub-Saharan populations with distinct modes of subsistence and also in one European population that was also screened to generate a non-African reference group. Results were then combined with previously published information for other African and Eurasian populations to evaluate the contribution of geography and mode of subsistence or other diet-related variables to explain the patterns of genetic diversity observed for the five genes.
Locus by locus analysis
Derived allele frequencies
c.32C > T (AGXT)
c.1074G > A (PLRP2)
c.1130A > G (MTRR)
c.191G > A (NAT2*14)
c.341 T > C (NAT2*5)
c.590G > A (NAT2*6)
c.857G > A (NAT2*7)
c.219-237G > A (CYP3A5)
0.0000 ± 0.0000
0.3261 ± 0.0691
0.5294 ± 0.0856
0.1522 ± 0.0530
0.2046 ± 0.0748
0.3636 ± 0.0725
0.0000 ± 0.0000
0.2400 ± 0.0604
0.0482 ± 0.0166
0.3214 ± 0.0360
0.3563 ± 0.0363
0.0977 ± 0.0225
0.3588 ± 0.0536
0.1786 ± 0.0296
0.0233 ± 0.0115
0.1429 ± 0.0270
0.0370 ± 0.0257
0.2333 ± 0.0546
0.5500 ± 0.0642
0.1429 ± 0.0540
0.2500 ± 0.0884
0.2857 ± 0.0697
0.0000 ± 0.0000
0.1167 ± 0.0414
0.0727 ± 0.01751
0.3945 ± 0.0331
0.3835 ± 0.0339
0.0699 ± 0.0187
0.3902 ± 0.0575
0.3085 ± 0.0337
0.0055 ± 0.0055
0.2336 ± 0.0289
0.0147 ± 0.0146
0.18912 ± 0.0455
0.3846 ± 0.0551
0.0263 ± 0.0184
0.1842 ± 0.0536
0.2568 ± 0.0508
0.0000 ± 0.0000
0.1447 ± 0.0404
0.0000 ± 0.0000
0.0242 ± 0.0138
0.1371 ± 0.0309
0.0000 ± 0.0000
0.0656 ± 0.0239
0.0484 ± 0.0193
0.0968 ± 0.0266
0.2097 ± 0.0366
0.1915 ± 0.0406
0.5106 ± 0.0516
0.1383 ± 0.0356
0.0000 ± 0.0000
0.5000 ± 0.0903
0.2021 ± 0.0414
0.0532 ± 0.0232
0.9022 ± 0.03010
In the AGXT gene, we studied the variant c.32C > T, concerning which the derived allele T had been previously suggested to play an adaptive role in populations traditionally relying in meat-rich diets [11, 28]. The hypothesis was specifically investigated by Caldwell et al.  who reported on frequency data sustaining the model, a conclusion for which much accounted the observation of the highest frequency of the derived allele in the Sweden Saami, who have a long history of consuming high amounts of animal products [11, 28]. Though, later, revisiting the question with a better coverage of Central Asian populations Ségurel et al.  failed to find increased allele frequencies across populations with diets richer in meat comparatively to those less meat rich, challenging this way the adaptive model proposed for the variation.
In this study, in terms of meat content in diets of African populations, we have assumed that in general farmers rely less in meat than pastoralists or hunter-gatherers, in accordance with a recent review from ethnographic compilations of hunter-gatherer diets indicating that animal food comprises their dominant energy source . Among the 6 sub-Saharan populations examined, the frequency of the derived allele at c.32C > T ranged from 0 to 7.27 % without showing any pattern of variation that could be connected with mode of subsistence or meat content in diets of populations. For instance, it was absent both from the farmers from Angola and from the hunter-gatherers Khoisan, although the first are representative of less meat consumers groups while the second are from more meat consumers ones. In the sample from Portugal, considered to be a farming population with a mixed diet reasonably balanced regarding animal and plant food resources, the derived allele reached 19.15 %, a frequency higher than registered in any of the African populations regardless of its mode of subsistence or reliance upon meat.
To integrate our results in a more comprehensive distribution, data for c.32C > T was retrieved from the literature on populations for which information on the relative predominance of meat in their diets was available (Additional file 2: Table S2). There were results only for populations from Africa and Eurasia, among which the average frequency of the derived allele was 0.081 across the set of populations assigned to have high meat consumption, while it was, 0.133, across the populations with low-meat consumption. Actually neither the overall differences in allele frequencies within the “low-meat” and “high-meat” groups were statistically significant (P = 0.0710, One-Way ANOVA), nor the trend in the frequency distribution sustained the hypothesis that the allele could be positively selected in meat-rich diet populations.
Furthermore, if the broad geographical distribution of c.32C > T in Africa and Eurasia conformed well the major population clusters commonly identified by random neutral genetic markers, intriguingly in Asia, where there is a high dispersion of gene frequencies, the extreme values were reported for two populations in rather close geographical proximity but with distinct traditional lifestyles: in the Tajiks, a group of sedentary agriculturalists from Western Tajikistan the derived allele was very well represented (26.9 %), whereas in the Kazaks from Western Uzbekistan, who are traditionally nomadic herders whose diet mainly consists of meat, milk and dairy products, the allele only occurred marginally (1.7 %).
From these analyses, no connection emerged between the frequency distribution of c.32C > T in AGXT and lifestyle of populations.
In this gene we focused on c.1074G > A, a variant that causes a premature truncation of the pancreatic lipase-related protein 2 resulting in a more active version of the enzyme. In a very recent genome-wide scan for selection in human populations, Hancock et al.  identified in this variant a convincing signal of adaptation to a dietary specialization, since the derived allele was found to be significantly more common in populations relying in diets with high content in cereals (farmers) than in other populations.
As long as we know, the association was not further investigated except in the present study, where among the screened African groups, the derived allele was detected to be quite common in the three farmers’ groups (23.3 % - 32.6 %) as well as in the herders from Uganda (39.5 %). Comparatively, the two hunter-gatherers groups showed lower frequencies, specially the Ju/hoansi (2.42 %). The sample from Portugal showed the highest frequency in this study with, 51.1 % (Table 1).
As a whole, these results suggest that diversity at PLRP2 was shaped by selective pressures that differed according to populations’ lifestyle.
Within MTRR we examined the common variation affecting levels of enzymatic activity c.1130A > G, since it was another candidate adaptive genetic variation identified in the before mentioned genome-wide study . Before, MTRR had received high attention in association studies, having been implicated, for instance, with risk for spina bifida . However, its adaptive role to dietary specializations was addressed in only one work where c.1130A > G was found to be strongly correlated with diets containing mainly the folate-poor foods roots and tubers . The results obtained in this work revealed that the derived allele was quite common in most African groups, peaking in the agriculturists from Angola and Mozambique with values of 0.529 and 0.550, respectively (Table 1). Both estimates are similar to that described in the Yoruba (0.548) the only African group with a diet principally relying on roots and tubers addressed in a previous study . So, at least in Africa high frequencies of this allele can be found in populations without having such a dietary specialization. Furthermore, no indication arose that the distribution of c.1130A > G could be correlated to the dietary availability in folates, which is generally thought to be lower in non-forager populations (agricultural and pastoral) than in hunter-gatherers . In fact, in the hunter-gatherers Baka, in the herders from Uganda and in the farmers from Equatorial Guinea, the derived allele occurred at similar frequencies (0.385, 0.384, 0.356, respectively) despite the differences in mode of food production. In the hunter-gatherers Ju/honasi from Namibia, the allele occurred at the lowest frequency in Africa (0.137) but with a magnitude similar to that found in the European sample (0.138), considered as a representative of an agriculturalist society (Table 1). To interpret our results under a wide framework of African and Eurasian populations, frequency data were recruited once more from the literature (Additional file 2: Table S2), and the combined information allowed to realize that the distribution of c.1130A > G fitted well the pattern generally provided by neutral markers, not appearing to be influenced by the mode of subsistence or the relative folate content in diets of populations from Eurasia and Africa. In East Asia, for instance, the two highest values of the derive allele were present in the Tu (0.4), nomadic herders, and in the Hezhen (0.333), mainly hunters and fishers, but nonetheless in the foragers Orogen and Yakut, who also live in East Asia, the allele was absent or very rare (Additional file 2: Table S2).
So, for the variation c.1130A > G in MTRR, the current patterns of diversity do not indicates that it could represent an adaptation to the mode of subsistence of human populations.
The dietary availability in folates had also been previously hypothesized to be a modulator of genetic diversity at the gene that encodes for NAT2 (N-acetyltransferase 2) . Individuals can be classified in fast, intermediate or slow acetylator phenotypes, which are determined by the haplotypic composition defined by genetic variations at the NAT2 locus. Evidence for the diet-related hypothesis provided by Luca et al.  was reinforced with the recent findings by the same people , based on a more comprehensive analysis of NAT2 worldwide genetic diversity, that were also compatible with a model holding that the slow acetylator phenotypes were selectively favored in populations relying in dietary regimens with reduced folate supply, whereas the fast acetylators were neutral or even advantageous in the presence of folate-rich diets, as those thought to be fulfilled by hunter-gatherers. To extent the population coverage of previous works, frequencies of NAT2 haplotypes and acetylator phenotypes were also estimated in this study (Additional file 3: Table S3). The distribution of haplotypes was very heterogeneous across African populations, but in line with previous observations the prevalence of the slow acetylator phenotype in the two hunters-gatherers groups (Khoisan, 1.6 %; Baka Pygmies, 13.5 %) was significantly much lower than in the three agriculturalists groups or in the Ugandan pastoralists, all displaying values up to 37.4 % (P = 0.0139, One-Way ANOVA). In the Portuguese the slow acetylator phenotype accounted for the high proportion of 52.2 %, which falls within the range typical from other European populations .
Next, we contrasted our data with other results before published for Eurasian and African populations (Additional file 2: Table S2), confining the analysis to c.590G > A, which defines allele NAT2*6, because it was the variation with more information accumulated for populations representatives of the three modes of subsistence.
In brief, our analyses reinforce previous indications that NAT2 has evolved under a selective factor influenced by human diet.
With regard to CYP3A5, we screened the intronic variation c.219-237G > A, commonly referred to CYP3A5*1/*3 polymorphism, in which the derived allele A results in a premature stop codon that reduces protein expression. It has been firmly demonstrated that the variation possesses a very unusual worldwide distribution whereby the frequency of CYP3A5*3 is significantly correlated with latitude .
CYP3A5*1/*3 likely influences salt and water retention and risk for salt-sensitive hypertension , exerting an effect on blood pressure that is determined by interactions with dietary salt intake [27,31]. Since anthropological evidence indicates that diet of hunting and gathering people is usually characterized by low level of salt intake, being often considered as a surrogate of the preagricultural humans’ diet, lately praised as a model of well balanced food consumption , we asked whether diversity at CYP3A5*1/*3 could be related with diet of populations.
These analyses led to conclude that CYP3A5 was the target of a selective factor determined by the geographic location of human populations.
AMOVA analysis under different criteria
c.32C > T (AGXT)
c.1074G > A (PLRP2)
c.1130A > G (MTRR)
c.590G > A (NAT2)
c.219-237G > A (CYP3A5)
Mode of subsistence
Main diet component
Above/below Tropic of Cancer
Geography was found to significantly account to explain the total genetic variance across Africa and Eurasia at AGXT, PLPR2, MTRR, and CYP3A5, but not at NAT2. The contribution of geography was especially high in CYP3A5 in which it amounted to a very high proportion, 40.4 % of total diversity. For this variation it was further assessed the effect of i) latitude and ii) the location North and South the Tropic of Cancer, leading to realize that for CYP3A5 the highest value of F CT (which measures the proportion of variance among groups) was achieved when populations North of the Tropic of Cancer were grouped against the southern ones, attaining then 44.9 % of total diversity.
Concerning mode of subsistence, it was found to be a considerable modulator of diversity at PLRP2, explaining 8.8 % of the total diversity at the locus, while also accounting to residual proportions of diversity at NAT2 (1.6 %) and AGXT (1.5 %). When the criterion to group populations was the content in diets of cereals (for PLPR2), meat (for AGXT), folates (for MTRR and NAT2) or salt (for CYP3A5), significant F CT values were only observed at PLRP2, in which the more or less reliance in cereals contributed to 6.5 % of the total variance, and at NAT2, where differences in the dietary richness in folates explained 3 % of the locus diversity.
Signals of selection
To dissect better whether from the levels of genetic differentiation across Africa and Eurasia signs of selection could be captured, we used a conventional F ST-based approach that assumes that genetic differentiation among populations is expectedly higher or lower for loci under directional or balanced selection, respectively, expected under neutrality.
Viewing that, we have firstly generated null sampling distribution of the empirical F ST employing two different models, the finite Island Model (IM), which assumes the classical island model at migration-drift equilibrium ; and the Hierarchical Island Model (HIM), in which populations samples are assigned to different groups, allowing for increased migration rates between populations within groups than between groups . Besides portraying more realistically the demographic history of human populations, HIM was shown to produce a low rate of false positive signs comparatively to IM, when used to test loci for selection .
Considering simultaneously Africa and Eurasia and using as reference the IM distribution, the F STs for MTRR and AGXT did not differed significantly from the null expectations (Fig. 4A). By contrast, the global differentiations at PLRP2, NAT2 and CYP3A5, all lied outside the 95 % confidence region of the neutral distribution, though showing departures with opposite directions: whereas the F ST coefficient for NAT2 was significantly smaller than expected, the coefficients for CYP3A5 and PLRP2 were both significantly larger (P-values in Fig. 4A). The outlier position is especially remarkable in the case of CYP3A5 that presented the exceedingly high F ST coefficient of 0.3813, almost five times greater compared to the average empirical neutral level of 0.079 between African and Eurasian populations. These results suggest that NAT2 could have been under balanced or negative selection whist both PLRP2 and CYP3A5 might well have been modeled by positive selection. Taken into account the F ST null distribution simulated under the HIM (Fig. 4C), the F STs for NAT2 and PLRP2 lost the condition of significant outliers and the unique differentiation that remained significantly higher than the neutral expectations was at CYP3A5. Simulations were also carried out considering separately Africa and Eurasia. While in Eurasia none of the five assessed SNPs revealed to be outsiders in the distributions simulated under the simple or the hierarchical island models (results not shown), noteworthy in Africa the differentiations at PLRP and CYP3A5 were significantly higher than expected under the neutral expectations derived from the two demographical models (Fig. 4B and D).
Linkage Disequilibrium including D’ and r 2 parameters
Gene (target SNP)
c.1242G > C
c.1382 T > C
c.1084G > A
c.341 T > C
Associated with slow acetylator due to N-acetyltransferase enzyme variant (acetylation slow phenotype)
c.803G > A
The analysis of patterns of human genetic diversity at wide geographical scales can disclose remarkable features difficultly explained by demographic events or pure neutral processes, that rather might represent the first symptoms of environmental adaptations.
In this study, we draw attention to variations in AGXT, PLRP2, MTRR, NAT2 and CYP3A5, five genes assumedly involved in the metabolism of substances (including xenobiotics) that gain entry into the organism through dietary food stuffs, for which it has been previously posited that they could represent instances of gene-culture coevolution in humans [13, 15, 20].
Out of those genes, PLRP2, NAT2 and CYP3A5 were found to present signs in their distribution patterns evoking the action of environmental selective pressures, though of diverse nature and strength.
The most unequivocal signature of selection was associated with CYP3A5 that displayed a level of inter-population differentiation dramatically surmounting even the most conservative neutral expectations. Contrarily to our starting hypothesis, however, the amount of salt presumed to be ingested across main dietary habits did not accounted for the distribution of CYP3A5, which instead was highly determined by the geographical location of populations in the North or in the South of the Tropic of Cancer. So, the analyses here undertaken fully support previous findings indicating that CYP3A5*3 evolved under a selective pressure determined by an environmental factor correlated with latitude , but also add accuracy to the interpretation pointing toward a factor shared by regions located above or below the Northern Tropic. CYP3A5 has been intensively explored in the context of the genetic factors contributing to hypertension susceptibility, known to vary widely across different human populations. Nearly 40 years ago Gleibermann  proposed the “sodium retention” hypothesis, according to which the high rate of hypertension in certain populations could partially be due to a genetic background that was environmental adaptive, presuming that efficient salt retaining mechanisms might had been advantageous in the hot savanna climate where humans first emerged. More recently, it was argued that hypertension susceptibility was ancestral in humans, and that differential susceptibility arose due to distinct selective pressures after the Out-of-Africa expansion of modern humans . CYP3A5 is being often quoted to address the evolutionary perspective of hypertension susceptibility, due to the demonstrated role of CYP3A5 enzymes in sodium homeostasis, even though the many studies that analyzed the relationship between CYP3A5 genotypes and blood pressure/hypertension have provided quite inconsistent results (reviewed in Lamba et al. ). So, together with the clarification of the link between CYP3A5 and blood pressure, future lines of research should pay more attention to the role of CYP3A5 enzymes in the physiological processes related with thermoregulation and/or with neutralization of effects of sunlight exposure. In the highly heat stressful intertropical region, there is a regular need to deal with the threat of dehydration, which may raise complicated physiological responses in wet or dry climates under which the efficient control of heat loss likely differs. Interestingly, the involvement of CYP3A5 in such responses seems to obtain support from the recent discovery of an osmosensitive transcriptional control of human CYP3A4, CYP3A7, and CYP3A5 that revealed increased mRNA expressions under ambient hypertonicity .
Concerning PLRP2, the explorations here undertaken led in essence to corroborate the findings of Hancock et al. , indicating that diversity at the locus is somehow connected with mode of subsistence in populations. In fact, the assessed truncated allele showed to be significantly more frequent in farmers comparatively to groups not relying in farming, with the general trend, inferred from the whole set of African and Eurasian populations, pointing to a clinal decrease in frequency from farmers, next pastoralists towards agriculturalists. In addition, the global differentiation at this variant fell outside the neutral expectations, except when the HIM model was used in the tests for selection in Africa plus Eurasia. Hancock et al.  have associated the worldwide distribution of PLRP2 to the content in cereals in diets of populations, on the grounds of the important role of the protein encoded by PLRP2 in plant-based diets once, unlike other pancreatic lipases, this enzyme hydrolyzes galactolipids, which are the main triglyceride component in plants . However, the recent demonstration that the truncated allele addressed in their (and our) study exhibits near absence of secretion makes it unlikely that the encoded product may contribute to plant lipid digestion in humans , which seemingly undermines the biological basis originally proposed. In the meanwhile, new insights arose on the physiological role of PLRP2, suggesting for instance a likely major influence in fat digestion in newborns . This refreshed information opens new perspectives that deserve future investigation to clarify whether cereal content or other PLRP2 substrate, or amount of substrate, differing in hunter-gatherers, herders and farmers is the factor that exerted the selective pressure contributing to shape the current pattern of PLRP2 diversity in human populations.
Before, however, it is necessary to overcome the uncertainty raised by the presence of significant LD between PLRP2 and PLRP1. The two genes encode lipases that show high sequence homology and that assumedly participate in dietary fat metabolism, although not being yet clarified their differences in substrate specificity. Consequently, for the moment is not possible to discriminate between which variations at PLRP2 or PLRP1 are the best candidates to be causative of the selective signals detected.
With respect to NAT2, in agreement with earlier studies [21, 22] we also detected that the average frequency of a slow acetylator allele was statistically lower in hunter-gatherers than in food-producer populations, when Eurasian and African populations were taken as a set. Furthermore, the same slow acetylator variant, which is the widespread NAT2*6 allele, revealed an unusual low level of geographical structure across Africa and Eurasia, indicating that it was subjected to drifting constrains that likely could arise under the action of a mode of selection resulting in such a homogeneous allele distribution. In the tests for selection involving NAT2, significant departure from the neutral expectation was captured when the simple IM demographic model was assumed, which can raise uncertainty on whether signs of more subtle selective pressures might mistakenly escape the stringency of the HIM. The adaptive evolution of NAT2 has been supported by a number of different studies [19-23, 41, 42], including the examination of NAT2 sequence data whose patterns of diversity made it plausible that slow-acetylating variants have been subject to weak selective pressures . However, it is yet to be clarified the nature of such pressures. Luca et al.  tentatively claimed that it could be related with the diminished availability of folates in diet brought with the shift from economies relying in hunting and gathering to those based on farming and herding of domesticates. In line with the hypothesis, very recently a significant correlation was reported between NAT2 acetylator phenotypes and historical dietary habitudes in India with the slow acetylator prevalence being higher in regions where is higher the proportion of vegetarians populations . A major problem with this folate-related model is the still non-demonstrated role of NAT2 human enzymes in folate metabolism . Endogenous substrates for human NAT2 are not known, although being well established that NAT2 catalyzes the acetylation of many xenobiotics . Since the exposure to xenobiotic substances or to concentration of xenobiotics likewise must had altered along the change in diets experienced by producers of food resources, the role of this kind of substances in shaping diversity at NAT2 deserves further attention.
In summary, we provided added evidence that diversity at PLRP2 and NAT2 harbor signatures of genetic adaptations that might have been triggered by the diversification of modes of subsistence and diets that human populations began to experience after the rise of the Neolithic. Before that, modern humans had started to leave their original homeland in Africa, traveling out of the continent to colonize all regions in the globe. They progressively reached a wide range of new environments, facing new selective pressures that may have contributed to shape human genetic diversity. In CYP3A5, the compelling correlation with regions North and South the Tropic of Cancer, makes it likely that it belongs to the yet not fully understood catalog of genetic adaptations triggered by environmental stresses. Furthermore, the genetic signatures that CYP3A5 harbors seem strength enough to have been driven by very long-lasting selection.
Finally, we were unable to confirm the hypotheses at stake that AGXT and MTRR could also be diet-selected genes, since their diversity patterns could be well reconciled with demographic history at least of African and Eurasian populations.
In this study, we found signs that PLRP2, NAT2 and CYP3A5, three genes assumedly involved in the metabolism of substances (including xenobiotics) that gain entry into the organism through dietary food stuffs, can represent instances of gene-culture coevolution in humans. Concerning PLRP2, it is still needed to clarify whether the signal detected is not a hitchhiking effect of its neighbor PLRP1. In addition, it is also necessary to demonstrate which were the biological mechanisms, and the environmental factors involved as well as their interactions, to understand the nature of the selective pressures that contributed to shape current patterns of genetic diversity at those loci. Furthermore, functional studies are needed to demonstrate the putative biological impact of the variations assessed, which ultimately would also exclude that the detected signs of selection could be due to other variations in linkage disequilibrium with those that were here examined.
The current study was approved by the Institute of Molecular Pathology and Immunology of the University of Porto institutional review board. All samples involved were anonymised DNA extracts previously obtained from healthy unrelated individuals. The samples were collected under written informed consent. The current study complies with the ethical principles of the 2000 Helsinki Declaration of the 206 World Medical Association (http://jama.jamanetwork.com).
Samples and DNA extraction
A total amount of 361 individuals from six sub-Saharan African populations with different modes of subsistence were analyzed, i) 144 were farmers/agriculturalists from three populations: 32 from Angola (ANG), 82 from Equatorial Guinea (EQG) and 30 from Mozambique (MOZ); ii) 116 were herders/pastoralists from the Karamoja region in Northern Uganda (UGN); and iii) 101 belonged to two hunter-gatherer/forager populations: 39 Baka Pygmies from Gabon (BPY) and 62 Ju/hoansi from Tsumkwe, a small settlement in North Eastern Namibia (KNA). Forty-eight individuals from the Portuguese population (PTG) were additionally studied. Each population was assigned to a specific mode of subsistence and main diet component according to Murdock Ethnographic Atlas (http://lucy.ukc.ac.uk/cgi-bin/uncgi/Ethnoatlas/atlas.vopts), Encyclopedia of World Cultures, Africa: An Encyclopedia for Students .
From blood samples stored in FTA™ cards (Whatman), total DNA was extracted using a standard phenol-chlorophorm protocol  in the sample from Equatorial Guinea, while in the remaining samples it was done as described in previous works: Angola , Mozambique , Uganda , Baka Pygmies , Khoisan (Marks et al., submitted) and Portugal.
The screened SNPs were c.32C > T (rs34116584; NM_000030.2 AGXT gene), c.1074G > A (rs4751995; NM_005396.3 PLRP2 gene), c.1130A > G (rs162036; NM_024010.2 MTRR gene), c.219-237G > A (rs776746; NM_000777.3 CYP3A5 gene) and the following 4 variants in NAT2 gene (NM_000015.2): c.191G > A (rs1801279), c.341 T > C (rs1801280), c.590G > A (rs1799930) and c.857G > A (rs1799931). The last 4 SNPS, defining the alleles NAT2*14, NAT2*5, NAT2*6 and NAT2*7, respectively, were selected in this study because reportedly they allow inferring the acetylator phenotypes with high accuracy .
A pair of primers was designed for each SNP using Primer3 software ver. 4.0. Possible secondary structures or interactions between primers were checked with AutoDimer software ver. 1.0  (Additional file 11: Table S4). A multiplex Polymerase Chain Reaction (PCR) system was developed to co-amplify the regions containing the eight SNPs. The genotyping methodology was based in a multiplex minisequencing reaction, using the Single Base Extension (SBE)  reaction kit (Applied Biosystems). SBE primers were designed and tested identically as described above. Poly tails of varying lengths were attached to the 5′ end of each primer in order to avoid identical fragment sizes, allowing simultaneous typing of multiple variants in the same reaction (Additional file 12: Table S5). SBE products were run on ABI 3130 Genetic Analyser (AB Applied Biosystems) and the electropherograms were analyzed using GeneMapper software ver. 4.0. (Applied Biosystems, Foster City, USA), based on fragment size inferred with GeneScan-120 size standard.
Chromosomal locations and genomic segments from the 5 genes were obtained using the latest version of the human genome assembly GRCh37 (http://www.ensembl.org/).
The Arlequin software ver. 3.5.1  was used to estimate allele and haplotype frequencies, to test for Hardy-Weinberg Equilibrium (HWE) and to calculate genetic distances (F ST). Regarding the NAT2 gene, to account for linkage disequilibrium (LD) between SNPs, from the unphased multilocus genotypic data, haplotypic frequencies defined by the 4 SNPs were computationally estimated also with Arlequin software ver. 3.5.1.  and the acetylation phenotypes deduced from the pair of haplotypes carried by each subject .
In order to determine whether means of allele frequencies were statistically different between groups of populations defined according to various criteria, One-Way ANOVA tests were performed. The Spearman Rank Correlation Score Corrected (SRCSC) was used to evaluate the correlation between allele frequencies and latitude. Both statistical analyses were performed on the website for statistical computation VassarStats (http://vassarstats.net).
The graphical representation of a F ST distance matrix was constructed by means of the Multidimensional Scaling (MDS) procedure implemented in StatSoft, Inc. (2007), Statistica version 8.0 (www.statsoft.com).
To investigate possible signals of selection we obtained the neutral distribution of F STs conditional on heterozygozity based on genotypic data from a validated panel of 52 assumedly neutral SNPs for human identification , using the 61 African and Eurasian populations contained in the SNPforID browser, to draw a neutral dispersion cloud 50,000 coalescent simulations of 100 demes were carried on Arlequin software. For variants at AGXT, PLRP2, MTRR, NAT2 and CYP3A5, F ST, average heterozygosity within populations (Het) and the scaled heterozygosity Het/(1-F ST) were also computed. The null sampling distribution of the empirical F ST values was calculated using two distinct models, both implemented in Arlequin software ver. 3.5.1 : i) the classical Island Model at migration-drift equilibrium (IM), conventionally referred to as Fdist approach, proposed by , considering together all populations; and ii) the Hierarchical Island Model (HIM) more recently recommended by Excoffier and Hofer , for which populations were clustered in 5 groups (Europe, Africa, Middle East, South and East Asia) when a worldwide scale was assumed, or in 4 groups (Northern, Southern, Western and Eastern Africa) when only Africa was considered. Since F ST strongly correlates with heterozygosity , the empirical P-value for each locus was calculated within bins of 2,0000 SNPs grouped according to Minimum Allele Frequency (MAF), as the proportion of the bivariate probability distribution which was less probable than the estimated values, in the same way as calculated in the DFdist software package .
Hierarchical Analysis of Molecular Variance (AMOVA) was performed in the Arlequin software ver. 3.5.1. defining population groups according to: i) mode of subsistence, ii) main diet component, and iii) geography in the context of Eurasia and Africa. Additionally for CYP3A5, populations groups were also defined according to latitude and location North or South the Tropic of Cancer.
Finally for LD patterns analyses, the .ped files containing the data sets were first manipulated with gPLINK software (http://pngu.mgh.harvard.edu/purcell/plink/)  and then exported to Haploview software ver. 4.1  to calculate D’ (normalized D, where D’ = D/Dmax measures the LD strength) and r 2 (squared correlation coefficient measure of LD between the two loci) parameters and visualize the LD plots, considering a window of 200, 300 and 500 Kb for PLRP2, CYP3A5 and NAT2 genes, respectively.
Viewing comparative analyses, data for other populations were retrieved from the database dbCLINE (http://genapps.uchicago.edu/software.html) from the Di Rienzo laboratory, which includes information from the International HaPMap Project Phase III (http://hapmap.ncbi.nlm.nih.gov/), the Human Genome Diversity Project Panel-Centre d’Etude du Polymorphisme Humain (HGDP-CEPH); and also we included data already published in other works [56-59]. The populations’ abbreviations used in Figs. 1 and 2 are MAN (Mandenka), NBT (North Bantu), MOB (Mozabite), SBT (South Bantu), LUH (Luhya), TIG (Tigray), AMH1 (Amhara1), AFA (Afar), YOR (Yoruba), AMH2 (Amhara2), ARC (Ari Cultivator), WOL (Wolayta), ANU (Anuak), ARB (Ari Blacksmith), JPT (Japanese), NAX (Naxi), LAH (Lahu), MIA (Miaozu), TUJ (Tujia), SHE (She), YIZ (Yizu), CAM (Cambodian), HAN (Han), DAU (Daur), DAI (Dai), BER (Bergamo), RUS (Russian), TSC1 (Tuscan1), SAR (Sardinian), FRC (French), BAS (Basque), TUSC2 (Tuscan2), ORC (Orcadian), ADY (Adygei), DRU (Druze), PAL (Palestinian) KAL (Kalash), PAT (Pathan), GUJ (Gujarati), BUR (Burusho), SIN (Sindhi), XIB (Xibo), GUM (Gumuz), SOM (Somali), MAS (Maasai), MON (Mongola), YAK (Yakut), BAL (Balochi), MAK (Makrani), TU (Tu), UYG (Uygur), BED (Bedouin), BRA (Brahui), HAZ (Hazara), SAN (San), BIP (Biaka Pygmies), KUV (Kung Vasekela), MBP (Mbuti Pygmies) KHO1 (Khomani2), SAD (Sadawe), HAD (Hadza), HEZ (Hezhen), NYU (Naukan Yup’ik), ORO (Orogen) and MCH (Maritime Chukchee). More detailed information is summarized in Additional file 2: Table S2.
We thank David Comas (Universitat Pompeu Fabra, Barcelona), Jean-Marie Hombert and Lolke van der Veen (Dynamique du Langage, Institut des Sciences De l’Homme, Lyon France) as well as Patrick Mouguiama Daouda (University Omar Bongo, Libreville, Gabon) and the Centre International des Recherches Médicales de Franceville (CIRMF, Gabon) for sharing the Baka Pygmies data from Gabon. We also would like to thank Renaud Vitallis from Centre de Biologie pour la Gestion des Populations, Université Montpellier for sharing his modified version for codominant data of the Dfdist software. Finally we are really grateful to all volunteers who contributed with their DNA to this study.
- Luca F, Perry GH, Di Rienzo A. Evolutionary adaptations to dietary changes. Annu Rev Nutr. 2010;30:291-314.View ArticlePubMed CentralPubMedGoogle Scholar
- Patin E, Quintana-Murci L. Demeter’s legacy: rapid changes to our genome imposed by diet. Trends Ecol Evol. 2008;23(2):56-9.View ArticlePubMedGoogle Scholar
- Lachance J, Vernot B, Elbers CC, Ferwerda B, Froment A, Bodo J-M, et al. Evolutionary history and adaptation from high-coverage whole-genome sequences of diverse African hunter-gatherers. Cell. 2012;150(3):457-69.View ArticlePubMed CentralPubMedGoogle Scholar
- Balaresque PL, Ballereau SJ, Jobling MA. Challenges in human genetic diversity: demographic history and adaptation. Hum Mol Genet. 2007;16 Spec No. 2:R134-9.View ArticlePubMedGoogle Scholar
- Bersaglieri T, Sabeti PC, Patterson N, Vanderploeg T, Schaffner SF, Drake JA, et al. Genetic signatures of strong recent positive selection at the lactase gene. Am J Hum Genet. 2004;74(6):1111-20.View ArticlePubMed CentralPubMedGoogle Scholar
- Jones BL, Raga TO, Liebert A, Zmarz P, Bekele E, Danielsen ET, et al. Diversity of lactase persistence alleles in Ethiopia: signature of a soft selective sweep. Am J Hum Genet. 2013;93(3):538-44.View ArticlePubMed CentralPubMedGoogle Scholar
- Liu X, Ong RT-H, Pillai EN, Elzein AM, Small KS, Clark TG, et al. Detecting and characterizing genomic signatures of positive selection in global populations. Am J Hum Genet. 2013;92(6):866-81.View ArticlePubMed CentralPubMedGoogle Scholar
- Vitalis R, Gautier M, Dawson KJ, Beaumont MA. Detecting and measuring selection from gene frequency data. Genetics. 2014;196(3):799-817.View ArticlePubMed CentralPubMedGoogle Scholar
- Tishkoff SA, Reed FA, Ranciaro A, Voight BF, Babbitt CC, Silverman JS, et al. Convergent adaptation of human lactase persistence in Africa and Europe. Nat Genet. 2007;39(1):31-40.View ArticlePubMed CentralPubMedGoogle Scholar
- Perry GH, Dominy NJ, Claw KG, Lee AS, Fiegler H, Redon R, et al. Diet and the evolution of human amylase gene copy number variation. Nat Genet. 2007;39(10):1256-60.View ArticlePubMed CentralPubMedGoogle Scholar
- Caldwell EF, Mayor LR, Thomas MG, Danpure CJ. Diet and the frequency of the alanine:glyoxylate aminotransferase Pro11Leu polymorphism in different human populations. Hum Genet. 2004;115(6):504-9.View ArticlePubMedGoogle Scholar
- Danpure CJ. Variable peroxisomal and mitochondrial targeting of alanine: glyoxylate aminotransferase in mammalian evolution and disease. Bioessays. 1997;19(4):317-26.View ArticlePubMedGoogle Scholar
- Segurel L, Lafosse S, Heyer E, Vitalis R. Frequency of the AGT Pro11Leu polymorphism in humans: does diet matter? Ann Hum Genet. 2010;74(1):57-64.View ArticlePubMedGoogle Scholar
- De Caro J, Eydoux C, Cherif S, Lebrun R, Gargouri Y, Carriere F, et al. Occurrence of pancreatic lipase-related protein-2 in various species and its relationship with herbivore diet. Comp Biochem Physiol B Biochem Mol Biol. 2008;150(1):1-9.View ArticlePubMedGoogle Scholar
- Hancock AM, Witonsky DB, Ehler E, Alkorta-Aranburu G, Beall C, Gebremedhin A, et al. Colloquium paper: human adaptations to diet, subsistence, and ecoregion are due to subtle shifts in allele frequency. Proc Natl Acad Sci U S A. 2010;107 Suppl 2:8924-30.View ArticlePubMed CentralPubMedGoogle Scholar
- Lowe ME. The triglyceride lipases of the pancreas. J Lipid Res. 2002;43(12):2007-16.View ArticlePubMedGoogle Scholar
- Sias B. Human pancreatic lipase-related protein 2 is a galactolipase. Biochemistry. 2004;43(31):10138-48.View ArticlePubMedGoogle Scholar
- Shaw GM, Lu W, Zhu H, Yang W, Briggs FB, Carmichael SL, et al. 118 SNPs of folate-related genes and risks of spina bifida and conotruncal heart defects. BMC Med Genet. 2009;10:49.View ArticlePubMed CentralPubMedGoogle Scholar
- Magalon H, Patin E, Austerlitz F, Hegay T, Aldashev A, Quintana-Murci L, et al. Population genetic diversity of the NAT2 gene supports a role of acetylation in human adaptation to farming in Central Asia. Eur J Hum Genet. 2008;16(2):243-51.View ArticlePubMedGoogle Scholar
- Sabbagh A, Langaney A, Darlu P, Gerard N, Krishnamoorthy R, Poloni ES. Worldwide distribution of NAT2 diversity: implications for NAT2 evolutionary history. BMC Genet. 2008;9:21.View ArticlePubMed CentralPubMedGoogle Scholar
- Sabbagh A, Darlu P, Crouau-Roy B, Poloni ES. Arylamine N-acetyltransferase 2 (NAT2) genetic diversity and traditional subsistence: a worldwide population survey. PLoS One. 2011;6(4):e18507.View ArticlePubMed CentralPubMedGoogle Scholar
- Luca F, Bubba G, Basile M, Brdicka R, Michalodimitrakis E, Richards O, Vershubsky G, Quintana-Murci L, Kozlov AI, Novelleto A: Multiple Advantageous Amino Acid Variants in the NAT2 Gene in Human Populations. PloS one 2008, 3(9).Google Scholar
- Patin E, Barreiro LB, Sabeti PC, Austerlitz F, Luca F, Sajantila A, Behar DM, Semino O, Sakuntabhai A, Guiso N et al: Deciphering the Ancient and Complex Evolutionary History of Human Arylamine N-Acetyltransferase Genes. American Journal of Human Genetics 2006, 78.Google Scholar
- Thompson EE, Kuttab-Boulos H, Witonsky D, Yang L, Roe BA, Di Rienzo A. CYP3A variation and the evolution of salt-sensitivity variants. Am J Hum Genet. 2004;75(6):1059-69.View ArticlePubMed CentralPubMedGoogle Scholar
- Thompson EE, Kuttab-Boulos H, Yang L, Roe BA, Di Rienzo A. Sequence diversity and haplotype structure at the human CYP3A cluster. Pharmacogenomics J. 2006;6(2):105-14.View ArticlePubMedGoogle Scholar
- Young JH, Chang YP, Kim JD, Chretien JP, Klag MJ, Levine MA, et al. Differential susceptibility to hypertension is due to selection during the out-of-Africa expansion. PLoS Genet. 2005;1(6):e82.View ArticlePubMed CentralPubMedGoogle Scholar
- Zhang L, Miyaki K, Wang W, Muramatsu M. CYP3A5 polymorphism and sensitivity of blood pressure to dietary salt in Japanese men. J Hum Hypertens. 2010;24(5):345-50.View ArticlePubMedGoogle Scholar
- Danpure CJ. Primary hyperoxaluria type 1: AGT mistargeting highlights the fundamental differences between the peroxisomal and mitochondrial protein import pathways. Biochim Biophys Acta. 2006;1763(12):1776-84.View ArticlePubMedGoogle Scholar
- Cordain L, Eaton SB, Miller JB, Mann N, Hill K. The paradoxical nature of hunter-gatherer diets: meat-based, yet non-atherogenic. Eur J Clin Nutr. 2002;56 Suppl 1:S42-52.View ArticlePubMedGoogle Scholar
- Patillon B, Luisi P, Poloni ES, Boukouvala S, Darlu P, Genin E, et al. A homogenizing process of selection Has maintained an “ultra-slow” acetylation NAT2 variant in humans. Hum Biol. 2014;86(3):185-214.View ArticlePubMedGoogle Scholar
- Eap CB, Bochud M, Elston RC, Bovet P, Maillard MP, Nussberger J, et al. CYP3A5 and ABCB1 genes influence blood pressure and response to treatment, and their effect is modified by salt. Hypertension. 2007;49(5):1007-14.View ArticlePubMedGoogle Scholar
- Eaton SB, Eaton 3rd SB. Paleolithic vs. modern diets-selected pathophysiological implications. Eur J Nutr. 2000;39(2):67-70.View ArticlePubMedGoogle Scholar
- Beaumont MA, Nichols RA. Evaluating loci for use in the genetic analysis of population structure. P Roy Soc B-Biol Sci. 1996;263(1377):1619-26.View ArticleGoogle Scholar
- Excoffier LH, T.; Foll, M.: Detecting loci under selection in a hierarchically structured population. Heredity 2009, 103(4).Google Scholar
- Excoffier L, Hofer T, Foll M. Detecting loci under selection in a hierarchically structured population. Heredity (Edinb). 2009;103(4):285-98.View ArticleGoogle Scholar
- Gleibermann L. Blood pressure and dietary salt in human populations. Ecol Food Nutr. 1973;2(2):143-56.View ArticleGoogle Scholar
- Lamba JK, Lin YS, Schuetz EG, Thummel KE: Genetic contribution to variable human CYP3A-mediated metabolism. Advanced drug delivery reviews 2012.Google Scholar
- Kosuge K, Chuang AI, Uematsu S, Tan KP, Ohashi K, Ko BC, Ito S: Discovery of Osmo-sensitive Transcriptional Regulation of Human Cytochrome P450 3As (CYP3As) by the Tonicity-Responsive Enhancer Binding Protein (TonEBP/ NFAT5). Molecular Pharmacology 2007.Google Scholar
- Xiao X, Mukherjee A, Ross LE, Lowe ME. Pancreatic lipase-related protein-2 (PLRP2) Can contribute to dietary Fat digestion in human newborns. J Biol Chem. 2011;286(30):26353-63.View ArticlePubMed CentralPubMedGoogle Scholar
- Berton A, Sebban-Kreuzer C, Crenon I. Role of the structural domains in the functional properties of pancreatic lipase-related protein 2. FEBS J. 2007;274(22):6011-23.View ArticlePubMedGoogle Scholar
- Hein DW, Doll MA. Accuracy of various human NAT2 SNP genotyping panels to infer rapid, intermediate and slow acetylator phenotypes. Pharmacogenomics. 2012;13(1):31-41.View ArticlePubMed CentralPubMedGoogle Scholar
- Talbot J, Magno LA, Santana CV, Sousa SM, Melo PR, Correa RX, et al. Interethnic diversity of NAT2 polymorphisms in Brazilian admixed populations. BMC Genet. 2010;11:87.View ArticlePubMed CentralPubMedGoogle Scholar
- Khan N, Pande V, Das A. NAT2 sequence polymorphisms and acetylation profiles in Indians. Pharmacogenomics. 2013;14(3):289-303.View ArticlePubMedGoogle Scholar
- Sim E, Lack N, Wang CJ, Long H, Westwood I, Fullam E, et al. Arylamine N-acetyltransferases: structural and functional implications of polymorphisms. Toxicology. 2008;254(3):170-83.View ArticlePubMedGoogle Scholar
- John Haley CR, Rebecca Stefoff, Joseph Ziegler: Africa: An Encyclopedia for Students. In: Africa: An Encyclopedia for Students. Edited by Middleton J: Charles Scribner’s Sons; 2002.Google Scholar
- Sambrook J. Molecular cloning : a laboratory manual, vol. 3. 2nd ed. NY: Cold Spring Harbor, N.Y: Cold Spring Harbor Laboratory Press; 1989.Google Scholar
- Beleza S, Alves C, Reis F, Amorim A, Carracedo A, Gusmao L. 17 STR data (AmpF/STR identifiler and powerplex 16 system) from Cabinda (Angola). Forensic Sci Int. 2004;141(2-3):193-6.View ArticlePubMedGoogle Scholar
- Alves C, Gusmao L, Amorim A. STR data (AmpFlSTR profiler plus and GenePrint CTTv) from Mozambique. Forensic Sci Int. 2001;119(1):131-3.View ArticlePubMedGoogle Scholar
- Gomes V, Sanchez-Diz P, Alves C, Gomes I, Amorim A, Carracedo A, et al. Population data defined by 15 autosomal STR loci in Karamoja population (Uganda) using AmpF/STR Identifiler kit. Forensic Sci Int Genet. 2009;3(2):e55-8.View ArticlePubMedGoogle Scholar
- Batini C, Lopes J, Behar DM, Calafell F, Jorde LB, van der Veen L, et al. Insights into the demographic history of African Pygmies from complete mitochondrial genomes. Mol Biol Evol. 2011;28(2):1099-110.View ArticlePubMedGoogle Scholar
- Vallone PM, Butler JM. AutoDimer: a screening tool for primer-dimer and hairpin structures. Biotechniques. 2004;37(2):226-31.PubMedGoogle Scholar
- Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564-7.View ArticlePubMedGoogle Scholar
- Sanchez JJ, Phillips C, Børsting C, Balogh K, Bogus M, Fondevila M, et al. A multiplex assay with 52 single nucleotide polymorphisms for human identification. Electrophor. 2006;27(9):1713-24.View ArticleGoogle Scholar
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559-75.View ArticlePubMed CentralPubMedGoogle Scholar
- Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21(2):263-5.View ArticlePubMedGoogle Scholar
- Bryc K, Auton A, Nelson MR, Oksenberg JR, Hauser SL, Williams S, et al. Genome-wide patterns of population structure and admixture in West Africans and African Americans. Proc Natl Acad Sci U S A. 2010;107(2):786-91.View ArticlePubMed CentralPubMedGoogle Scholar
- Henn BM, Gignoux CR, Jobin M, Granka JM, Macpherson JM, Kidd JM, et al. Hunter-gatherer genomic diversity suggests a southern African origin for modern humans. Proc Natl Acad Sci U S A. 2011;108(13):5154-62.View ArticlePubMed CentralPubMedGoogle Scholar
- Pagani L, Kivisild T, Tarekegn A, Ekong R, Plaster C, Gallego Romero I, et al. Ethiopian genetic diversity reveals linguistic stratification and complex influences on the Ethiopian gene pool. Am J Hum Genet. 2012;91(1):83-96.View ArticlePubMed CentralPubMedGoogle Scholar
- Schlebusch CM, Skoglund P, Sjodin P, Gattepaille LM, Hernandez D, Jay F, et al. Genomic variation in seven Khoe-San groups reveals adaptation and complex African history. Science. 2012;338(6105):374-9.View ArticlePubMedGoogle Scholar
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.