- Research article
- Open Access
Haplotype frequencies at the DRD2 locus in populations of the East European Plain
BMC Genetics volume 10, Article number: 62 (2009)
It was demonstrated previously that the three-locus RFLP haplotype, TaqI B-TaqI D-TaqI A (B-D-A), at the DRD2 locus constitutes a powerful genetic marker and probably reflects the most ancient dispersal of anatomically modern humans.
We investigated TaqI B, BclI, MboI, TaqI D, and TaqI A RFLPs in 17 contemporary populations of the East European Plain and Siberia. Most of these populations belong to the Indo-European or Uralic language families. We identified three common haplotypes, which occurred in more than 90% of chromosomes investigated. The frequencies of the haplotypes differed according to linguistic and geographical affiliation.
Populations in the northwestern (Byelorussians from Mjadel'), northern (Russians from Mezen' and Oshevensk), and eastern (Russians from Puchezh) parts of the East European Plain had relatively high frequencies of haplotype B2-D2-A2, which may reflect admixture with Uralic-speaking populations that inhabited all of these regions in the Early Middle Ages.
The DRD2 gene is located on chromosome 11 and encodes the neuronal dopamine receptor D2, which plays a role in movement, emotional memory, and appetitive behavior . The DRD2 locus was an object of numerous genetic association studies [2–5], and the most extensively studied polymorphism is a TaqI A RFLP (rs1800497; in the vicinity of the DRD2 gene), which has been associated with the pathology of psychoses (schizophrenia and manic-depressive disorder), Parkinson's disease, and various substance abuse syndromes. It has been proposed that TaqI A might be in linkage disequilibrium with some unidentified polymorphisms within the exons or regulatory regions of the DRD2 gene, but recently it has been mapped to the last exon of the ANKK1 (ankyrin repeat and kinase domain containing 1) gene, and it results in a Glu-Lys substitution . Other frequently studied RFLPs, for example, TaqI B and D (rs1079597 and rs1800498, respectively) are located in the introns of the DRD2 gene and, most probably, have no functional significance.
TaqI B, TaqI D, and TaqI A polymorphisms have also been studied on a worldwide scale [[7–11]; the ALFRED database, http://alfred.med.yale.edu/alfred/index.asp], and centers of dispersal, which probably reflect the most ancient dispersal of anatomically modern humans, have been proposed for their three-locus haplotypes . It has been shown that the B2, D2, and A1 alleles are ancestral alleles common to other hominoids [12–14]. Kidd et al.  proposed the following evolutionary sequence for the most common haplotypes: evolution of B2-D2-A1 to B2-D2-A2 and B1-D2-A1 and evolution of B2-D2-A2 to B2-D1-A2. The other less frequent haplotypes probably arose by recombination. Three-locus haplotypes exhibit pronounced geographical differentiation. With the exception of some tribal populations of India , the ancestral haplotype B2-D2-A1 is mainly confined to African groups. The singly derived haplotype B1-D2-A1 is most widespread among people of East Asian descent, including Native Americans. Haplotype B2-D2-A2 is present among all populations but is least prevalent in Western European and American populations. The doubly derived haplotype B2-D1-A2 is common in Europe and rare in East Asia. The other haplotypes are extremely rare and have sporadic distribution.
Here we provide data on the variability of DRD2 haplotypes in a previously uninvestigated region, the East European Plain. We investigated 14 contemporary populations of the East European Plain and Siberia that belong to the Indo-European and Uralic language families. In addition, two populations of the Altaic language family (Yakuts and Kalmyks) and a population of the North Caucasian language family (Adygeis) were included as reference groups. We also performed an updated global analysis of B-D-A haplotype frequencies using our data and data from the ALFRED database.
We studied five RFLP loci, TaqI B, BclI, MboI, TaqI D, and TaqI A. The locations of these polymorphisms are shown on the gene map (Figure 1), and allele frequencies in the populations studied are presented in Table 1. Allele 1 (the restriction site was absent) at the TaqI B locus was always present with allele 1 at the BclI locus; allele 2 (the restriction site was present) at the former locus was always found with allele 2 at the latter locus. The same tight linkage was observed for the MboI and TaqI D loci, and there were no exceptions among the 3198 chromosomes studied. In most populations, all loci exhibited Hardy-Weinberg equilibrium (assessed by the exact test using a Markov chain, P > 0.05). Significant deviations from Hardy-Weinberg equilibrium were observed only for the MboI and TaqI D loci in Khants (P = 0.044) and the TaqI A locus in Yakuts (P = 0.023).
Alleles 1 at the TaqI B, BclI, and TaqI A loci were more frequent in the populations of Russians 1 and 5, Veps, Komi 2, Khants, Nenets, Yakuts, and especially Kalmyks, than in the other populations (Table 1). Allele 1 at the MboI locus and allele 2 at the TaqI D locus were the most frequent alleles in Asian populations, i.e. Khants, Nenets, Yakuts, and Kalmyks. It is notable that combinations of allele 2 at the TaqI B locus with allele 1 at the BclI locus and allele 2 at the MboI locus with allele 2 at the TaqI D locus, found in the sequenced chimpanzee genome , were absent in human populations (Table 1).
Pairwise linkage disequilibrium (D') was strong in most cases (Table 2). However, disequilibrium values were lower for the MboI-TaqI A and TaqI D-TaqI A pairs in some populations, which is similar to the findings of Kidd et al. . In Yakuts, disequilibrium was very low, except for the TaqI B-TaqI A and BclI-TaqI A pairs. Only two of the SNPs studied have been involved in the international HapMap project http://www.hapmap.org: TaqI A (rs1800497) and TaqI B (rs1079597). Linkage disequilibrium between these SNPs is high and significant in all 11 populations included in the HapMap project phase 3. The MboI and TaqI D loci are located between SNPs rs2587548 and rs2734836 investigated in the HapMap project. Linkage disequilibrium between SNP rs2587548 and the TaqI A locus is low in the Utah population of European ancestry and high but nonsignificant in the Chinese and Japanese populations (HapMap project phase 2), which is in agreement with our results.
Taking into consideration perfect linkage in the TaqI B-BclI and MboI-TaqI D pairs, the BclI and MboI RFLPs were redundant for inter-population comparison, and we focused on the distribution of three-locus haplotypes TaqI B-TaqI D-TaqI A. Their frequencies are shown in Table 3. Only three haplotypes were common in the populations studied: B2-D1-A2, B2-D2-A2, and B1-D2-A1 (haplotypes termed according to ). The other haplotypes were extremely rare. Haplotype frequencies were used for calculation of FST-based genetic distances. The resulting distance matrix was visualized using multidimensional scaling (MDS) (Figure 2). Two large population groups can be distinguished: Asian (Khants, Nenets, Kalmyks, and Yakuts) and European (the other populations). According to UPGA cluster analysis, the European and Asian groups might be further subdivided into subclusters European-1 and European-2, Khants-Nenets group, Kalmyks, and Yakuts.
In the Asian cluster genetic distances were significant between Yakuts and Kalmyks (P < 0.00001), Khants and Yakuts (P < 0.00001), Khants and Kalmyks (P < 0.04661), Nenets and Yakuts (P < 0.00099), but not between Nenets and Kalmyks (P < 0.07438). The distance between Khants and Nenets was small and non-significant (P < 0.52033).
All populations of the European cluster had significant genetic distances from the populations of the Asian cluster. The European-1 and European-2 clusters were quite homogeneous; most distances were non-significant within the former, and all were non-significant within the latter. Only Russians 1 occupied distinct position within the European-1 cluster. In contrast, most pairwise genetic distances between the two subclusters were significant. Exceptions included Veps, which had no significant distances from the populations of the European-2 cluster, and Byelorussians 2, which, had a small number of significant distances from the populations of the European-1 cluster.
The matrix of haplotype-based genetic distances was also compared with matrices of great circle geographical distances (data not shown) to assess the possible effect of isolation by distance. The matrices including all 17 populations were highly correlated according to the Mantel test (r = 0.749, P-value = 0.0001). However, correlation of geographical and genetic distances was not observed among the populations of the European cluster, i.e., after exclusion of Khants, Nenets, Kalmyks (which have migrated from East Asia in historical times), and Yakuts (r = 0.121, P-value = 0.2999). The use of more realistic distance measurements around geographical barriers, such as the Azov Sea, the Caucasus, and some parts of the Ural Mountains had little effect on results of the test (the correlation coefficients for 17 and 13 populations were 0.754 and 0.117, respectively; P-values were 0.0001 and 0.3109, respectively). Thus, isolation by distance is not a likely cause of the genetic variation observed in the East European Plain.
We also compared our results with those of previous authors to examine population relationships in greater detail. To do this, we analyzed haplotype frequencies in the populations studied and in 38 populations from various continents (data from [7, 9], and the ALFRED database) using MDS of FST-based genetic distances (Figure 3 and Additional file 1). A good fit between the two-dimensional plot and the source data was obtained, demonstrated by the low stress value (0.062). Cluster analysis by the UPGA algorithm was performed using the same distance matrix and enabled us to define four large population clusters with two subclusters each, i.e., eight population groups in total. All clustering methods that we used (complete linkage, unweighted and weighted pair-group average, unweighted and weighted pair-group centroid, and Ward's method) revealed identical clusters at the level of eight groups. However at 'higher' and 'lower' clustering levels, some methods gave different results. To identify haplotypes responsible for the observed intercluster differences, haplotype frequencies for various clusters were compared using the Kolmogorov-Smirnov test (Table 4).
The following populations constituted the European-1 subcluster: Indo-European-speaking Irish, Danes, Byelorussians 1 and 3, Russians 1, 2, 3, and 7; North-Caucasian-speaking Adygeis 1 and 2; Jews 3 (Ashkenazi); Uralic-speaking Veps, Komi 1 and 2. The populations of this cluster had the following haplotype frequencies: 0.50-0.65 for the "European" haplotype B2-D1-A2, 0.21-0.31 for the "worldwide" haplotype B2-D2-A2, 0.08-0.25 for the "East Asian" haplotype B1-D2-A1. Some populations of the East European Plain (Indo-European-speaking Byelorussians 2, and Russians 4, 5, and 6; Uralic-speaking Finns and Komi 3; Altaic-speaking Chuvash) fell into the European-2 subcluster, which is characterized by the following haplotype frequencies: B2-D1-A2, 0.37-0.52; B2-D2-A2, 0.27-0.45; and B1-D2-A1, 0.04-0.25. Two European subclusters can be differentiated on the basis of their B2-D2-A2 and B2-D1-A2 frequencies (P-values according to the Kolmogorov-Smirnov test < 0.0001 and 0.001, respectively). This is also evident from Figure 3: on the MDS plot, both European subclusters occupy almost identical positions in the B1-D2-A1 frequency gradient (Figure 3, upward arrow) but have different positions in the B2-D1-A2 and B2-D2-A2 frequency gradients (Figure 3, arrows).
A low but highly significant level of population differentiation was observed between the European-1 and -2 subclusters (FCT 0.018, P < 0.00001, Table 5); 59% of genetic distances between populations of the two subclusters were significant, but only 14% and 22% of the intracluster distances were significant. Other close pairs of subclusters such as Intermediate-1 and -2 demonstrated higher levels of differentiation (FCT 0.027-0.045, Table 5). About 60% of intercluster genetic distances were significant in all instances.
Ugric-speaking Khants 1 and 2 and Samoyedic-speaking Nenets were distant from the other Uralic populations (Finns, Veps, Komi 1, 2, and 3) and grouped into the Intermediate-1 subcluster with Sino-Tibetan-speaking Riang and Tripperah from India, with Melanesians and Micronesians (the results for Melanesians and Micronesians should be interpreted with caution because of a low sample size; see Additional file 1). In Khants and Nenets, the frequencies of the "European" haplotype B2-D1-A2 (0.22-0.25) and the "East Asian" haplotype B1-D2-A1 (0.23-0.26) were intermediate between those of the European-1 and Asian-1 subclusters (Figure 3). It is interesting that the frequencies of the "worldwide" haplotype B2-D2-A2 in Khants (0.48-0.49) and Nenets (0.48) fell within the range typical of the Asian-1 subcluster, 0.44-0.53 (P-value = 0.9441 according to the Kolmogorov-Smirnov test).
The Asian-1 subcluster was formed by populations of East Asian descent (Altaic-speaking Kalmyks; Koreans and Japanese; Sino-Tibetan-speaking Hakka, Han 1 and 2; and Austro-Asiatic-speaking Cambodians). They had the following frequency pattern: B2-D1-A2, 0.04-0.14; B2-D2-A2, 0.44-0.53; and B1-D2-A1, 0.34-0.46.
The global FST value, 0.11413 (Table 5), falls within the range typical of autosomal markers (0.09-0.14, ). This value was estimated using haplotype frequencies without taking into account the extent of molecular differences between haplotypes. Calculation of FST based on numbers of pairwise differences between haplotypes gave a nearly identical value, 0.11383. Thus, the differentiation of populations may be explained by drift only, without any significant influence of mutation.
The East European (Russian) Plain is a region in which peoples of the Indo-European and Uralic language families have come into contact over an extended period. Uralic-speaking peoples have the longest validated archaeological record in this region . The most recent large-scale migration to this region involved the movement of Slavs (the Indo-European language family) to the east and northeast of their presumed homeland in Central Europe about 500 AD [18, 19]. Slavs were not the first Indo-European-speaking people who arrived in the Russian Plain: in the first millennium BC, Baltic-speaking tribes occupied a large part of the East European Plain . They were later displaced by Slavic tribes. According to the widely accepted hybridization theory of the origin of Eastern Slavs , Slavic populations arriving in the East European Plain were mixed with indigenous Uralic- and, probably, Baltic-speaking people.
In our study, all populations of the East European Plain (excluding the Kalmyks, which are of East Asian origin) fell into a single large cluster termed European. Many populations within this cluster are indistinguishable with our genetic marker, i.e., genetic distances between them were not significant, which is in agreement with the low FST value for the European cluster (0.013). However, some populations were characterized by a large percentage of significant genetic distances from the other populations of the cluster. Most such populations fell into the so-called European-2 subcluster defined by cluster analysis; the 'core' subcluster was termed European-1, and 59% of genetic distances between populations of the two subclusters were significant. European-1 and European-2 subclusters (Figure 3) are differentiated according to the B2-D2-A2 frequency, but not according to the B1-D2-A1 frequency, which might reflect the degree of Asian admixture. Natural selection probably was not responsible for separation of the two European subclusters as there is no difference in allele frequencies at the TaqI A locus (Table 4), which is considered the most likely candidate for selection in the whole DRD2 region [2–6].
The European-2 subcluster includes two Middle Eastern populations (Jews 2 from Yemen and Druze from Israel), two Uralic-speaking populations (Finns and Komi 3), also four Slavic-speaking populations (Byelorussians 2 and Russians 4, 5, and 6), and the Altaic-speaking Chuvash. All these linguistically and geographically distant populations are differentiated to some extent from the core of the European cluster, the European-1 subcluster, because of a relatively high B2-D2-A2 frequency.
The B2-D1-A2 and B1-D2-A1 haplotypes apparently have centers of dispersal in Europe/West Asia and East Asia, respectively . The B2-D2-A2 haplotype may also have a center of dispersal, the most probable location of which is in Africa. B2-D2-A2 was among the first haplotypes that evolved from the ancestral haplotype B2-D2-A1 in Africa  and still is the most abundant haplotype in all African populations (Additional file 1). Therefore, the first settlers of Eurasia that migrated to Arabia and Levant may mostly have carried the B2-D2-A2 haplotype and a small proportion of other haplotypes that were either subsequently eliminated or amplified by genetic drift and/or natural selection in various parts of the world.
B2-D2-A2 is the predominant haplotype in contemporary populations of Middle Eastern origin (Jews 1 from Ethiopia, Jews 2 from Yemen, and Druze from Israel). Populations of Levant began to disperse into Europe about 50,000 YBP [21, 22]. People of that diaspora might have carried B2-D2-A2 as a prevalent haplotype. However, another haplotype, B2-D1-A2, is the predominant haplotype in contemporary populations of Europe, for example, in linguistically and geographically distant populations such as the Irish, Danes, Russians, and Adygeis. Amplification of this haplotype might have taken place during the initial migration to Europe or might be associated with confinement in refugia of the last glacial maximum and reexpansion. According to one of several hypotheses based on archaeological evidence (reviewed by Zvelebil ), Uralic-speaking groups descended from Europeans who had been confined in the East European refugium in Ukraine and Southern Russia [24–29]. It is possible that the people of the East European refugium retained a high frequency of the B2-D2-A2 haplotype, in contrast with the other European populations that have spread from other refugia.
Thus, on a global scale, the B2-D2-A2 frequency possibly reflects genetic features of the first Eurasian settlers, e.g. in Jews 2 from Yemen and Druze from Israel, and, on a local scale, genetic features of Uralic-speaking populations, e.g., in Finns and Komi 3, Khants and Nenets. Therefore, the other members of the European-2 subcluster, Byelorussians 2, Russians 4, 5, and 6, and Chuvash, probably have a certain level of Uralic admixture. Moreover, according to other genetic and historical data, these populations have Uralic substratum.
Studies on mtDNA [30–34] and Y-chromosome haplogroups [33, 35, 36], autosomal VNTR diversity [37, 38], and autosomal haplotypes  consistently show a degree of Uralic admixture in populations of northern Russians. This admixture is manifested by a 'Uralic-specific' marker, the U4 mtDNA haplogroup [40–45], or by Asian-specific markers which Uralic populations acquired during earlier admixture with East Asians: Asian mtDNA haplogroups [40, 42, 45], N3 and N2 Y-chromosome haplogroups [46, 47]. For example, the Mezen' population investigated by Balanovsky et al.  has high N3 frequency, resembling some Uralic populations.
Russians 5 (Oshevensk) were most closely associated with Finns and Chuvash according to the MDS results (Figure 3). In the study of Verbenko et al.  on polymorphic tandem repeats at the D1S80 locus, the same Oshevensk sample (as well as another northern Russian sample) clustered together with Uralic-speaking Mari, Komi, and Udmurts, whereas other Russian populations clustered with Indo-Europeans and Adygeis. Analysis of other repeat loci, 3' ApoB, DMPK, DRPLA, and SCA1, also demonstrated remoteness of some northern Russian populations (including Russians 5, Oshevensk) from the core of the European cluster . Similar results were obtained using haplotypes at the TP53 locus : the Oshevensk population tended to form a cluster with Uralic-speaking Mordvins and with Altaic-speaking Kalmyks and Buryats, but not with Russians from Smolensk (Russians 2) or Byelorussians from Pinsk (Byelorussians 1).
According to archaeological data, the Arkhangelsk region (including Mezen' and Oshevensk) was populated by Uralic tribes in the Middle Ages (; see Figure 4). Russian colonization of this region began relatively recently (after the 12th century AD) . Thus, a very high level of Uralic admixture in Mezen' and Oshevensk is not surprising.
The Uralic genetic substratum is appreciable not only in the Northeast of the East European Plain but also in its northwestern part, for example, in the Pskov  and Novgorod regions , in Latvians and Lithuanians [46, 49, 50]. Baltic-speaking peoples, now represented by the Latvians and Lithuanians, came into contact with Uralic groups before the Slavs did (Figure 4; ). That Byelorussians 2 (Mjadel') fell into the European-2 subcluster may also reflect a general tendency in the northwestern region. Mjadel' is located in the northwestern part of Belarus near the contemporary Lithuanian border. The Russians 4 (Puchezh) population is distant from the northeastern and northwestern groups, but also belongs to the European-2 subcluster (Figure 3). Uralic admixture in this population may be explained by the presence of Uralic-speaking tribes in the region of Puchezh in historical times (; see Figure 4).
Russians 1 (Andreapol') and Uralic-speaking Veps are close to Uralic-speaking Komi 2 (Obyachevo) on the MDS plot (Figure 3). All these populations are located within the region occupied by Uralic peoples in the Middle Ages (Figure 4), but belong to the European-1 cluster and do not have high B2-D2-A2 frequencies typical of the European-2 cluster. However, they are shifted from the core of the European cluster because of a relatively high proportion of the "East Asian" haplotype B1-D2-A1. In fact, the Veps population has significant genetic distance only from Druze, but not from the other populations of the European-2 cluster, and Komi 2 only from Jews 2, Druze, Russians 6, and Komi 3. The Andreapol' sample had the highest B1-D2-A1 frequency of all European populations (Additional file 1), and eight of 13 genetic distances between this sample and the other populations of the European-1 subcluster are significant.
Komi populations demonstrate remarkable heterogeneity according to various marker systems. For example, Komi-Permyaks and Komi-Zyrians have rather different mtDNA haplogroup frequencies but both have a relatively high U4 frequency . In our study, one of the Komi-Zyrian populations (Komi 1, Izhma) belonged to the core of the European-1 cluster. It is interesting that the craniological results of Moiseyev  also place Komi-Zyrians at the core of the European cluster and distant from Uralic and Asian groups. According to three-site haplotype frequencies at the TP53 locus and VNTR frequencies at the D1S80 and 3' ApoB loci, the Komi 1 and 2 populations are distant from Uralic-speaking Finns, Mordvins and Khants, East Asian groups, and Slavic groups . Moreover, the Komi 2 (Obyachevo) population is distant from Komi 1 and closer to Slavic groups than Komi 1 . Thus, the position of Komi in genetic gradients remains uncertain because of substantial divergence of population samples and contradictory results, which may reflect a complex history of this group or natural selection.
B2-D2-A2 haplotype frequencies in Khants (0.48-0.49) and Nenets (0.48) fell within the range typical of the Asian-1 subcluster, 0.44-0.53. Therefore, the high frequency of the B2-D2-A2 haplotype in Uralic-speaking Khants and Nenets cannot be explained only by admixture of Europeans with typical East Asians; a combination of admixture and other processes such as gene drift or selection is a more likely explanation. The Khants and Nenets may have received the B2-D2-A2 haplotype both from the East European refugium and from East Asia, e.g., from Siberian populations of East Asian origin. One of such Siberian populations, the Yakuts, has a very high frequency of haplotype B2-D2-A2, 0.6-0.7. However, DRD2 haplotype frequencies in other Siberian populations of East Asian origin are unknown, and it is not possible to draw definitive conclusions about B2-D2-A2 frequencies in Siberia based on one ethnic group alone. Moreover, because Yakuts 1 exhibited low linkage disequilibrium between RFLPs, haplotype frequencies for this sample should be interpreted with caution. Yakuts 1 and 2 apparently do not belong to the cluster of typical East Asians, East Asian-1 (Figure 3), although they are clearly of East Asian origin. As suggested by archaeological and linguistic evidence, the Yakuts probably migrated north from their original area of settlement near Lake Baykal because of the Mongol expansion from the 13th to 15th century AD . Y-chromosome results reveal a very strong bottleneck in the Yakut population, which probably preceded their recent expansion [46, 54]. This bottleneck effect may be responsible for the aberrant haplotype frequencies for Yakuts observed in our study.
Populations in the northwestern (Byelorussians 2 from Mjadel'), northern (Russians 5 from Mezen' and 6 from Oshevensk; Komi 3), and eastern parts (Russians 4 from Puchezh and Chuvash) of the East European Plain have relatively high frequencies of haplotype B2-D2-A2, which may reflect admixture with Uralic-speaking populations. Uralic genetic substratum in these regions, which were inhabited by Uralic-speaking tribes as late as the Early Middle Ages, was also shown by studies in which other genetic markers were used (mtDNA, Y-chromosome, and autosomal). Thus, the analysis of DRD2 haplotypes supports results on Slavic-Uralic admixture obtained using other markers, mainly neutral and sex-specific markers.
The linguistic affiliations and geographical locations of the studied populations are shown in Table 1 and Figure 5. These populations have been described previously (see footnotes to Table 1), except for Andreapol' (Russians 1), Mezen' (Russians 6), Veps, and Nenets. The Andreapol' sample from the Tver region, the most western of the Russian populations involved in our study, consisted of individuals born in Andreapol' (a small town), Bologovo village, and other nearby villages. The Mezen' sample from the Arkhangelsk region, the most northern of the Russian populations sampled, consisted of individuals born in the small towns, Mezen' and Kamenka, in Dorogorskoe village, and in other nearby villages. The Nenets are Samoyedic-speaking tribes, and their economy is based on reindeer herding. In 2002, the population of Nenets was about 41,000. The Nenets sample, which belongs to the Tundra group, was collected in Yar-Sale and Panaevka villages in the Yamalo-Nenets autonomous district, Tyumen' region. The Veps are Finnic-speaking and reside in small settlements scattered among Russians of the Vologda region, the Leningrad region, and Karelia. In 2002, the population of Veps was about 8,000. Samples were collected from Veps in several villages (Pyazhozero, Pyazhelka, Koloshma, and others) in the Babaevo district of the Vologda region.
DNA isolation and typing
Blood samples (8 ml) were obtained by venipuncture and collected into EDTA-coated containers. Informed consent was obtained from each individual. The research protocols and forms of informed consent have been approved by the Ethic Commission of the Medico-Genetic Scientific Centre of the Russian Academy of Medical Sciences (an approval was signed by the Head of the Ethic Commission, PhD, professor L.F. Kurilo). All individuals belonged to the native ethnic group of the region, i.e., their lineage in the region extended for at least two previous generations, and were unrelated to each other. DNA was isolated from leucocytes by proteinase K treatment and extraction with phenol-chloroform . Each DNA sample was subjected to three PCR analyses: amplification of a 459 bp fragment for TaqI B and BclI RFLP analysis, amplification of a 300 bp fragment for MboI and TaqI D RFLP analysis, and amplification of a 237 bp fragment for TaqI A RFLP analysis (primer sequences and original PCR protocols were obtained from the website of K. Kidd: http://info.med.yale.edu/genetics/kkidd/Protocol_TOC_new2002.html). The locations of these polymorphisms are shown on the gene map (Figure 1). All endonuclease restriction reactions were carried out overnight. Samples containing unrestricted fragments were tested at least twice. Restriction products were separated by electrophoresis using a 2.5% agarose gel.
The general strategy of statistical analysis was similar to that used in the work of Poloni et al. . Allele frequencies, correspondence to Hardy-Weinberg equilibrium, and significance of linkage disequilibrium (P-values) were evaluated using Arlequin version 2.0 software http://cmpg.unibe.ch/software/arlequin. Linkage disequilibrium values (D') between polymorphic loci were calculated as suggested by Lewontin . Frequencies of haplotypes were estimated from RFLP genotype data using the expectation-maximization algorithm of Excoffier and Slatkin  implemented in Arlequin 2.0. Genetic affinities among populations were evaluated using coancestry coefficients, or linearized pairwise FST values  calculated on the basis of allele or haplotype frequencies. Significance of genetic distances was tested using permutations .
Correlation of geographical and genetic distances was assessed using the Mantel test and XLSTAT version 2008.6.04 software (Addinsoft). Great circle geographical distances were calculated using the haversine formula; latitudes and longitudes were determined using Google Earth software. Multidimensional scaling (MDS) and cluster analysis with the unweighted pair-group average (UPGA) method were performed using STATISTICA version 6.0 http://www.statsoft.com. Statistical comparison of haplotype frequencies in population groups was performed using the Kolmogorov-Smirnov test implemented in XLSTAT. Differentiation between population groups defined by MDS and cluster analysis was estimated using the analysis-of-molecular-variance approach, AMOVA , implemented in Arlequin 2.0. Conventional FST distances between haplotypes were used, i.e., all haplotypes were considered equidistant (a conservative scenario of pure drift). Significance of the genetic-structure indexes obtained with the AMOVA method was tested using a permutational procedure (1 × 106 permutations).
Kandel ER, Schwartz JH, Jessell TM: Principles of Neural Science. 2000, New York: McGraw-Hill, 4
McD Young R, Lawford BR, Nutting A, Noble EP: Advances in molecular genetics and the prevention and treatment of substance misuse: Implications of association studies of the A1 allele of the D2 dopamine receptor gene. Addict Behav. 2004, 29: 1275-1294. 10.1016/j.addbeh.2004.06.012.
Munafò MR, Clark TG, Johnstone EC, Murphy MFG, Walton RT: The genetic basis for smoking behavior: A systematic review and meta-analysis. Nicotine Tob Res. 2004, 6: 583-597. 10.1080/14622200410001734030.
Munafò MR, Matheson IJ, Flint J: Association of the DRD2 gene Taq1A polymorphism and alcoholism: a meta-analysis of case-control studies and evidence of publication bias. Mol Psychiatry. 2007, 12: 454-461.
Allen NC, Bagade S, McQueen MB, Ioannidis JPA, Kavvoura FK, Khoury MJ, Tanzi RE, Bertram L: Systematic meta-analyses and field synopsis of genetic association studies in schizophrenia: the szGene database. Nat Genet. 2008, 40: 827-834. 10.1038/ng.171.
Neville MJ, Johnstone EC, Walton RT: Identification and characterization of ANKK1: a novel kinase gene closely linked to DRD2 on chromosome band 11q23.1. Hum Mutat. 2004, 23: 540-545. 10.1002/humu.20039.
Kidd KK, Castiglione CM, Pakstis AJ, Speed WS, Bonne-Tamir B, Lu R-B, Goldman D, Lee C, Nam YS, Grandy DK, Jenkins T, Kidd JR: A global survey of haplotype frequencies and linkage disequilibrium at the DRD2 locus. Hum Genet. 1998, 103: 211-227. 10.1007/s004390050809.
Chakrabarti CS, Roy M, Sengupta NK, Lalthantluanga R, Majumder PP: Genetic relationships among some tribal groups inhabiting the north-eastern, eastern and sub-Himalayan regions of India. Ann Hum Genet. 2002, 66: 361-368. 10.1046/j.1469-1809.2002.00132.x.
Basu A, Mukherjee N, Roy S, Sengupta S, Banerjee S, Chakraborty M, Dey B, Roy M, Roy B, Bhattacharyya NP, Roychoudhury S, Majumder PP: Ethnic India: a genomic view, with special reference to peopling and structure. Genome Res. 2003, 13: 2277-2290. 10.1101/gr.1413403.
Vishwanathan H, Deepa E, Cordaux R, Stoneking M, Usha Rani MV, Majumder PP: Genetic structure and affinities among tribal populations of southern India: a study of 24 autosomal DNA markers. Ann Hum Genet. 2004, 68: 128-138. 10.1046/j.1529-8817.2003.00083.x.
Bhaskar LV, Thangaraj K, Mulligan CJ, Rao AP, Pardhasaradhi G, Kumar KP, Shah AM, Sabeera B, Reddy AG, Singh L, Rao VR: Allelic variation and haplotype structure of the dopamine receptor gene DRD2 in nine Indian populations. Genet Test. 2008, 12: 153-160. 10.1089/gte.2007.0073.
Castiglione CM, Deinard AS, Speed WC, Sirugo G, Rosenbaum HC, Zhang Y, Grandy DK, Grigorenko EL, Bonne-Tamir B, Pakstis AJ, Kidd JR, Kidd KK: Evolution of haplotypes at the DRD2 locus. Am J Hum Genet. 1995, 57: 1445-1456.
Kidd KK, Pakstis AJ, Castiglione CM, Kidd JR, Speed WC, Goldman D, Knowler WC, Lu R-B, Bonne-Tamir B: DRD2 haplotypes containing the TaqI A1 allele: implications for alcoholism research. Alcohol Clin Exp Res. 1996, 20: 697-705. 10.1111/j.1530-0277.1996.tb01674.x.
Iyengar S, Seaman M, Deinard AS, Rosenbaum HC, Sirugo G, Castiglione CM, Kidd JR, Kidd KK: Analyses of cross species polymerase chain reaction products to infer the ancestral state of human polymorphisms. DNA Seq. 1998, 8: 317-327. 10.3109/10425179809034076.
Chimpanzee Sequencing and Analysis Consortium: Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005, 437: 69-87. 10.1038/nature04072.
Relethford JH: Genetics and modern human origins. Evol Anthropol. 1995, 4: 53-63. 10.1002/evan.1360040207.
Makkay J: An archeologist speculates on the origin of the Finno-Ugrians. Mankind Quaterly. 2003, 43: 235-271.
Mallory JP: In search of the Indo-Europeans. Language, Archaeology and Myth. 1989, London: Thames and Hudson
Sedov VV: Slavyane v rannem srednevekovye (The Slavs in the Early Middle Ages). 1995, Moscow: Archaeological Fund, (in Russian)
Alekseeva TI: Etnogenez vostochnyh slavyan soglasno antropologicheskim dannym (Ethnogenesis of Eastern Slavs according to anthropological data). 1973, Moscow: Moscow State University, (in Russian)
Maca-Meyer N, González AM, Larruga JM, Flores C, Cabrera VM: Major genomic mitochondrial lineages delineate early human expansions. BMC Genet. 2001, 2: 13-10.1186/1471-2156-2-13.
Mellars P: A new radiocarbon revolution and the dispersal of modern humans in Eurasia. Nature. 2006, 439: 931-935. 10.1038/nature04521.
Zvelebil M: Revisiting Indreko's culture historical model: "Origin and area of settlement of the Finno-Ugrian peoples". TRAMES: A Journal of the Humanities and Social Sciences. 2001, 5: 26-47.
Nuñez MG: Old and new ideas about the origins of the Finns and Saami. The Roots of Peoples and Languages of Northern Eurasia I (Turku 30.5.-1.6.1997). Edited by: Julku K, Wiik K. 1998, Turku: Societas Historiae Fenno-Ugricae, 151-160.
Dolukhanov P: 'Prehistoric revolutions' and languages in Europe. The roots of peoples and languages of Northern Eurasia II and III. Edited by: Künnap A. 2000, Tartu: Tartu University Press, 71-84.
Poikalainen V: Palaeolithic art from the Danube to Lake Baikal. Folklore. 2001, 18/19: 7-60.
Tambets K, Tolk H-V, Kivisild T, Metspalu E, Parik J, Reidla M, Voevoda M, Damba L, Bermisheva M, Khusnutdinova E, Golubenko M, Stepanov V, Puzyrev V, Usanga E, Rudan P, Beckmann L, Villems R: Complex signals for population expansions in Europe and beyond. Examining the farming/language dispersal hypothesis. Edited by: Bellwood P, Renfrew C. 2002, Cambridge: McDonald Institute
Wiik K: Some ancient and modern linguistic processes in northern Europe. Time depth in historical linguistics (Papers in the prehistory of languages). Edited by: Renfrew C, McMahon A, Trask L. 2000, Cambridge: McDonald Institute for Archaeological Research, 463-479.
Wiik K: Who are the Finns?. A Man of Measure: Festschrift in Honour of Fred Karlsson on his 60th Birthday. Edited by: Suominen M, Arppe A, Airola A, Heinämäki O, Miestamo M, Määttä U, Niemi J, Pitkänen KK, Sinnemäki K. 2006, Turku: The Linguistic Association of Finland, 97-108.
Orekhov V, Poltaraus A, Zhivotovsky LA, Spitsyn V, Ivanov P, Yankovsky N: Mitochondrial DNA sequence diversity in Russians. FEBS Lett. 1999, 445: 197-201. 10.1016/S0014-5793(99)00115-5.
Malyarchuk BA, Derenko MV: Mitochondrial DNA variability in Russians and Ukrainians: Implication to the origin of the Eastern Slavs. Ann Hum Genet. 2001, 65: 63-78. 10.1046/j.1469-1809.2001.6510063.x.
Belyaeva O, Bermisheva M, Khrunin A, Slominsky P, Bebyakova N, Khusnutdinova E, Mikulich A, Limborska S: Mitochondrial DNA variation in Russian and Belorussian populations. Hum Biol. 2003, 75: 647-660. 10.1353/hub.2003.0069.
Malyarchuk B, Derenko M, Grzybowski T, Lunkina A, Czarny J, Rychkov S, Morozova I, Denisova G, Miścicka-Śliwka D: Differentiation of mitochondrial DNA and Y chromosomes in Russian populations. Hum Biol. 2004, 76: 877-900. 10.1353/hub.2005.0021.
Malyarchuk BA, Perkova MA, Derenko MV: On the origin of mongoloid component in the mitochondrial gene pool of Slavs. Russ J Genet. 2008, 44: 344-349.
Khar'kov VN, Stepanov VA, Borinskaya SA, Kozhekbaeva ZhM, Gusar VA, Grechanina EYa, Puzyrev VP, Khusnutdinova EK, Yankovsky NK: Gene pool structure of eastern Ukrainians as inferred from the Y-chromosome haplogroups. Russ J Genet. 2004, 40: 415-421.
Balanovsky O, Rootsi S, Pshenichnov A, Kivisild T, Churnosov M, Evseeva I, Pocheshkhova E, Boldyreva M, Yankovsky N, Balanovska E, Villems R: Two sources of the Russian patrilineal heritage in their Eurasian context. Am J Hum Genet. 2008, 82: 236-250. 10.1016/j.ajhg.2007.09.019.
Verbenko DA, Knjazev AN, Mikulich AI, Khusnutdinova EK, Bebyakova NA, Limborska SA: Variability of the 3'APOB minisatellite locus in eastern Slavonic populations. Hum Hered. 2005, 60: 10-18. 10.1159/000087338.
Verbenko DA, Slominsky PA, Spitsyn VA, Bebyakova NA, Khusnutdinova EK, Mikulich AI, Tarskaia LA, Sorensen MV, Ivanov VP, Bets LV, Limborska SA: Polymorphisms at locus D1S80 and other hypervariable regions in the analysis of Eastern European ethnic group relationships. Ann Hum Biol. 2006, 33: 570-584. 10.1080/03014460601012077.
Khrunin AV, Tarskaia LA, Spitsyn VA, Lylova OI, Bebyakova NA, Mikulich AI, Limborska SA: p53 polymorphisms in Russia and Belarus: correlation of the 2-1-1 haplotype frequency with longitude. Mol Gen Genomics. 2005, 272: 666-672. 10.1007/s00438-004-1091-8.
Bermisheva M, Tambets K, Villems R, Khusnutdinova E: Diversity of mitochondrial DNA haplogroups in ethnic populations of the Volga-Ural region. Mol Biol (Moscow). 2002, 36: 990-1001.
Derbeneva OA, Starikovskaya EB, Wallace DC, Sukernik RI: Traces of early Eurasians in the Mansi of Northwest Siberia revealed by mitochondrial DNA analysis. Am J Hum Genet. 2002, 70: 1009-1014. 10.1086/339524.
Derbeneva OA, Starikovskaya EB, Volodko NV, Wallace DC, Sukernik RI: Mitochondrial DNA Variation in the Kets and Nganasans and its implications for the initial peopling of Northern Eurasia. Russ J Genet. 2002, 38: 1554-1560. 10.1023/A:1021111530654.
Malyarchuk BA: Differentiation of the mitochondrial subhaplogroup U4 in the populations of Eastern Europe, Ural and West Siberia: Implication for the genetic history of the Uralic populations. Russ J Genet. 2004, 40: 1549-1556. 10.1023/B:RUGE.0000048671.32870.cb.
Tambets K: Towards the understanding of post-glacial spread of human mitochondrial DNA haplogroups in Europe and beyond: a phylogeographic approach. PhD thesis. 2004, Tartu University
Starikovskaya EB, Sukernik RI, Derbeneva OA, Volodko NV, Ruiz-Pesini E, Torroni A, Bromn MD, Lott MT, Hosseini SH, Huoponen K, Wallace DC: Mitochondrial DNA diversity in indigenous populations of the southern extent of Siberia, and the origins of Native American haplogroups. Ann Hum Genet. 2005, 69: 67-87. 10.1046/j.1529-8817.2003.00127.x.
Rootsi S, Zhivotovsky LA, Baldović M, Kayser M, Kutuev IA, Khusainova R, Bermisheva MA, Gubina M, Fedorova SA, Ilumäe A-M, Khusnutdinova EK, Voevoda MI, Osipova LP, Stoneking M, Lin AA, Ferak V, Parik J, Kivisild T, Underhill PA, Villems R: A counter-clockwise northern route of the Y-chromosome haplogroup N from Southeast Asia towards Europe. Eur J Hum Genet. 2007, 15: 204-211. 10.1038/sj.ejhg.5201748.
Derenko M, Malyarchuk B, Grzybowski T, Denisova G, Dambueva I, Perkova M, Dorzhu C, Luzina F, Lee HK, Vanecek T, Villems R, Zakharov I: Phylogeographic analysis of mitochondrial DNA in Northern Asian populations. Am J Hum Genet. 2007, 81: 1025-1041. 10.1086/522933.
Sedov VV: Finno-ugry i balty v epohu srednevekovya (Finno-Ugrians and Balts in the Middle Ages). 1987, Moscow: Nauka, (in Russian)
Kasperavićiūtė D, Kućinskas V, Stoneking M: Y chromosome and mitochondrial DNA variation in Lithuanians. Ann Hum Genet. 2004, 68: 438-452. 10.1046/j.1529-8817.2003.00119.x.
Pliss L, Tambets K, Loogväli E-L, Pronina N, Lazdins M, Krumina A, Baumanis V, Villems R: Mitochondrial DNA portrait of Latvians: towards the understanding of the genetic structure of Baltic-speaking populations. Ann Hum Genet. 2006, 70: 439-458. 10.1111/j.1469-1809.2005.00238.x.
Leont'ev AE: Arheologiya Meri. K predystorii Severo-Vostochnoj Rusi (The Archaeology of the Merya. The Early History of North-Eastern Russia). Russian Monographs in Migration-period and Medieval Archaeology. Edited by: Afanas'ev GE, Daim F, Kidd D. 1996, Moscow: Institute of Archaeology, Russian Academy of Sciences, 4: (in Russian)
Moiseyev V: Origins of Uralic-speaking populations: craniological evidence. HOMO. 2002, 52: 240-253. 10.1078/0018-442X-00032.
Khrunin AV, Verbenko DA, Nikitina K, Limborska SA: Regional differences in the genetic variability of Finno-Ugric speaking Komi populations. Am J Hum Biol. 2007, 19: 741-750. 10.1002/ajhb.20620.
Pakendorf B, Novgorodov IN, Osakovskij VL, Danilova AP, Protod'jakonov AP, Stoneking M: Investigating the effects of prehistoric migrations in Siberia: genetic variation and the origins of Yakuts. Hum Genet. 2006, 120: 334-353. 10.1007/s00439-006-0213-2.
Milligan BG: Total DNA isolation. Molecular genetic analysis of populations. Edited by: Hoelzel AR. 1998, London: Oxford University Press, 29-60.
Poloni ES, Semino O, Passarino G, Santachiara-Benerecetti AS, Dupanloup I, Langaney A, Excoffier L: Human genetic affinities for Y-chromosome P49a,f/TaqI haplotypes show strong correspondence with linguistics. Am J Hum Genet. 1997, 61: 1015-1035. 10.1086/301602.
Lewontin RC: On measures of gametic disequilibrium. Genetics. 1988, 120: 849-852.
Excoffier L, Slatkin M: Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Mol Biol Evol. 1995, 12: 921-927.
Reynolds J, Weir BS, Cockerham CC: Estimation of the coancestry coefficient: basis for a short-term genetic distance. Genetics. 1983, 105: 767-779.
Excoffier L, Smouse PE, Quattro JM: Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992, 131: 479-491.
Khabarov VF, Zhukovsky VYe, Krayuhin AN, et al: Natsionalny atlas Rossii (The National Atlas of Russia). 2004, Moscow: Roskartografiya, 1: (in Russian)
Verbenko DA, Pocheshkhova EA, Balanovskaya EV, Marshanija EZ, Kvitzinija PK, Limborska SA: Polymorphisms of D1S80 and 3'ApoB minisatellite loci in Northern Caucasus populations. J Forensic Sci. 2004, 49: 178-180. 10.1520/JFS2003253.
The study was supported by the Russian Ministry of Science and Education program "Federal Support of Leading Scientific Schools" (grant No 3911.2008-04), by the Russian Academy of Sciences programs "Molecular and Cell Biology" and "Basic Science for the Advantage of Medicine", and by the Russian Foundation for Basic Research grant No 07-04-00027-a.
OVF carried out the polymorphism typing, performed the statistical analysis and drafted the manuscript. OIL participated in the polymorphism typing. AVK participated in the study design and helped with the statistical analysis and manuscript drafting. LAT, VAS, and AIM carried out sample collection and participated in the study design. SAL conceived of the study, participated in its coordination and helped to draft the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Frequencies of TaqI B-TaqI D-TaqI A haplotypes in the study populations and in reference populations. This is an extended variant of Table 3 including haplotype frequencies for all populations represented in Figure 3. (DOC 140 KB)
About this article
Cite this article
Flegontova, O.V., Khrunin, A.V., Lylova, O.I. et al. Haplotype frequencies at the DRD2 locus in populations of the East European Plain. BMC Genet 10, 62 (2009) doi:10.1186/1471-2156-10-62
- Haplotype Frequency
- DRD2 Gene
- East European Plain
- Siberian Population
- Arkhangelsk Region