Genetic diversities of cytochrome B in Xinjiang Uyghur unveiled its origin and migration history
- Abdurahman Ablimit†1,
- Wenbei Qin†1,
- Wenjuan Shan1,
- Weiwei Wu2,
- Fengjun Ling3,
- Kaitelynn H Ling4,
- Changjie Zhao4, 5,
- Fuchun Zhang1,
- Zhenghai Ma1Email author and
- Xiufen Zheng1, 4, 6, 7Email author
© Ablimit et al.; licensee BioMed Central Ltd. 2013
Received: 9 December 2012
Accepted: 4 October 2013
Published: 9 October 2013
Uyghurs are one of the many populations of Central Eurasia that is considered to be genetically related to Eastern and Western Eurasian populations. However, there are some different opinions on the relative importance of the degree of Eastern and Western Eurasian genetic influence. In addition, the genetic diversity of the Uyghur in different geographic locations has not been clearly studied.
In this study, we are the first to report on the DNA polymorphism of cytochrome B in the Uyghur population located in Xinjiang in northwest China. We observed a total of 102 mutant sites in the 240 samples that were studied. The average number of mutated nucleotides in the samples was 5.126. A total of 93 different haplotypes were observed. The gene diversity and discrimination power were 0.9480 and 0.9440, respectively. There were founder and bottleneck haplotypes observed in Xinjiang Uyghurs. Xinjiang Uyghurs are more genetically related to Chinese population in genetics than to Caucasians. Moreover, there was genetic diversity between Uyghurs from the southern and northern regions. There was significance in genetic distance between the southern Xinjiang Uyghurs and Chinese population, but not between the northern Xinjiang Uyghurs and Chinese. The European vs. East Asian contribution to the ten regional Uyghur groups varies among the groups and the European contribution to the Uyghur increases from north to south geographically.
This study is the first report on DNA polymorphisms of cytochrome B in the Uyghur population. The study also further confirms that there are significant genetic differences among the Uyghurs in different geographical locations.
KeywordsMitochondrial DNA Cytochrome B Uyghur Genetic diversity
Uyghur is one of the populations living in central Asian where people have undergone unceasing migration and interacted with other populations in prehistoric and historic times. As a consequence, Uyghurs share anthropometric and genetic traits with Eastern and Western Eurasian. They are genetically considered to be an admixture of Eastern and Western Eurasian populations, demonstrated by archaeological, anthropologic and genetic studies [1–6]. The Uyghur population has been used as a good human population sample for studying the human migration and gene inflow/drafting [3, 7]. However, the origin of the Uyghur is under discussion. Modern Uyghurs live primarily in the Xinjiang Uyghur Autonomous Region in China. The majority of the Xinjiang Uyghur live in the south of Xinjiang (south of Tian Shan), with minor populations spreading to the north. Due to environmental, war and political reasons, Uyghurs have undergone unceasing migration (from north to southwest) and interacted with other populations in prehistoric and historic times. This might result in genetically differential characteristics of modern Uyghurs living in different regions. However, most genetic studies of the Uyghur have focused on genetic difference between them and other ethnic groups, and utilized relatively small numbers of sample from very limited geographic locations. The differences in genetic characters between different regional Uyghur populations remain largely unclear. Furthermore, previous studies mainly investigated and compared the genetic diversity of mtDNA control region sequences, while DNA polymorphism of mitochondrial Cytochrome B (CYTB) in the Uyghur population in Xinjiang has not been reported.
CYTB is located in the coding region of mtDNA (positions 14576–16047), spanning the 1140 bp fragment. CYTB has undergone changes during evolution resulting in the occurrence of multiple single nucleotide polymorphic sites [8, 9]. Compared with the control region of mtDNA, CYTB is relatively conservative and has fewer mutations [8, 10]. CYTB is widely used in taxonomic research to determine phylogenetic relationships between organisms due to its sequence variability [11–13]. It is also considered to be most useful in determining relationships within families and genera. CYTB sequence information drawn from populations reflects their matrilineal ethnohistory and can deduce DNA sample's population based on population-specific nucleotide mutations existing in the CYTB gene [9, 14]. Therefore, we hypothesized that the genetic diversity of Uyghur living in different regions of Xinjiang can be identified by the analysis of CYTB sequence.
In this study, we for the first time report on the gene diversity of CYTB in the Uyghur of Xinjiang by completely sequencing their CYTB gene. We demonstrate that there was a genetic variance of CYTB occurring in the different regional Uyghurs and CYTB is a very useful genetic marker for the study of genetic differentiation of Uyghurs in Xinjiang. In particular, we showed the founder/ bottleneck event in the Xinjiang Uyghur.
High gene diversity of cytochrome B in the Uyghur population
1190 bp PCR fragments were amplified in all studied samples and 1140 bp of DNA fragment encoding CYTB (positions 14747–15886) were sequenced. Compared with the standard reference sequence (rCRS), a total of 102 mutant sites were observed in the studied 240 samples (Additional file 1: Table S1). The number of mutation sites in the samples varied from 0 to 10 with an average of 5.126. There were no deletions and/or insertions observed.
The frequency of population-specific polymorphic mutant nucleotides
C → T
T → C
T → C
T → C
A → G
G → A
G → A
A → G
T → C
A → G
Studying mutations among the regions, we observed the presence of regional group specific mutations in the CYTB sequence (Table 1). The mutations 14927 A/G and 15440 T/C were only observed in the Bortala with a frequency of 15.38%, 15746 A/G only in Kumul (30.00%), and 14857 T/C only in Turpan (11.11%). Only three samples of the total 240 Uyghurs showed a nucleotide change at position 14798 which occurs more frequently in Caucasians with a frequency of 0.2083, but not in other Asian groups .
The gene diversity, nucleotide diversity, and discrimination power noted in these populations
0.9546 ± 0.024
0.003450 ± 0.0019
0.9529 ± 0.026
0.003582 ± 0.0020
0.9744 ± 0.038
0.004026 ± 0.0023
0.9591 ± 0.030
0.003052 ± 0.0018
0.9708 ± 0.027
0.003098 ± 0.0018
0.9590 ± 0.014
0.003300 ± 0.0018
0.8800 ± 0.051
0.007863 ± 0.0041
0.9333 ± 0.077
0.003957 ± 0.0024
0.9111 ± 0.077
0.002690 ± 0.0017
0.9085 ± 0.051
0.003125 ± 0.0018
0.8864 ± 0.034
0.0028 ± 0.0016
0.9565 ± 0.022
0.0033 ± 0.0019
Founder / bottleneck haplotypes of CYTB in the Uyghur
Grouping samples into different geographic locations; there were 24 haplotypes observed in 35 Aksu individuals, 16 in 24 Atush, 11 in 13 Bortala, 14 in 19 Gulja, 15 in 19 Hotan, 41 in 67 Kashgar, 13 in 25 Korla, 8 in10 Kumul, 7 in 10 Sanji, and 11 in 18 Turpan individuals. Haplotype 15 (15326G) was predominantly found in the Aksu with a frequency of 20% and in the Kashgar with a frequency of 14.93%. Haplotype 13 (14766 T, 15326G) existed most frequently in the Turpan (27.78%), Atush (16.67%) and Korla (16%), but was not in the Bortala nor Kumul. Haplotype 11 (14766 T, 14783C, 15043A, 15301A, 15326G) appeared frequently in the Korla (32%), Sanji (30%), Turpan (16.67%), and Atush (12.5%). Haplotype 83 (14766 T, 14783C, 15043A, 15301A, 15326G, 15746G) was only observed in the Kumul group with a frequency of 30%, but not in other groups thereby showing region-specificity. Haplotype 89 (14766 T, 14783C, 14857C, 15043A, 15301A, 15326G) was observed only in the Turpan with a frequency of 11.11%, while haplotype 59 (14766 T, 14783C, 15043A, 15301A, 15326G, 15673A) and haplotype 66 (14766 T, 15314A, 15326G, 15452A) were only found in the Kashgar group with a frequency of 2.99%, respectively.
Different regional Uyghur represent different genetic features
Tianshan and the ancient Silk road are a geographic dividing line separating Xinjiang into two parts: Southern Xinjiang (Nanjiang) and Northern Xinjiang (Beijiang). Nanjiang consists of Kashgar, Aksu, Hotan, Atush and Korla, while Kumul, Turpan, Sanji, Gulja and Bortala belong to Beijiang. Accordingly we divided ten regional Xinjiang Uyghurs into two subgroups: Southern Uyghur (groups living in Nanjiang) and Northern Uyghur groups (living in Beijiang). We then performed MDS analysis with the Southern and Northern Uyghur and other populations [14–18] in order to reveal the genetic relationships among these populations. Figure 3B shows that both Northern and Southern Uyghurs and other 4 Asian populations (Thai, Vietnamese, Filipino and Chinese) were distributed closely to the left in MDS plot and formed a cluster, while Caucasians were isolated far away from them and the cluster, suggesting that both Southern and Northern Uyghur are closer to Asian in genetics than to Caucasian. In addition, the Northern Uyghur was separated from the Southern Uyghur. In comparison with the Southern Uyghur, the Northern Uyghur were more closely located to the Chinese.
Furthermore, we noticed that Gulja Uyghurs who geographically reside in northern Xinjiang appeared closely related to the southern subgroups (Hotan, Kashgar and Aksu) in the MDS plot, while geographic southern Uyghur subgroups Korla and Atush were close to the northern subgroups (Kumul, Sanji and Turpan) as shown in Figure 3A. Therefore we regrouped Uyghur samples by subtracting Gulja from northern group, Korla and Atush from the southern group, and conducted the third MDS analysis using new designed groups (Figure 3C). This regrouped MDS plot displayed that the northern group (Turpan, Kumul, Sanji and Bortala) was separated further away from the southern group (Hotan, Aksu and Kashgar). Korla was distributed between Chinese and the new northern group with similarly close distance to both of them. The Gulja and the new southern group formed a cluster, clearly separated from the northern group and Caucasian. In contrast, Atush was distributed relatively more closely to northern Uyghur.
The patterns of haplotypes of CYTB reflected migration history and origin of Uyghur
Genetic distances between ethnic groups
Genetic distances between ethnic groups
Fst based on Tajia Nei
In this study we first reported the genetic polymorphisms of cytochrome B in Uyghur population of Xinjiang. We demonstrated that cytochrome B is a very useful and informative marker for matrilineal identification and population studies, second to the mtDNA control region. There are population specific and regional population specific nucleotide positions in the cytochrome B sequence, which can help identify DNA sample's region and population information. The results of the present study conclude that: 1) there are founder/ bottle neck events in Xinjiang Uyghur; 2) the Uyghur is closer to Chinese, rather than to Caucasian in genetic distance; 3) There are genetic difference between the Southern and Northern Uyghur; 4) Gene influence from Asian (Chinese) is stronger on the Northern Uyghur, than the Southern Uyghur; and 5) CYTB is a good genetic marker for differentiation of subgroups.
Xinjiang is located in Central Asia which is an intermediate region of the Eurasian continent. There were 36 nations existing in Xinjiang during prehistorical and historical times. Some nations were defeated by others and destroyed, overrun or forced to migrate to other places. Unceasing wars propelled endless migration and coalition, leading to genetic mixture and gene drafting. Additionally, there were two silk roads extending from east to west through Xinjiang, resulting in extensive exchange of culture and marriage between different populations. Our data confirm that Uyghur is an admixture population with contributions from both Eastern and Western Eurasian ancestries, which is consistent with the results from mtDNA control region sequences [3–6]. Based on the mutation at position 14798, the CYTB sequences of the Uyghur in Xinjiang contain more inflow of East Asian than European. The proportion of European sequences varies from different geographic regions of the Uyghur (data not shown).
The Uyghurs are a Turkic ethnic group living in Eastern and Central Asia. The ancestors of the Uyghur tribe were Turkic pastoralists called Tiele in Northern China, Mongolia, and the Altay Mountains. Due to wars and environmental stress, Uyghur continued migrating from north to south. From the historic perspective, Uyghur originated from Mongolians, rather than Caucasians and inherited Mongolia genetics. The Uyghur population was gradually diluted as they migrated from north to south. Our data on the frequency of triple 14783 T/C-15043G/A-15301G/A mutant and founder/bottle neck haplotypes demonstrated that the Uyghur originated from Mongolia, migrated from north to south, supporting historic reports.
In this study, we found that nucleotide positions with most frequent mutation were 14766, 14783, 15043, and 15302, which were most commonly seen in Asians including the Chinese Han population . Mutation at position 14766 occurs in more than 93% of Asian samples, but less than 0.72% of Caucasians [8, 14]. The frequencies of mutation at the position 14766 in the ten different geographic Uyghur samples varied from 0.73 to 0.94. The frequencies of 14766 mutation declined from the eastern Tarim Basin to the western Tarim Basin, reflecting a clear geographic pattern. This implies that the East Asian portion of genetics was gradually diluted as people migrated from the east to the west of Xinjiang, and from north to south. Mutations occurring at position of 14783, 15043, and 15302 showed the same trend among the ten Uyghur groups as position 14766. Mutations at the above three positions were seldom observed in the Caucasian population. In contrast, mutation at position 14798 that is Caucasian specific was observed in only one Aksu sample and two Hotan samples. Taken together, we found that at the cytochrome B gene, Uyghurs generally have more imprint of East Asian genetic portion than that of Caucasian. These genetic results are in agreement with the history of Uyghur formation. The Uyghurs living in the Nanjiang are different to Beijiang, and the southern Uyghur have relatively less influence from East Asians than the northern. Our data showed that in the same ethnic group, people residing at different places have genetic differences.
In this study, we reported that Uyghur from different geographic locations in Xinjiang have differences in the percentage of European/East Asian ancestry component, distinctly by the difference between Aksu/Hotan and Kumul or Korla. There is a significant difference between the south (Hotan, Kashgar, and Aksu ) and the north of Xinjiang (Kumul, Bortala, Turpan and Sanji). East Asian ancestry dominates the Uyghur from the north, while European ancestry imprints more on the southern Uyghur. In addition, we observed that Uyghurs living in two geographically south cities Atush and Korla shared northern Uyghur genetics (Kumul, Turpan and Sanji), in contrast with the southern Uyghur, while Uyghurs from Gulja which is located in the north of Tianshan in geography presented the southern regional Uyghur genetic features. Demographic records show that the dominant ethnic group is Kyrgyz who defeated the Uyghurs in AD 840, despite the fact that Atush is located in the south of Tianshan [Wikipedia, the free encyclopedia, History of the Kyrgyzstan]. Moreover, there were intermarriages in the Uyghur living in the Atush region. This might explain why the gene pattern of cytochrome B in the Atush Uyghurs was between that of the southern and northern Uyghur group, different from the other three southern group samples. As for Gulja, there were two large migrations of the southern Uyghur from south to north Gulja in history. Over time, the migrated southern Uyghurs expanded in Gulja and became the largest minority in Gulja [Wikipedia, the free encyclopedia, History of the Uyghur people.]. As a result, the Gulja Uyghur was and still is a branch of southern Uyghur, having little relationship with the northern Uyghur, which is in line with our data that the Uyghur from Gulja shares similar genetic characters with the Southern Uyghur. Although Korla is located in the south of Tianshan belonging to Nanjiang ( southern of Xinjiang), it is also adjacent to Turpan and Sanji. Korla is located in Bayingolin Mongol Autonomous Prefecture where the current major population is Chinese Han. As early as 94 CE, the Chinese government and military started to administrate this area and forced different populations to exchange and mix. The results from CYTB genotyping are consistent with that from the control region of mtDNA (unpublished data). Our data showed that the Uyghurs from Korla in genetics are different to the south Xinjiang Uyghurs.
Cytochrome B is a very useful DNA marker with high discriminate power for matrilineal identification, as well as deduction of the region and population of people. The polymorphisms of CYTB were significantly different between different geographical Uyghur (between south and north). The influences of East Asian and European genetics in Uyghur varies between different geographic locations (particularly south and north) of Uyghur.
PCR and sequencing
Sequences of primers used for mitochondrial DNA amplification and sequencing
PCR products were visualized on a 1.5% agarose gel and purified using a Biomiga kit (Biomiga, Beijing, China). Sequencing reactions were conducted in a PE-9600 thermocycler (ABI Applied Biosystems, Fostor City, CA, USA), using a BigDye terminator v3.1 Cycle sequencing kit (Applied Biosystems) with the following conditions: 25 cycles of 95°C for 30 s; 50°C for 30 s; and 60°C for 4 min. The primers L14724, H15149, L15283 and H15363 were used for sequencing listed in Table 6. The DNA sequences were detected with an ABI3730 DNA sequencer (Applied Biosystems) in HuaDa Genome Centre, Beijing.
Sequences were aligned using MEGA5.0 and compared with the standard reference sequence (rCRS) (http://www.megasoftware.net/mega.php). The haplotypes of CYTB were designed using DnaSP5.0. The genetic distances between Xinjiang regional Uyghur populations and other populations were determined by the analysis of molecular variance using ARLEQUIN3.5 (http://cmpg.unibe.ch/software/arlequin3). A difference with a P-value < 0.05 was considered statistically significant. Multidimensional scaling (MDS) plot of mtDNA was created by SPSS 19 for studying the genetic relationship between populations. Phylogenetic trees were constructed by the neighbor-joining method using a Phylip 3.69 program (http://en.bio-soft.net/tree/Phylip.html). Median networks were constructed using NETWORK 18.104.22.168 software (http://www.Fluxus-engineering.com/sharenet.htm).
The study was supported by grants from National Natural Science Foundation of 31260267, Heart and Stroke Foundation of Canada, and Lawson Health Research Institute.
- Comas D, Calafell F, Mateu E, Perez-Lezaun A, Bosch E, Martinez-Arias R, Clarimon J, Facchini F, Fiori G, Luiselli D, et al: Trading genes along the silk road: mtDNA sequences and the origin of central Asian populations. Am J Hum Genet. 1998, 63 (6): 1824-1838. 10.1086/302133.PubMed CentralView ArticlePubMedGoogle Scholar
- Comas D, Plaza S, Wells RS, Yuldaseva N, Lao O, Calafell F, Bertranpetit J: Admixture, migrations, and dispersals in Central Asia: evidence from maternal DNA lineages. Eur J Hum Genet. 2004, 12 (6): 495-504. 10.1038/sj.ejhg.5201160.View ArticlePubMedGoogle Scholar
- Yao YG, Kong QP, Wang CY, Zhu CL, Zhang YP: Different matrilineal contributions to genetic structure of ethnic groups in the silk road region in china. Mol Biol Evol. 2004, 21 (12): 2265-2280. 10.1093/molbev/msh238.View ArticlePubMedGoogle Scholar
- Xu S, Jin L: A genome-wide analysis of admixture in Uyghurs and a high-density admixture map for disease-gene discovery. Am J Hum Genet. 2008, 83 (3): 322-336. 10.1016/j.ajhg.2008.08.001.PubMed CentralView ArticlePubMedGoogle Scholar
- Xu S, Huang W, Qian J, Jin L: Analysis of genomic admixture in Uyghur and its implication in mapping strategy. Am J Hum Genet. 2008, 82 (4): 883-894. 10.1016/j.ajhg.2008.01.017.PubMed CentralView ArticlePubMedGoogle Scholar
- Xu S, Jin W, Jin L: Haplotype-sharing analysis showing Uyghurs are unlikely genetic donors. Mol Biol Evol. 2009, 26 (10): 2197-2206. 10.1093/molbev/msp130.View ArticlePubMedGoogle Scholar
- Lei SF, Yang TL, Tan LJ, Chen XD, Guo Y, Guo YF, Zhang L, Liu XG, Yan H, Pan F, et al: Genome-wide association scan for stature in Chinese: evidence for ethnic specific loci. Hum Genet. 2009, 125 (1): 1-9. 10.1007/s00439-008-0590-9.PubMed CentralView ArticlePubMedGoogle Scholar
- Andreu AL, Bruno C, Hadjigeorgiou GM, Shanske S, DiMauro S: Polymorphic variants in the human mitochondrial cytochrome b gene. Mol Genet Metab. 1999, 67 (1): 49-52. 10.1006/mgme.1999.2843.View ArticlePubMedGoogle Scholar
- Lee SD, Lee YS, Lee JB: Polymorphism in the mitochondrial cytochrome B gene in Koreans. An additional marker for individual identification. Int J Legal Med. 2002, 116 (2): 74-78. 10.1007/s004140100238.View ArticlePubMedGoogle Scholar
- Esposti MD, De Vries S, Crimi M, Ghelli A, Patarnello T, Meyer A: Mitochondrial cytochrome b: evolution and structure of the protein. Biochim Biophys Acta. 1993, 1143 (3): 243-271. 10.1016/0005-2728(93)90197-N.View ArticlePubMedGoogle Scholar
- Bataille M, Crainic K, Leterreux M, Durigon M, de Mazancourt P: Multiplex amplification of mitochondrial DNA for human and species identification in forensic evaluation. Forensic Sci Int. 1999, 99 (3): 165-170. 10.1016/S0379-0738(98)00185-6.View ArticlePubMedGoogle Scholar
- Forrest AR, Carnegie PR: Identification of gourmet meat using FINS (forensically informative nucleotide sequencing). Biotechniques. 1994, 17 (1): 24-26.PubMedGoogle Scholar
- Parson W, Pegoraro K, Niederstatter H, Foger M, Steinlechner M: Species identification by means of the cytochrome b gene. Int J Legal Med. 2000, 114 (1–2): 23-28.View ArticlePubMedGoogle Scholar
- Hwa HL, Ko TM, Chen YC, Chang YY, Tseng LH, Su YN, Lee JC: Study of the cytochrome b gene sequence in populations of Taiwan. J Forensic Sci. 2010, 55 (1): 167-170. 10.1111/j.1556-4029.2009.01195.x.View ArticlePubMedGoogle Scholar
- Szibor R, Michael M, Spitsyn VA, Plate I, Ginter EK, Krause D: Mitochondrial D-loop 3′ (CA)n repeat polymorphism: optimization of analysis and population data. Electrophor. 1997, 18 (15): 2857-2860. 10.1002/elps.1150181523.View ArticleGoogle Scholar
- Chung U, Lee HY, Yoo JE, Park MJ, Shin KJ: Mitochondrial DNA CA dinucleotide repeats in Koreans: the presence of length heteroplasmy. Int J Legal Med. 2005, 119 (1): 50-53. 10.1007/s00414-004-0487-7.View ArticlePubMedGoogle Scholar
- Xiufen Zheng LW: The polymorphism of CA repeat in Chinese. Chin Forensic Sci. 1999, 14 (3): 2-Google Scholar
- Yan J, Tang H, Liu Y, Ma W, Zhang Q, Gao J, Jiao Z, Shang L, Jing Y, Hu S, et al: Allele frequencies of mitochondrial D-loop (CA)n repeat polymorphism in six Chinese ethnic groups. Leg Med (Tokyo). 2007, 9 (6): 330-331. 10.1016/j.legalmed.2007.05.002.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.