Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Paternal and Maternal Genetic Analysis of a Desert Keriyan Population: Keriyans Are Not the Descendants of Guge Tibetans

  • Kaixu Chen,

    Affiliation Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China

  • Abdurahman Ablimit,

    Affiliation Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China

  • Fengjun Ling,

    Affiliation Beijing Entry-Exit Inspection and Quarantine Bureau, Beijing, China

  • Weiwei Wu,

    Affiliation Institute of Forensic Science of Zhejiang, Hongzhou, China

  • Wenjuan Shan,

    Affiliation Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China

  • Wenbei Qin,

    Affiliation Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China

  • Tuerhong Keweier,

    Affiliation Blood Center of Urumqi, Xinjiang, China

  • Hongli Zuo,

    Affiliation Blood Center of Urumqi, Xinjiang, China

  • Fuchun Zhang,

    Affiliation Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China

  • Zhenghai Ma ,

    xzheng26@uwo.ca (XZ); zhenghai98@gmail.com (ZM)

    Affiliation Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China

  • Xiufen Zheng

    xzheng26@uwo.ca (XZ); zhenghai98@gmail.com (ZM)

    Affiliations Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, Xinjiang, China, Department of Pathology, The University of Western Ontario, London, Canada, Lawson Health Research Institute, London, Canada

Abstract

The Keriyan people live in an isolated village in the Taklimakan Desert in Xinjiang, Western China. The origin and migration of the Keriyans remains unclear. We studied paternal and maternal genetic variance through typing Y-STR loci and sequencing the complete control region of the mtDNA and compared them with other adjacent populations. Data show that the Keriyan have relatively low genetic diversity on both the paternal and maternal lineages and possess both European and Asian specific haplogroups, indicating Keriyan is an admixture population of West and East. There is a gender-bias in the extent of contribution from Europe vs. Asia to the Keriyan gene pool. Keriyans have more genetic affinity to Uyghurs than to Tibetans. The Keriyan are not the descendants of the Guge Tibetans.

Introduction

The Taklimakan Desert located in the centre of the Tarim Basin in Western China is China’s second largest desert. There is an ancient isolated village along the Keriya River in the depth of the Desert. The village with more than 200 families and over 1000 individuals spans more than 200 kilometers from the south of the Keriyan River to the north. It is said they have lived there for more than 400 years. They call themselves as “Keriyan”. The village and Keriyans were first introduced to the world by a Swedish geographer and explorer Sven Anders Hedin when he explored Western China in1896. Due to its geographical location and traffic blockage, Keriyans still live isolated from the outside of the world, except for some essential commercial trade with adjacent areas. The village is surrounded by the desert and the living environment is very harsh. Life there is very simple and primitive. Keriyans speak a kind of language similar to the Uyghur language belonging to the Altai linguistic family. However, the origin and migration history of the Keriyan is unclear. There are three hypotheses about their ancestors: one is that Keriyan people are descendants of the ancient Guge, Tibetans who climbed through the Kunlun Mountains after the collapse of the ancient Guge Kingdom in 1630 and have lived in that area since then; The second argument is that they are originally Keriyan indigenous peoples, who resided there as early as the Neolithic Age; the third argument is that they are a branch of the ancient Loulan people who mysteriously disappeared 2000 years ago, the descendants of the Mi Dynasty in the Western Han. A finer patrilineal and matrilineal genetic investigation of Keriyans would help to dissect the issue.

In this study, we collected DNA samples from the Keriyan village and genotyped them with 17 Y-chromosomal short tandem repeats (STR) or microsatellite loci and the complete control region of the mitochondria DNA. Analyses of the genetic association with other reference populations were performed to dissect the origin of the Keriyan.

Materials and Methods

1. DNA samples

A total of 70 blood samples were randomly collected from unrelated healthy individuals from the Keriyan village in Xinjiang (Figure 1) with written informed consents. The study was approved by the Ethnic Study Committee of Xinjiang University. DNA was extracted from the blood using the classic phenol/chloroform method. For comparative analysis, we examined information on haplotypes of Y-STR loci and mtDNA sequence. The reference population data for Y-STR genotyping included: Archangelsk, Russian Federation [Russian], population sample with 42 haplotypes(YA003171); Amdo, China [Tibetan], population sample with 88 haplotypes(YA003694); Kham, China [Tibetan], population sample with 109 haplotypes(YA003695); Liaoning, China [Mongolian], population sample with 206 haplotypes(YA003758); Liaoning, China [Northern Han], population sample with 376 haplotypes (YA003756); Central Mongolia [Khalkh], population sample with 87 haplotypes (YA003737); Northern Mongolia [Khalkh, Darkhad, Uriankhai] population sample with 29 haplotypes (YA003734); Rjasan, Russian Federation [Russian], population sample with 36 haplotypes (YA003178); Wallachia, Romania [Romanian], population sample with 96 haplotypes (YA003457); Western Mongolia [Khalkh, Durvud, Bayad, Uriankhai, Khazak, Torguud, Uuld, Zakhchin], population sample with 33 haplotypes (YA003732); Xinjiang, China [Uyghur], population sample with 218 haplotypes (YA003847); Kazakhstan, population set of 3 population samples with 347 haplotypes (YHRD database); Mongolia, population set of 9 population samples with 445 haplotypes (YHRD database); Romania, population set of 14 population samples with 817 haplotypes (YA003457); and Russian Federation, population set of 50 population samples with 3364 haplotypes (YHRD database).

thumbnail
Figure 1. Geographic locations of the samples in this study.

https://doi.org/10.1371/journal.pone.0100479.g001

For the mtDNA data, reference populations included Kazakhs, Kirghiz, Uzbeks and Uyghurs [1], [2]; Mongolians [2][5]; Daurs, Ewenkis and Oroqens [4] [6]; Huis [2]; Armenians, Azeris and Georgians [7]; Tuvans and Buryats [8]; Northern and Southern Hans [4], [9], [10], and Southern East China ethic groups [11], [12], Tibetan [12], [13], Korean [4], [14], Japanese [15], Russian and Siberian [16], [17]. We also included our own collected data from Uyghurs from Xinjiang China (unpublished).

2. Y-STR genotyping

Genotyping of 17 Y-STR loci (DYS 19, 389I, 389II, 390, 391,392,393, 385I, 385II, 437, 438, 439, 448, 456, 458, 635 and GATAH4) in Keriyans was conducted with an Applied Biosystems (ABI) AmpFlSTR Yfiler PCR amplification kit (ABI, Carlsbad, CA) and analysed by an Applied Biosystems 3100 DNA sequencer (ABI), according to the manufacturer’s instruction. Genotypes were identified based on the number of tandem repeats.

The Keriyan samples were classified into haplotypes based on their genotypes summarized from the 17 Y-STR loci and further assigned into haplogroups using an online haplogroup predictor tool (http://www.hprg.com/hapest5).

3. mtDNA sequencing and sequence comparison

The control regions of the mtDNA were amplified using the primer set L15996/H601 as described previously [18], [19]. PCR product was purified with a fragment DNA/RNA purification kit (Biomiga, Beijing, China) and used as a DNA template for DNA sequencing. Cycling sequencing for HVS I and HVS II was conducted using the ABI BigDye terminator v3.1 cycle sequencing kit in an ABI 3100 DNA sequencer.

Nucleotide sequences were edited and aligned using the MEGA 5.0 (DNAStar, Inc.) [20], and compared with the Revised Cambridge Reference Sequence (rCRS) [21]. mtDNA haplogroups were assigned using online tool HaploGrep (http://www.haplogrep.uibk.ac.at) with the strategy described by Yao et al [22].

4. Data analyses

The frequencies of Y-STR haplotypes and of mtDNA haplogroups of the Keriyan samples were estimated. The basic parameters of molecular diversity (nucleotide diversity, haplotype diversity, and mean number of pairwise differences) and population genetic structure (including analyses of molecular variance, AMOVA), neutrality tests (Tajima’s D and Fu’s Fs), and genetic distance Fst were estimated for mtDNA using the computer program Arlequin 3.11 (http://www.cmpg.unibe.ch/software/arlequin3. Computational and Molecular Population Genetics Lab, Institute of Zoology, University of Berne, Baltzerstrasse 6, 3012 Bern, Switzerland). The statistical significance of Fst-values was estimated by permutation analysis using 10000 permutations. Neighbor joining tree was constructed using Fst values by Phylip v3.6.9 (http://evolution.genetics.washington.edu/phylip.html).

Using an AMOVA tool provided by the YHRD website (http://www.yhrd.org/Analyse), genetic distances (Rst) between the Keriyan population and other reference populations for the Y-STR data set were analyzed and p values were calculated with 10000 permutations using Rst between populations. A multiple dimension scaling plot (MDS) was obtained using Rst generated by AMOVA. Median-joining (MJ) networks were constructed with the Network 4. 6.12 program (http://www.fluxus-engineering.com) using the HVRI sequences or Y-STR genotypes [23], [24]. All polymorphic nucleotide positions were assigned weights following the recommendations provided by Roostalu et al [25]. The Y-STR loci were weighted based on the inverse of their variances.

To compare mtDNA variation among Keriyan and the surrounding other populations, a principal component analysis based on haplogroup frequencies among populations was performed using SSPS version 17 software.

In addition, the coalescence times of haplogroups M7c, M8a, and M9ab, and M11 were computed using rho statistics [26] calibrated with two different mutation rates, including: i) the most widely used mutation rate of one transitional step between nps 16090–16365 every 20,180 years [26] and ii) the revised rate of one mutation every 18,845 years for the same sequence region [27]. The standard deviation for the rho estimates was obtained according to Saillard et al [28], and the length variation in the polycytosine tract between nps 16180–16193 was excluded from the analysis.

Results

1. Genetic diversity of Y-chromosomal markers

We detected 17 Y-STR loci using Applied Biosystems AmpFlSTR Yfiler. A total of 20 haplotypes were observed in the 50 Keriyan male samples. The gene diversity was 0.896. We also designed these haplotypes into 9 haplogroups according to their genotypes using online haplogroup predictor tool (http://www.hprg.com/hapest5). The frequencies and genotypes of haplotypes are summarized in Table 1. Five haplotypes were observed more than twice in the Keriyans while 13 haplotypes were observed once and two haplotypes were observed twice. The most common haplotype was H4 belonging to haplogroup R1a, shared by 13 people at the frequency of 26%. More than 60% of samples belong to Haplogroup R1a. The European- specific haplogroups R1a, J and I account for 82% of the total haplogroups, whereas Asian-specific haplogroups H, L and T for 10%. Central Asian or Eurasian specific haplogroup Q and N account for 8%. A network of Y-STR haplotype was constructed for the Keriyan samples (Figure 2). The stretched, not star-like network showed that the Keriyan samples had at least two ancestry haplotypes.

thumbnail
Figure 2. Medium Network of Keriyans using haplotypes from 13 Y-STR loci.

A Median Joining network was constructed using 13 Y-STR loci (DYS389I, 390, 391, 392, 393, 437, 438, 439, 448, 456, 458, 635 and GATAH4) as described in the Materials & Methods. Node size is proportional to frequencies of haplotype.

https://doi.org/10.1371/journal.pone.0100479.g002

thumbnail
Table 1. Haplotypes of 17 Y-STR loci and their frequencies in the Keriyans.

https://doi.org/10.1371/journal.pone.0100479.t001

We searched all 20 haplotypes observed in the Keriyan in the YHRD database comprising of 71234 haplotypes from 477 populations. Three haplotypes (H2, H6 and H19) matched certain samples in the database. Haplotype 2 (J2a1b) representing at the frequency of 10% in the Keriyan was found to match one haplotype in the searching database. The hit haplotype belonging to Eurasian Met population database (32785 matching haplotypes in 221 populations) is from Xinjiang Uyghur population reported by our group (YA003847) [29]. Haplotype 6 occurring twice in the Keriyan population was found to have 4 matching haplotypes in the searched database. Three of them were found in the Eurasian metapopulations, among which two matched the Central Mongolia Khalkh (YA003737) and one matched the Mongolian population from Liaoning, China (YA003758), respectively. The other hit was in the Northern Han population from Liaoning, China (YA003756). Moreover, Haplotype 19 occurring once in the Keriyan population was found in the Croatian of Zagreb, Croatia dataset in Eurasian-European-South Eastern European database (YA003130).

2. Patrilineage affinity of the keriyan population

In order to know the genetic relationship of the Keriyan samples with other populations, we performed an AMOVA analysis and calculated Rst with the genotype of 17 Y-STR loci from the Keriyan and other 11 populations and 4 population sets consisting of a total of 6314 haplotypes published in the YHRD database. The obtained Rst is shown in Table S1. To visualize the extent of genetic diversity among these populations, a MDS plot based on Rst was made as shown in Figure 3. Not surprisingly, the MDS plot clearly showed a geographic affinity pattern. For instance, Chinese mainland Hans including north Han and Beijing Han, China populations and north-China Mongolian clustered together; east European populations from Russia and Siberia gathered together and formed a cluster; Mongolia populations were close to each other, and Romania populations gather together and formed their own cluster. Two Tibetan populations did not form a close cluster like the above populations, but distinctively separated from the Keriyan and the other populations. The Keriyan population was not attached to any other populations. However, the Keriyan population was relatively closer to the Russia populations and Xinjiang Uyghur than to Mongolia, Romania, China north Mongolian and Han, and Tibetan populations. The greatest distances between the Keriyan samples were populations from the Southern Chinese Han and Tibetan, followed by Mongolians. Even so, the distance between the Keriyans and the Russian, or Siberians, or Uyghur was significant.

thumbnail
Figure 3. Multiple dimension Scaling plots of Rst value estimated from haplotypes comprised 17 Y-STR loci.

Rst values were calculated between each populations and MDS plot based on Rst using an online tool AMOVA provided by YHRD.

https://doi.org/10.1371/journal.pone.0100479.g003

In order to visualize the phylogenetic relationship between Keriyan and other populations, we constructed a MJ network. We used the haplotypes from13 Y-STR loci (DYS 389I, 390, 391, 392, 393, 437, 438, 439, 448, 456, 458, 635 and GATAH4) in making a network due to mutations occurring at DYS19, 389II and 385. The reference populations were selected based on the genetic affinity between Keriyan and the other populations based on the MDS analysis (Figure 3). Only the part of the common haplotypes from selected populations listed on YHRD was included to construct MJ network of Y-STR due to limitations in the software of the Network 4.6.12 program. As shown in Figure 4, the haplotypes from the Keriyan were generally close to European and Mongolian populations, but far away from Sino-Tibetan populations (Tibetan and Han populations). Two thirds of Keriyan haplotypes gathered together with the European populations Russian and Siberia and Eurasian Uyghur population, while the remainder gathered with Mongolian populations.

thumbnail
Figure 4. Median Joining network of Keriyans and other 14 referenced populations using 13 Y-STR haplotypes.

A Median Joining network was constructed using the same 13 Y-STR loci as Figure 2. Node size is proportional to frequencies of haplotype. Populations were labelled by different colors.

https://doi.org/10.1371/journal.pone.0100479.g004

3. Maternal analysis

We sequenced the complete control region of the mtDNA for 42 samples from the Keriyan. Considering both hyper variable segments (1040 bp), a total of 59 positions were mutant. The average number of nucleotide differences (k) was 9.29568. Nucleotide diversity (Pi) and haplotype (gene) diversity Hd were 0.00832 and 0.947, respectively. Due to lack of HVS II sequence information for some populations, we just selected HVS I sequences to compare molecular diversity among populations. Shown in Table 2, the Keriyan have the lowest gene and nucleotide diversity. We also calculated the neutrality indexes, Tajima’s D and Fu’s Fs to examine whether there is a population expansion. Tajima D and Fu’s F values were −1.13 and −1.157, respectively, but both were not significant (P>0.05), indicating more recent but not historical population expansion.

thumbnail
Table 2. Gene diversity of Keriyan and its surrounding populations.

https://doi.org/10.1371/journal.pone.0100479.t002

We observed a total of 20 different sequences (haplotypes) and assigned them into 13 different sub-haplogroups (Table S2). The nucleotide mutations of each haplotype and the frequencies of each haplogroup are listed in Table S2. A Neighbour Joining Tree (Figure 5) was constructed using haplotypes from the Keriyan samples. The Keriyan population consists both of Europe-specific lineages (HV, H2a2, H6, J1, U3, U7a, and W) and Asian-specific-lineages (C5c, M7c, M8a2a1, M9 and M11a) as shown in Figure 5 and Table S2. The percentages of West Eurasian and East Eurasian specific lineages were equal, 50% and 50%, respectively. The most dominant haplogroup was HV with a frequency of 19.05%, followed by sub-Hg H2a2 and M11a both with a frequency of 14.29%. Except for C5c, all other East Asian specific haplotypes were represented more than twice in the Keriyan population, while the majority of European specific haplogroups (H6, J1, U3 and U7a) except H2a2, HV2 and W appeared only once in the population. We merged sub-haplogroups into a major haplogroup in order to compare with other reference populations. The frequencies of the final merged haplogroups were shown in Table 3.

thumbnail
Figure 5. A Neighbour-Joining tree for the Keriyans based on the control region sequences of mtDNA.

https://doi.org/10.1371/journal.pone.0100479.g005

thumbnail
Table 3. Frequencies of haplogroup in the Keriyan and referenced populations.

https://doi.org/10.1371/journal.pone.0100479.t003

To understand the genetic relationship of Keriyan population with other surrounding populations and to visualize the difference in the haplogroups among the populations, we performed a principal component analysis (PCA) using the basal frequencies of haplogroups of the Keriyan population in the study and the other 29 previously published populations as input vectors (Table 3). Figure 6 shows a PCA plot for the first two PCs, which account for 53.14%, and 14.96% of the total variance, respectively. In the PCA plot, the first two PCs revealed the following five distinguishing clusters: 1) The first one is comprised of the seven Tibetan groups and two Chinese Han groups at the right bottom corner of the plot; 2) a white population cluster including the Armenian and Georgia groups in the left of the plot; 3) a cluster comprising Xinjiang Eurasians, and the north China minority ethnic groups occupying the right upper corner; 4) a Russian cluster close to Siberia populations; and 5) the Keriyan and Daheyan populations. The Keriyan is placed very close to the Daheyan without any distance in the first component, but clearly separated from the other groups. The next closest to the Keriyan populations are the Uyghur, then Russian and Siberian populations, whereas the Keriyan is far away from the Tibetans.

thumbnail
Figure 6. Phylogenetic relationship between Keriyan and reference populations analyzed by principal component analysis with the frequencies of haplogroups.

Populations were coded as: MMg- Mongolians from Mongolia [3]; MWMg- Mongolians from Western Mongolia [5]; MEMg - Mongolians from east Inner Mongolia of China [4]; MIMg- Mongolians from central Inner Mongolia; MYMg - Mongolians from Yunnan Province of China [6]; YLMg- Mongolians from Yili; Uyg1 - Xinjiang Uyghurs from the Yao’s Study; Uyg2- Xinjiang Uyghurs from our unpublished data; Kaz - Kazaks; Uzb – Uzbeks [2]; Dau - Daurs; Ewe-Ewenkis; Oro- Oroqens; Arm- Armenians; Geo- Georgians from Cauasus [7]; TV- Tuvans; BR- Buryats from the south extreme of Siberia [8]; SH - South Han; NH- North Han [10]; DHY- Daheyan; Rus - Russian; Tib1- Nakchu; Tib 2- Tibetans from Shigats; Tib 3- Tibetans from Tibet; Tib 4- Tibetans from Qinghai; Tib5- Tibetans from Yunnan; Tib 6- Tibetans from Yunnan; Tib 7- Tibetans from Gansu.

https://doi.org/10.1371/journal.pone.0100479.g006

To understand the genetic structure of Keriyan, we performed an AMOVA analysis and calculated Fst. To visualize the extent of genetic diversity among these populations, we plotted pairwise Fst values estimated from HVS I sequence data through MDS analysis (data not shown). Not surprisingly, Keriyans clustered together with the Daheyan group, with which they showed a similar haplogroup composition. The next closest to the Keriyan is the Uyghurs. By contrast, the Keriyans were quite genetically distinct from Tibetan, Han populations.

In order to trace the origin of common Asian haplogroups of Keriyan, we constructed common haplogroups M7c, M8a, M9, and M11 networks using the Network 4.6.12 program as shown in Figures 7A–D, respectively. Overall, we observed that: 1) Each haplogroup displays as a star-like expansion cluster, indicating the ancestral haplotype expansion, 2) The Keriyan samples are not at the their respective central nodes of M7c, M8a, M9 and M11, implying these haplogroups do not exist in the Keriyan population for long time; and 3) Most of the Keriyan haplotypes are shared with Daheyan, except haplogroup M8a. Afterwards, we looked closer at haplotype differentiation within each haplogroup. M7c specifically found in East Asia [16], [30][32] has not been found in north eastern Asia [16], [33], [34] and is very rare in central Asians [35], [36], but was observed both in the Keriyan in this study and the Daheyan group reported by Zhou group [37] at the frequencies of 9.5% and 6.9, respectively. The Keriyans along with the Daheyans and one Mongolian are at the branch node with a step mutation at nucleotide position16254 away from the ancestral M7c characterized by 16223-6254-16295. The coalescence time of M7c in the Keriyan is ∼4037 years. In contrast to M7c, the Keriyan is at the very far end of the branch of the M8a network with multiple mutations away from the ancestral haplotype, but only one step away from a Uyghur with a mutation at position16189 or a Mongolian with a mutation at position 16129. M8a arose in the Keriyan an estimated 4036 years ago. It seems to have a very short history compared with the ancestral M8a which occurred in southern Asia 34437 years ago. M9 lineage which is prevalent with the greatest diversity in Tibetan, also exists in the Keriyan. The frequency of the M9 peaks in the Keriyans as compared with all other observed Asian specific haplogroups. In the Keriyans, there were two sub-haplogroups of M9 and M9a’b. The two Keriyans of sub-haplogroup M9 are at the same node with the two Daheyan samples and two Tibetans, while four Keriyans of M9a’b are shared by five Mongolian and two Tibetans with a back mutation. With regard to M11, all the six Keriyan of M11 and the four of Daheyan individuals are at the end of the same branch of M11. There are two step mutations at positions 16172 and 16051 differing to the ancestral root. The sub-haplogroup M11 seems to occur in the Keriyan and Daheyan groups very lately with an estimated coalescence time of 4324 years.

thumbnail
Figure 7. Reduced Median joining network of haplogroups M7c, M8a, M9 and M11 based on the mutation site of HVS I.

The size of the circles is proportional to the number of sequences. Populations are coded as Mg- Mongolians; Sib- Siberia; SEA- South East Asian populations; Uyg1 - Xinjiang Uygurs; Kaz- Kazaks; Kor- Korean; Viet- Vietenese; Uzb – Uzbeks [2]; Dau - Daurs; Ewe- Ewenkis; Oro- Oroqens; Han- Chinese Han; DHY- Daheyan; Rus - Russian; Tib- Tibetans. Mutated sites (minus16, 000) were indicated along the lines. (A). Haplogroup M7c; (B). Haplogroup M8a; (C). Haplogroup M9; (D). Haplogroup M11.

https://doi.org/10.1371/journal.pone.0100479.g007

Discussion

The Taklimakan Desert is located in the Tarim Basin which lies between the European and Asian Continents. Populations in this area have undergone substantial migrations and assimilations by other tribes during prehistoric and historic periods. The origin and migration history of populations in this area have drawn great attention. The archaeological and anthropological reports on the remains from the excavated tombs show that human beings inhabited the desert area in the Bronze age, approximately 4000 years BP and those found mummies display Eurasian features and belong to a West and East admixture population in genetics [38]. Data from extant people in the Daheyan village in the Taklimakan Desert has also verified this according to mtDNA sequence [37]. In this study, we for the first time concurrently analyzed paternal and maternal genetic variance in the Keriyans living in the Taklimakan Desert by genotyping 17 Y-STR loci and sequencing the complete control region of mtDNA.

The data from Y-STR in this study show that the Keriyans possess not only Asian (south and central Asia) specific NRY haplogroup H, T L, Q and N, but also European specific haplogroup R1a, J2a and I. European specific R1a has been observed in Central Asia and emerged from the Kurgan culture in Central Asia about 15,000 years ago and expanded there afterwards [39]. Higher frequency of R1a in Keriyan suggests that the Keriyan belong to a Central Asian population. In contrast, two dominant Y haplogroups D [40] and O [41] in Tibetans which are also Asian-specific were not observed in the Keriyans in this study, indicating that the modern Keriyans are not offspring of the Tibetans. It is also noted that the genetic distances between the Keriyans and the Uyghur from Xinjiang, or Siberia or Russians are much closer than the ones between the Keriyans and other populations including Chinese Han, Mongolian, and Tibetans. In addition to this, in the Network of NRY Haplotypes comprised of the selected 13 Y-STR loci from 14 populations, the majority of Keriyan individuals clustered together with the Uyghur, Russian and East Croatian, whereas a very small portion of the Keriyans were distributed among Mongolians. Haplotypes of Tibetans and Chinese Hans are separated from the Keriyans, indicating that the patrilineages of the Keriyan apparently do not have an affinity with Chinese Han and Tibetan. Taken together, we could conclude that the Keriyan is an admixture population of West and East Eurasian with West Eurasian prevalence in terms of patrilineages and male Tibetans are not a contributor of Y chromosome to the Keriyan population.

In addition to patrilineages, we also confirmed from their matrilineages that the Keriyan is a West-East Eurasian admixed population by analysis of the control regional sequences of the mtDNA. It is worthy of note that European maternal lineage makes relatively less contribution to the Keriyan than their patrilineages (50% vs. 82%), indicating a sex-bias in forming the Keriyan gene pool. In order to track the origin of matrilineages of the Keriyan, we compared the maternal genetic variance among the surrounding populations. As demonstrated by haplogroup patterns, Keriyan groups including our study here and another previously reported by Zhou [37] are closer to the Uyghur, Russian and Siberia populations than to Chinese Han, or Tibetans or Mongolians in genetics, which is in line with the patrilineage results.

East Asian specific mtDNA haplogroup M7c, M9 and M11 and North Asian haplogroup M8a frequently exist in the Keriyans with one or two step mutation from the ancestral roots. The above subgroup of haplogroup M arose in the Keriyan recently ∼3000–4000 years, much earlier than the Guge Kingdom collapse in 1630. This relative short coalescence time is consistent with the results from archeological and genetic studies on remains excavated from ancient tombs in the Tarim Basin. Except haplogroup M9a’b, the Keriyan does not share any sequences in the HV I segment with the Tibetans. Nevertheless, only two Tibetans shared the same sequences with the Keriyans in each of the two sub-haplotypes, while the majority of haplotypes of M9 in the Tibetans are not shared with the Keriyan or Daheyan populations. This further confirms that it is impossible that modern Keriyans are the descendants of ancient Guge Tibetans.

Moreover, we also found that low gene diversities on mtDNA and Y-STR loci in the Keriyan population reflect the isolation of the Keriyan population from other populations and less interaction with other populations. We have to admit that further analysis on NRY SNP will help address the coalescence time of Y chromosomal haplogroups since Y-STR only reflect recent mutations, not like SNP for ancient mutations.

The genetic distances between Keriyan and Uyghur both in Maternal and paternal lineages are nearly the shortest compared with the ones between the Keriyan and the other populations. Three Keriyans and one Uyghur from Xinjiang share the same haplotype comprised 17 Y-STR loci, indicating gene inflow from the adjacent Uyghur. Additionally, Keriyans share the same appearance, costume, language, written script, religion, and custom with the Uygurs in Yutian and Yutian is only 250 km away from the Keriyans. Taken together, we can’t rule out the hypothesis that the modern Keriyan is an off-spring of Uyghurs.

In conclusion, the Keriyan is an admixture population of Western and East Eurasian. There are a sex-bias on gene pool formation and human being migration of Keriyan. The Keriyans are not the descendants of the Tibetan tribe, but may originate from local Uyghur populations.

Supporting Information

Table S1.

Pairwise Rst value calculated by AMOVA analysis offered by the YRDH website.

https://doi.org/10.1371/journal.pone.0100479.s001

(XLSX)

Table S2.

Mutations and frequencies of sub-haplogroups observed in Keriyan.

https://doi.org/10.1371/journal.pone.0100479.s002

(XLS)

Acknowledgments

We thank all participants in this study. We also thank Dr. Wei-Hua Wang for his assistance in data analysis, Dr. Xianhua Jiang and Dr. Erdenechimeg Tsagaan for providing Y-STR data of Han and Mongolian populations from Liaoning, China, and of Mongolians in Mongolia, respectively.

Author Contributions

Conceived and designed the experiments: XZ. Performed the experiments: KC AA WW. Analyzed the data: WQ WS FL XZ. Contributed reagents/materials/analysis tools: TK HZ FZ ZM XZ. Wrote the paper: FL XZ.

References

  1. 1. Comas D, Calafell F, Mateu E, Perez-Lezaun A, Bosch E, et al. (1998) Trading genes along the silk road: mtDNA sequences and the origin of central Asian populations. Am J Hum Genet 63: 1824–1838.
  2. 2. Yao YG, Kong QP, Wang CY, Zhu CL, Zhang YP (2004) Different matrilineal contributions to genetic structure of ethnic groups in the silk road region in china. Mol Biol Evol 21: 2265–2280.
  3. 3. Kolman CJ, Sambuughin N, Bermingham E (1996) Mitochondrial DNA analysis of Mongolian populations and implications for the origin of New World founders. Genetics 142: 1321–1334.
  4. 4. Kong QP, Yao YG, Liu M, Shen SP, Chen C, et al. (2003) Mitochondrial DNA sequence polymorphisms of five ethnic populations from northern China. Hum Genet 113: 391–405.
  5. 5. Gokcumen O, Dulik MC, Pai AA, Zhadanov SI, Rubinstein S, et al. (2008) Genetic variation in the enigmatic Altaian Kazakhs of South-Central Russia: insights into Turkic population history. Am J Phys Anthropol 136: 278–293.
  6. 6. Cheng B, Tang W, He L, Dong Y, Lu J, et al. (2008) Genetic imprint of the Mongol: signal from phylogeographic analysis of mitochondrial DNA. J Hum Genet 53: 905–913.
  7. 7. Schonberg A, Theunert C, Li M, Stoneking M, Nasidze I (2011) High-throughput sequencing of complete human mtDNA genomes from the Caucasus and West Asia: high diversity and demographic inferences. Eur J Hum Genet 19: 988–994.
  8. 8. Starikovskaya EB, Sukernik RI, Derbeneva OA, Volodko NV, Ruiz-Pesini E, et al. (2005) Mitochondrial DNA diversity in indigenous populations of the southern extent of Siberia, and the origins of Native American haplogroups. Annals of human genetics 69: 67–89.
  9. 9. Yao YG, Kong QP, Man XY, Bandelt HJ, Zhang YP (2003) Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA. Mol Biol Evol 20: 214–219.
  10. 10. Wen B, Li H, Lu D, Song X, Zhang F, et al. (2004) Genetic evidence supports demic diffusion of Han culture. Nature 431: 302–305.
  11. 11. Li H, Cai X, Winograd-Cort ER, Wen B, Cheng X, et al. (2007) Mitochondrial DNA diversity and population differentiation in southern East Asia. Am J Phys Anthropol 134: 481–488.
  12. 12. Yao YG, Zhang YP (2002) Phylogeographic analysis of mtDNA variation in four ethnic populations from Yunnan Province: new data and a reappraisal. J Hum Genet 47: 311–318.
  13. 13. Zhao M, Kong QP, Wang HW, Peng MS, Xie XD, et al. (2009) Mitochondrial genome evidence reveals successful Late Paleolithic settlement on the Tibetan Plateau. Proc Natl Acad Sci U S A 106: 21230–21235.
  14. 14. Jin HJ, Kwak KD, Hong SB, Shin DJ, Han MS, et al. (2006) Forensic genetic analysis of mitochondrial DNA hypervariable region I/II sequences: an expanded Korean population database. Forensic Sci Int 158: 125–130.
  15. 15. Maruyama S, Minaguchi K, Saitou N (2003) Sequence polymorphisms of the mitochondrial DNA control region and phylogenetic analysis of mtDNA lineages in the Japanese population. Int J Legal Med 117: 218–225.
  16. 16. Derenko M, Malyarchuk B, Grzybowski T, Denisova G, Dambueva I, et al. (2007) Phylogeographic analysis of mitochondrial DNA in northern Asian populations. Am J Hum Genet 81: 1025–1041.
  17. 17. Derenko MV, Grzybowski T, Malyarchuk BA, Dambueva IK, Denisova GA, et al. (2003) Diversity of mitochondrial DNA lineages in South Siberia. Ann Hum Genet 67: 391–411.
  18. 18. Vigilant L, Pennington R, Harpending H, Kocher TD, Wilson AC (1989) Mitochondrial DNA sequences in single hairs from a southern African population. Proc Natl Acad Sci U S A 86: 9350–9354.
  19. 19. Nohira C, Maruyama S, Minaguchi K (2010) Phylogenetic classification of Japanese mtDNA assisted by complete mitochondrial DNA sequences. Int J Legal Med 124: 7–12.
  20. 20. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–1599.
  21. 21. Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, et al. (1999) Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet 23: 147.
  22. 22. Yao YG, Kong QP, Bandelt HJ, Kivisild T, Zhang YP (2002) Phylogeographic differentiation of mitochondrial DNA in Han Chinese. Am J Hum Genet 70: 635–651.
  23. 23. Bandelt HJ, Forster P, Rohl A (1999) Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16: 37–48.
  24. 24. Bandelt HJ, Forster P, Sykes BC, Richards MB (1995) Mitochondrial portraits of human populations using median networks. Genetics 141: 743–753.
  25. 25. Roostalu U, Kutuev I, Loogvali EL, Metspalu E, Tambets K, et al. (2007) Origin and expansion of haplogroup H, the dominant human mitochondrial DNA lineage in West Eurasia: the Near Eastern and Caucasian perspective. Mol Biol Evol 24: 436–448.
  26. 26. Forster P, Harding R, Torroni A, Bandelt HJ (1996) Origin and evolution of Native American mtDNA variation: a reappraisal. Am J Hum Genet 59: 935–945.
  27. 27. Soares P, Ermini L, Thomson N, Mormina M, Rito T, et al. (2009) Correcting for purifying selection: an improved human mitochondrial molecular clock. Am J Hum Genet 84: 740–759.
  28. 28. Saillard J, Forster P, Lynnerup N, Bandelt HJ, Norby S (2000) mtDNA variation among Greenland Eskimos: the edge of the Beringian expansion. Am J Hum Genet 67: 718–726.
  29. 29. Shan W, Ablimit A, Zhou W, Zhang F, Ma Z, et al. (2014) Genetic polymorphism of 17 Y chromosomal STRs in Kazakh and Uighur populations from Xinjiang, China. Int J Legal Med.
  30. 30. Kivisild T, Tolk HV, Parik J, Wang Y, Papiha SS, et al. (2002) The emerging limbs and twigs of the East Asian mtDNA tree. Mol Biol Evol 19: 1737–1751.
  31. 31. Hill C, Soares P, Mormina M, Macaulay V, Clarke D, et al. (2007) A mitochondrial stratigraphy for island southeast Asia. Am J Hum Genet 80: 29–43.
  32. 32. Peng MS, He JD, Liu HX, Zhang YP (2011) Tracing the legacy of the early Hainan Islanders–a perspective from mitochondrial DNA. BMC Evol Biol 11: 46.
  33. 33. Torroni A, Sukernik RI, Schurr TG, Starikorskaya YB, Cabell MF, et al. (1993) mtDNA variation of aboriginal Siberians reveals distinct genetic affinities with Native Americans. Am J Hum Genet 53: 591–608.
  34. 34. Derenko MV, Shields GF (1997) [Diversity of mitochondrial DNA nucleotide sequences in three groups of aboriginal inhabitants of Northern Asia]. Mol Biol (Mosk) 31: 784–789.
  35. 35. Metspalu M, Kivisild T, Metspalu E, Parik J, Hudjashov G, et al. (2004) Most of the extant mtDNA boundaries in south and southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans. BMC Genet 5: 26.
  36. 36. Comas D, Plaza S, Wells RS, Yuldaseva N, Lao O, et al. (2004) Admixture, migrations, and dispersals in Central Asia: evidence from maternal DNA lineages. Eur J Hum Genet 12: 495–504.
  37. 37. Cui Y, Li C, Gao S, Xie C, Zhou H (2010) Early Eurasian migration traces in the Tarim Basin revealed by mtDNA polymorphisms. Am J Phys Anthropol 142: 558–564.
  38. 38. Li C, Li H, Cui Y, Xie C, Cai D, et al. (2010) Evidence that a West-East admixed population lived in the Tarim Basin as early as the early Bronze Age. BMC Biol 8: 15.
  39. 39. Underhill PA, Myres NM, Rootsi S, Metspalu M, Zhivotovsky LA, et al. (2010) Separating the post-Glacial coancestry of European and Asian Y chromosomes within haplogroup R1a. Eur J Hum Genet 18: 479–484.
  40. 40. Gayden T, Cadenas AM, Regueiro M, Singh NB, Zhivotovsky LA, et al. (2007) The Himalayas as a directional barrier to gene flow. Am J Hum Genet 80: 884–894.
  41. 41. Yan S, Wang CC, Li H, Li SL, Jin L (2011) An updated tree of Y-chromosome Haplogroup O and revised phylogenetic positions of mutations P164 and PK4. Eur J Hum Genet 19: 1013–1015.