- Research article
- Open Access
Dispersal route of the Asian house rat (Rattus tanezumi) on mainland China: insights from microsatellite and mitochondrial DNA
BMC Genetics volume 20, Article number: 11 (2019)
Rattus tanezumi is a common commensal rat and an important host animal of bubonic plague in South China and Southeast Asia. The northward dispersal of this species in mainland China has been reported in recent decades, along with more recent intercontinental expansion. Population genetics of R. tanezumi in mainland China were studied to explain the relationship between dispersal history and the ancient and modern transportation networks of China.
In total, 502 individuals belonging to 18 populations were collected from 13 provincial areas. Nine microsatellite loci and two mtDNA sequences were analyzed. The results indicate that R. tanezumi populations from Yunnan have highest genetic diversity and populations from Tibet with lowest genetic diversity. 18 populations can be divided into four clusters, the first cluster including populations from southwest Yunnan, the second including two populations of Tibet, the third for populations from middle and east of mainland China, and the forth for two populations from north Yunnan. Both microsatellite and mtDNA data reveal that the populations from coastal areas are closely related to populations from Yunnan, whereas populations from Tibet are closely related with populations from Sichuan.
The results suggest that early dispersal of R. tanezumi in mainland China depended on shipping transportation, with subsequent expansion from coastal areas into Central China occurring along the Yangzi River. Further, the linkages between populations in Tibet and Sichuan point to a modern era introduction via the Chuan-Zang highway, rather than along the Tea Horse Ancient Road.
The Asian house rat, Rattus tanezumi (Temminck, 1844) (Rodentia: Muridae), is a common commensal rat in East and Southeast Asia. From South China to Southeast Asia, R. tanezumi can be found in indoor and outdoor habitats. In outdoor habitats, it is the dominant pest rodent in rice fields, causing crop damage before harvest [1, 2]. After the harvest season, R. tanezumi will migrate to residential areas and even indoor areas for food , causing household damage . Because of its wide range of habitats and these seasonal migrations, R. tanezumi can carry a number of different pathogens including zoonotic agents of public health concern. Thus, R. tanezumi is regarded not only as a pest related to agriculture but also as a host of several zoonoses, including plague (Yersinia pestis), haemorrhagic fever with renal syndrome (Hantaan virus), and leptospirosis (Leptospira) [5,6,7].
The range expansion of R. tanezumi was not noted until recently after Huang et al. reported the northern border of the distribution area of R. tanezumi in China as the Yellow River . From the 1990s on, R. tanezumi was found in provinces north of the Yellow River, including Hebei  and Shanxi , and in 2012 Ma et al. reported the first record of R. tanezumi northeast of Qinghai, a northwestern province of China . The colonization of R. tanezumi in north China was easily recognized because this species can be distinguished from another commensal rat, R. norvegicus, with morphological characters. However, the expansion of R. tanezumi out of Asia was not recognized until genetic methods were used in congeneric species identification  because of the morphological similarities among members of the R. rattus species group. With molecular genetic methods, Bastos et al reported the presence of R. tanezumi in 13 locations in South Africa . Mitochondrial haplotypes of R. tanezumi were also found in California in the United States and in Madagascar . Based on these and additional data, Aplin et al inferred the prehistory expansion and the global distribution of R. tanezumi .
The extension of the geographic range of commensal rats is considered a result of human activity. The phylogeography of R. rattus in the western Indian Ocean is congruent with human colonization events in this area . The distribution of R. exulans is also a good example and has been used to imply the history of human colonization on Pacific islands [17, 18]. Rattus norvegicus was not reported at Xinjiang before the 1970s because of the Gobi Desert between Xinjiang and Central China. In the 1970s, colonies of R. norvegicus were first found in railway stations of Xinjiang, after the railway was built . From then on, R. norvegicus spread to other cities of Xinjiang with railway and highway traffic. Because the dispersion of R. tanezumi was reported only in recent decades, the dispersal routes of this species and how these relate to human activity is not known.
In ancient China, because of the complex landscape of Southwest China, commercial transportation in this area (including Sichuan, Yunnan, Shaanxi and Tibet) usually depended on caravans. Starting approximately 1200 years ago (Tang Dynasty), a famous caravan road system for tea, salt and horse trading developed in Southwest China, which is now called the Tea Horse Ancient Road. This road system linked Sichuan, Yunnan and Tibet (Fig. 1a) and became a unique route for trade between Tibet and Central China . Another important transportation system in ancient China was shipping via sea, river and canal. Shipping transportation has been used on the Yangzi River and Pearl River for more than two thousand years. During the twentieth century, a modern highway system was gradually constructed in Southwest China, and modern transportation based on the automobile and highway system became a primary method of commodity trade. An important highway linking Tibet and Central China is the Chuan-Zang Highway, which starts from Chengdu, the capital city of Sichuan, and ends at Lhasa in Tibet (Fig. 1a). Whether and how these transportation systems affected the migration of R. tanezumi on mainland China, especially from Yunnan to Tibet, could be explained by population genetics analysis of rats.
In this study, we collected samples of R. tanezumi from mainland China and examined the genetic structure of different populations with microsatellite and mitochondrial markers. Based on the results of these analyses, we aimed to explain the expansion of R. tanezumi through mainland China, the possible expansion routes of R. tanezumi through mainland China, including the Tea Horse Ancient Road for caravans, shipping transportation along the rivers and coastline of China, and transportation along modern highway and railway systems.
Sampled populations and DNA extraction
Samples belonging to 18 populations were collected at 22 locations (N = 10–39 rats per location) in 13 provinces covering almost the entire range of R. tanezumi in China (Fig. 1a and Table 1). Samples of four populations (SHH, CHQ, SIC and QNJ) consisted of specimens pooled from two sites in close proximity. Rats were captured with living traps in rice fields and farms after getting verbal consents from owners. Individuals of R. tanezumi were selected and carried to the laboratory. Compressed carbon dioxide in gas cylinder was used to euthanize the rats. For the safety the flow rate of CO2 was controlled at 9 l per minute. Rats were euthanized in plastic bag with CO2 for ten minutes, and the death of them was confirmed with checking the heartbeat. Muscle or liver tissue was collected and stored in 95% ethanol at − 80 °C until DNA extraction according to the handbook of the DNeasy Blood & Tissue kit (Qiagen, GER).
All samples were genotyped for 9 microsatellite loci developed for this research (Additional file 1: Table S1). DNA was amplified in a thermocycler (SensoQuest, GER) in 25-μL reaction mixes containing 1 μL of total DNA (about 30 μg), 2.5 μL of 10× LA PCR Buffer II (Mg2+ Plus) (Takara, JPN), 2.5 mM of each dNTP (Takara, JPN), 10 pmol of each primer, and 0.75 U of LA Taq polymerase (Takara, JPN). The 5′ end of the forward primer was labelled with a fluorescent dye (FAM, HEX, and ROX). Cycling conditions were as follows: 5 min at 95 °C, followed by 40 cycles of 30 s at 95 °C, 30 s at the annealing temperature (TA), and 30 s at 72 °C, and then 5 min at 72 °C. The length of the DNA fragments was analyzed manually in GeneMapper v. 4.0 (Applied Biosystems).
The genetic diversity of every locus was characterized by estimating number of different alleles (Na), number of effective alleles (Ne), Shannon’s information index (I), number of private and locally common alleles, allele richness (Rs), observed heterozygosity (Ho), unbiased expected heterozygosity (He) and positive inbreeding coefficient (FIS) using Fstat V184.108.40.206 (https://www2.unil.ch/popgen/softwares/fstat.htm) and GenAlEx 6.41 . Student’s t-Test was used to determine whether two sets of data were significantly different from each other in SPSS 16.0 . Fstat was also used to test for conformance of the genotype to Hardy-Weinberg equilibrium (HWE) for each locus and each population and to explore linkage disequilibrium (LD) between pairs of loci. For all nine loci, Microchecker V2.2.3  was used to detect the presence of null alleles and genotyping errors such as large allele dropout or stuttering using 1000 randomizations. The null-allele-adjusted dataset was compared to the original dataset to explore the effect of null alleles on estimate of genetics differentiation.
Isolation by distance (IBD) was estimated with Mantel’s test in Web Service of IBD , using the correlation between genetic and geographic distances by the regression of pairwise FST/(1-FST) on the natural logarithm (Ln) of straight-line geographic distance. Two approaches were used for testing genetic bottlenecks at the microsatellite loci for each population, we calculated M, a ratio based on the number of alleles to the allelic size range  in Arlequin3.5 , and heterozygosity excess tests implemented in Bottleneck V1.2.02  to assess deviation from Mutation-Drift Equilibrium. The tests were performed using the Infinite Allele Model (IAM) , Stepwise Mutation Model (SMM) , and Two-Phase Model (TPM) , which were run twice for each population, assuming that the percentage of stepwise mutations was 80%. Statistical significance of the tests were assessed for each population across all loci by the Wilcoxon signed-rank test available in Bottleneck .
To identify the population genetic structure of R. tanezumi, two Bayesian model-based cluster analyses were contrasted: Structure  and Geneland . Structure 2.3.4  was first used to infer the number of clusters (Κ) in the dataset without prior information of the sampling locations. A model in which allele frequencies correlated within populations was assumed (λ was set at 1, the default value). The software was run with the option of admixture, allowing for some mixed ancestry within individuals, and α was allowed to vary. We used 20 independent runs for each value of Κ (K = 1 to 10), with a burn-in period of 50,000 iterations and 250,000 replications. The method of Evanno et al.  was used to determine the most likely number of clusters. This approach uses an ad hoc quantity, ΔK, based on the second-order rate of change of the likelihood function between successive values of K. The results of 20 replicate runs for each value of K were combined using the Greedy algorithm of Clumpp 1.1.1 , and summary outputs for each value of K were displayed graphically using Distruct v1.1 . Differentiation among clusters was estimated with Wright’s F-statistics  computed in Fstat according to Weir and Cockerham . Pairwise FST for statistical significance was also tested in Fstat to evaluate genetic differentiation between populations. The number of putative migrants (Nm) per generation between populations was estimated from pairwise FST, Nm ≈ (1-FST)/4FST . Critical significance levels for multiple testing were corrected using Bonferroni correction .
The second method relied on the spatial cluster model implemented in the Geneland package [33, 41]. Following the user’s manual recommendations, the Markov chain Monte Carlo (MCMC) repetitions were set to 100,000, the thinning was set to 100 and the burn-in period was set to 100 (we eliminated the first 100 iterations when the curve was not constant); the number of groups (K) to be tested was set from 1 ± 7. Each individual was assigned to one of K populations (1 ≤ K ≤ 7) based on its multilocus genotypes and spatial coordinates. To confirm that the run was long enough, we obtained 10 different runs and compared the parameter estimates (K, individual population membership, maps). The best result was chosen based on the highest average posterior density.
The Neighbour-Joining (NJ) tree was computed with Powermarker 3.25 , based on microsatellite Nei’s genetic distance . The robustness of population trees was evaluated by bootstrapping over loci. One thousand bootstrapped trees were summarized to obtain a consensus tree, using the Consens module in the Phylip package , followed by editing with Treeview .
Mitochondrial DNA analysis
In total 502 individuals were used to explore sequence polymorphism in the mitochondrial COI gene and D-loop region. The mitochondrial COI gene was used not only in population genetics analysis bust also as DNA barcoding at first in specimens identification. DNA samples were used as templates to amplify a 702-bp fragment of COI and a 546-bp fragment of D-loop according to Robins et al. . PCR products were sequenced directly in two directions with an ABI 3100 automatic sequencer (Perkin-Elmer, Waltham, Massachusetts) using the ABi PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit with AmpliTaq DNA polymerase (Applied Biosystems, Foster City, California).
Sequences were aligned using Muscle  within MEGA 6.0 . Basic sequence statistics, including the number of segregating sites (S), the number of haplotypes per sample (Hp), haplotype diversity (HpD), the average number of nucleotide differences (K) and nucleotide diversity (π) were computed with DnaSP v5 . Neutrality tests used the statistics from the Tajima test, Fu and Li test and Fu test in DnaSP v5.
The phylogenetic relationships of mitochondrial sequences and haplotypes were performed using Bayesian inference (BI), maximum likelihood (ML) and median-joining network (MJ) methods as implemented in Mrbayes 3.2.5 , Raxmlgui 1.5 , and Network 220.127.116.11 , respectively. For the mitochondrial COI sequence, all reported sequences of R. tanezumi in Genbank [52,53,54,55] (Additional file 2: Table S2) were downloaded and added to the dataset for analysis. COI sequences of Rattus rattus (KY754543, JQ668025), R. pyctoris(JF499340), R. argentiventer(HM217484), R. exulans (KC617851), R. andamanensis (KY605368), R. norvegicus (FQ211257, FQ228470), R. nitidus (KR996517, JQ918374) and Mus musculus (AB444046, KC617843) were added into dataset, and Cricetulus migratorius (KU182948, KU182947, KJ466858, JX962138, JX962139) was set as outgroup in the construction of ML and BI trees.
A total of 502 individual rats collected in this study from 18 populations were successfully genotyped at all 9 microsatellite loci. All loci were polymorphic, showing the number of alleles (Na) as 21 (Rt1), 26 (Rt2), 27 (Rt3), 33 (Rt4), 16 (Rt5), 19 (Rt6), 26 (Rt7), 30 (Rt8) and 24 (Rt9) (Additional file 3: Table S3). The average number of alleles per locus ranged from 9.837 (Rt1) to 14.682 (Rt8). The minimum mean number of alleles of all loci was recorded for the NYC population from Tibet (6.56) and the maximum in the RUL population from Yunnan (14.67). For allele richness, the maximum Rs in RUL (12.353) was almost twice the minimum Rs in the LHS population from Tibet (6.14). The observed heterozygosity (Ho) for each locus ranged from 0.28 (Rt4) to 1.00 (Rt9); the minimum Ho was in LHS (0.58) and the maximum Ho in CHQ (0.82) (Table 2). The expected heterozygosity (He) for each locus ranged from 0.37 (Rt1) to 0.94 (Rt4); the minimum He was in LHS (0.67) and the maximum He in RUL (0.91) (Table 2). The same situation also occurred in Ne and I: the minimum value was found in LHS of Tibet and the maximum value was in RUL of Yunnan. Private alleles were mainly detected in the LIH, QNJ, BAS and RUL populations and were not found in SJZ, XIS, SHH, CHQ, SIC, NYC, LHS and XIY populations. Student’s t-Test showed that the mean values of He and Rs for Yunnan’s and other region’s populations had statistically significant differences (He: P = 0.022; Rs: P = 0.005). In general, populations in Yunnan showed a higher genetic diversity than other regions, while the genetic variability of populations in the Tibetan areas and SJZ was low.
Some specimens failed to amplify at one locus while succeeded at the remaining loci, suggesting the presence of null alleles. Null alleles were present in only a few populations for three loci, there were one individual at locus Rt6, two individuals at locus Rt5 and four individuals at locus Rt7. To determine whether the null alleles impacted the population genetic analyses, we performed these analyses both before and after the dataset were adjusted for estimated null allele frequencies. The effect of this treatment was minimal and did not significantly change the degree or statistical significance of the estimated parameters.
Hardy-Weinberg equilibrium and linkage disequilibrium
The Hardy-Weinberg exact tests were performed at nine loci. No locus was in HWE for all of samples assayed. However, at the population level, 28 out of 162 (17.28%) comparisons did not conform to Hardy-Weinberg expectations after sequential Bonferroni correction, and the deviations were associated with a positive inbreeding coefficient (FIS), reflecting heterozygosity deficits. The populations in Yunnan (Southwest China) had the highest number of loci in departure from HWE (21 out of 28), LOY sample had 3 loci in departure from HWE, WZS sample had 2 loci in departure from HWE, SHH and LEZ populations had 1 locus in departure from HWE respectively (Table 3).
Fisher’s exact tests were conducted for linkage disequilibrium within all populations. Only 4 pairs out of 648 comparisons (0.62%) were at linkage disequilibrium after sequential Bonferroni correction, and all three such pairs were found in SHH and LOY populations (Rt1/Rt3, Rt8/Rt9, Rt1/Rt8). No pair of loci appeared in LD in more than one population, suggesting genetic independence between loci.
Tests for genetic bottlenecks yielded mixed results (Table 2). The M ratio was low and indicated historical bottleneck in all populations. Significant deviations from mutation-drift equilibrium conditions under the IAM model were found in all populations except the SJZ, CHS, and LHS populations. In contrast, the test for each population gave non-significant results under the SMM model. According to the TPM model, the RUL population showed heterozygote excess in both percentages of stepwise mutations, and the results for the LOY and QUZ populations suggested a heterozygosity excess only in 80% of stepwise mutations, which indicated that bottlenecks would have occurred recently under pressure of rodent control in these populations.
Genetic differentiation and gene flow
The significant deviations from HWE with heterozygote deficiency and the presence of linkage disequilibrium suggest the presence of population subdivision within the samples (the Wahlund effect). Bayesian analysis divided all individuals into three clusters (posterior probability of Bayesian clustering Ln (D) likelihood score optical for K = 3). Most individuals of LIH, XGL, QNJ, BAS and RUL populations belonged to Cluster I; almost all individuals of WZS, QUZ, SHH, CHQ, XIS, CHS, SJZ, XIY, and SIC populations were of Cluster II; and individuals of NYC and LHS were members of Cluster III. While populations of LOY, LEZ were composed with individuals of clusters I and II (Table 3 and Fig. 2a), which indicated the coexistence of two gene pools in these two populations.
Moderate levels of genetic differentiation were found between populations. The overall FST across all loci was 0.064 and was highly significant (P < 0.001). We then tested the genetic heterogeneity among populations within and between the clusters. We chose populations in which more than 70% of individuals were assigned to cluster I or II for analysis. Cluster I included 5 populations (LIH, XGL, QNJ, BAS and RUL), while cluster II included 9 populations (QUZ, SHH, CHQ, XIS, CHS, SJZ, and XIY). Within each cluster, pairwise FST was moderate: 0.020 (CHQ-CHS) to 0.077 SHH-SIC) in cluster I and 0.022 (LIH-RUL) to 0.048 (LIH-XGL) in cluster III. Among the clusters, a higher level of differentiation was demonstrated, with FST from 0.051 (QUZ-QNJ) to 0.168 (LIH-LHS). Nm estimates among populations were higher (2.980 to 12.005) within clusters and lower (1.24 to 4.49) among clusters (Additional file 4: Table S4).
Analysis using Geneland yielded a modal number of populations between 2 and 7, with a higher proportion of K = 4 (posterior probability = 0.52). The run with the highest average posterior density was selected (− 14,771.26). Cluster 1 included three populations from Southwest of Yunnan (RUL, BAS and LIH), and cluster 4 included two populations from North of Yunnan (XGL and QNL). Cluster 2 was constituted by populations from Tibet (LHS and NYC). And cluster 3 was composed of other populations from south, east and middle of China (Fig. 2b).
Tests of isolation by distance were performed for each population cluster and for all the populations together. For all populations, weak correlations were detected between genetic and geographic distances (R2 = 0.111, P = 0.025), while for cluster I and II, the correlations were stronger (R2 = 0.527, P < 0.01). The correlations for cluster III were not significant (R2 = 0.209, P = 0.17). These results suggest that geographic distance weakly contributes to the genetic differentiation in R. tanezumi populations.
The unrooted NJ dendrogram based on Nei’s distance depicted the structure of population differentiation (Fig. 1b). It demonstrated that the populations of Yunnan clustered first and then showed a close relationship with populations from coastal areas (LOY of Zhejiang, LEZ of Guangdong, WZS of Hainan, and QUZ of Fujian). At the other end of the tree were populations from Central China and Tibet (LHS and NYC).
DNA sequencing analysis
MtDNA polymorphism and divergence
Nucleotide sequences of mtDNA COI (702 bp) and D-loop (546 bp) were obtained for 502 R. tanezumi specimens from 18 populations. Overall, the polymorphism in the COI region was moderate, with 37 segregating sites defining 38 haplotypes. The nucleotide diversity indices of the XGL population from Yunnan were the highest (HpD = 0.842, K = 3.458, π = 0.00493), while the lowest indices occurred in the WZS population from Hainan (HpD = 0.061, K = 0.061, π = 0.00009). The SHH and LHS populations from Shanghai and Tibet also showed lower genetic diversity. The most frequent haplotype, H1 (48.1%), was detected in all geographic populations except LIH and RUL. For the D-loop region, the polymorphism was much higher than in COI. There were 60 segregating sites defining 70 haplotypes. The highest nucleotide diversity indices were similarly among the QNJ population in Yunnan (HpD = 0.905, K = 11.315, π = 0.02076), while the lowest indices occurred in the XIS population from Hubei (HpD = 0.148, K = 0.151, π = 0.00028) (Table 4). Populations in Tibet (NYC and LHS) clearly manifested low genetic polymorphism compared with other regions. In common with the COI results, the most frequent D-loop region haplotype, H1 (44.5%), was detected in all geographic populations but was absent from LIH and RUL, as well as BAS. Tajima’s D, and Fu & Li’s statistics were negative for both genes (D = − 1.724, D* = − 1.722 and F* = − 2.088 for COI; D = − 0.288, D* = − 0.334 and F* = − 0.376 for D-loop) but not statistically significant, while Fu’s Fs statistic was significantly negative (Fs = − 26.381, P < 0.001 for COI; Fs = − 22.993, P < 0.001 for D-loop).
The Median-Joining Networks show a significant geographical component to variation. For COI, the network shows three distinct haplogroups. Haplotypes from non-Yunnan regions form a star-like pattern centred at H1, and haplotypes from southwest and northeast regions of Yunnan form two star-like patterns centred around H10 and H17, respectively. Furthermore, H4 (only found in LEZ) and H24 (found in both QNJ and LOY) are most closely related to centre H17 (Fig. 3b, Additional file 5: Table S5 for details). The structure of the network for D-loop (Additional file 5: Table S5) is approximately equivalent to that for COI, but the network is more complicated.
For the mitochondrial COI data, the BI analysis indicated that the whole haplotypes, including haplotypes of reported COI sequences, were separated into three middle supported clades (Fig. 3a). These clades were named after the three centre haplotypes of the networks (Fig. 3b). The H1 and H10 clades were more closely related than to the h17 clade (Fig. 3a). These reported sequences from Southeast Asia and Africa did not share haplotypes with sequences of present research.
Population genetics of R. tanezumi
In Yunnan R. tanezumi is controlled almost every year in either urban or rural areas because it is host of Yersinia pestis, the pathogen of bubonic plague. As a result, the high departure from HWE were observed in all populations from Yunnan, which indicated a population subdivision (Table 3). The results of genetic bottlenecks test (M ratio test and IAM modle) also suggested that bottleneck occurred in most of populations after control with high pressure.
Rattus tanezumi population was first found at Shijiazhuang of Hebei at 2004 . The genetic diversity of the population in present research showed that SJZ population and Tibet populations had lower genetic variability (Rs and He in Table 2). Because of the short colonization history of SJZ population, the low genetic diversity could be attributed to the founder effect. The low genetic diversity of populations from Tibet (LHS and NYC) could only be explained as a result of the founder effect as there were no rodent control measures in Tibet. The recent demographic expansion of these populations was also supported with the results of Tajima’s D, and Fu & Li’s statistics.
The origin area of R. tanezumi
Two different analysis methods based on microsatellites data showed different results. According to the result with Structure, all samples of R. tanezumi could be divided into three clusters, and populations collected from Yunnan belonged to cluster III. While the results of Geneland divided populations of Yunnan into two groups, the first including three populations from Southwest Yunnan and the second including two populations from North Yunnan, which was concurrent with the results of haplotype network based on mitochondrial COI sequence. The haplotype network of COI shows two haplotype groups from Yunnan, one from Southwest and another from North Yunnan (Fig. 3b, Additional file 5: Table S5 for detail). The haplotype group from North Yunnan were scarce in other populations.
Populations of R. tanezumi from Yunnan showed distinctive characteristics in this study: first, they had many alleles in microsatellite analysis (average Na for five populations was 12.78, Table 2); second, they had a high number of haplotypes in mtDNA analysis (Table 4). These characters mean that the populations of R. tanezumi from Yunnan have high genetic diversity compared with populations from other areas of China.
Yunnan is located at the border area of Southwest China and Southeast Asia. These areas are special regions with various habitats both for aquatic and terrestrial organisms [56,57,58,59] because of the intricate geological and climatic history and were therefore regarded as two hotspots of high biodiversity and endemism . In these areas, the young uplifting plateau (Tibetan Plateau), the old eastern escarpment and the southeast slope (Hengduan Mountains) yield highly heterogeneous habitats not only to ‘living fossil’ species, such as the Laotian rock rat Laonastes aenigmamus  but also to some new speciated species, for example Apodemus ilex . Besides the species level diversity, high intraspecific genetic diversity of some species in these areas was also frequently reported [63, 64] because these areas were refuges for different species during glacial periods. According to Musser and Carleton , R. tanezumi is indigenous to the north of the India subcontinent, South and Central China, and mainland Indochina in which Yunnan is the central area. Therefore, according to the genetic diversity of Yunnan populations of R. tanezumi, the location and the biodiversity background of Yunnan, it could be assumed that Yunnan and the adjacent area of Southeast Asia could be regarded as the original centre of R. tanezumi.
The expansion route of R. tanezumi into mainland China
Although Yunnan is adjacent to Guizhou, Sichuan and Tibet, the rat populations from these provincial areas do not show a close relationship (Additional file 4: Table S4). The microsatellite analysis shows that there is no close relationship between populations of Yunnan, and those from Guizhou, Sichuan and Tibet (Fig. 1b and Fig. 2). The haplotype analysis of mtDNA sequences shows that the populations of Yunnan have a high diversity of haplotypes (H10 and H17 groups in Fig. 3), while populations from these three provincial areas share only a few haplotypes (H1 group). Populations from Tibet show only two haplotypes of COI and lack any particular commonality with Yunnan haplotypes (Fig. 3b), although the geographic distance between Tibet and Yunnan is very short (Fig. 1a). This result suggests that R. tanezumi in Yunnan did not spread to Tibet naturally or with the help of caravan transportation.
The unrooted NJ dendrogram based on genetic distance demonstrated a visualized relationship of populations from different areas of China (Fig. 1b). Based on the relationship, the early expansion of R. tanezumi in China could be inferred as having two steps: the first step as spreading through coastal areas via ocean-based shipping transportation, and the second as expanding from the coastal areas into Central China via shipping along the Yangzi River. Aplin et a.l reported that R. tanezumi expanded in West Pacific with the diaspora of Austronesian-speakers about 4000 years ago , which supported the possibility of expanding with ocean shipping transportation in that period. Because the populations of Tibet are closely related to the populations of Sichuan, the sources of the Tibetan populations are presumed to be Sichuan instead of Yunnan, and the route of expansion was likely the modern Chuan-Zang Highway instead of the Tea Horse Ancient Road.
Data of allelic patterns across populations showed in detail that three populations (SJZ, LHS and NYC) have a low number of alleles and effective alleles (Na and Ne) compared with other populations, with the two populations from Tibet (NYC and LHS) having the lowest values (Table 2). These data supported that the populations from Tibet are newly imported and, as a result, suffer from the founder effect and lose genetic diversity during importation and colonization. The population from Hebei (SJZ) also has few alleles and effective alleles, and was only higher than those of populations from Tibet (Table 2). These data reveal that the SJZ is also a newly colonized population with low genetic diversity. As R. tanezumi was found in Tibet before 1995 , and this species was not reported in Hebei until the 2000s, SJZ population could be considered a young population compared with the population from Tibet. The relative high genetic diversity of SJZ compared with that of NYC and LHS suggests that there were more individuals imported to Hebei from South and Central China in a short time period through the complex modern transportation system. The convenient transportation systems between North and Central China give more chances for R. tanezumi to expand to North China [67, 68] than to West China.
All reported COI sequences in Genbank were added in the phylogeny analysis. Most of these sequences came from samples of Southeast Asia, such as Thailand and Vietnam [52, 54, 55]. The result showed that the haplotypes of these sequences belonged to the three haplotype groups (Additional file 2: Table S2). And these sequences did not share haplotype with sequences from China. This result suggested that there was no genetic exchange between populations from sampling areas of China and Southeast Asia, although there were imported individuals intercepted from international cargo ships from different ports of China .
The dispersal ability of R. tanezumi and other Rattus species
The Mantel test for IBD presented interesting results. The dispersal pattern of R. tanezumi in Yunnan does not fit IBD, while populations in other areas of China show an obvious IBD pattern. Because of the complex landscape, the dispersal of R. tanezumi in Yunnan should be restricted by geographic barriers, such as valleys and mountains, and thus could not fit IBD. The dispersal of R. tanezumi in other areas was also expected to lack IBD because R. tanezumi, as a typical commensal rat, can use human transportation to achieve long-distance dispersal. However, the Mantel test shows a strong pattern of IBD for these populations. This result suggests that long-distance dispersal events were rare in the historical expansion of R. tanezumi in China, even though this species is widely distributed in China. However, in recent decades, the northward expansion of R. tanezumi has become obvious in North China, such as in Shanxi and Qinghai [11, 68], which suggests that modern, quick transportation has begun to give more opportunities to R. tanezumi in long-distance dispersal.
Some species of Rattus are regarded as commensal species, including R. rattus, R. norvegicus, and R. tanezumi. These species are all able to utilize human-made environments and transportation tools for survival and expansion. However, when two Rattus species are analyzed together for their commensal ability, there are differences between the different species. R. rattus of the Western Ghats of India shows a stronger dispersal ability compared with R. satarae, a native species of the southwest Indian Peninsula . While R. rattus in the United States shows low long-distance dispersal ability when it competes with R. norvegicus . In China R. norvegicus and R. tanezumi are two important commensal rodents. The former occurs mainly in southern China, and the latter is recorded in all provincial areas except Tibet . In the last half-century, R. norvegicus invaded Xinjiang along the railway and expanded through rural areas of Xinjiang with the help of modern transportation [19, 73]. R. tanezumi has been reported in some provinces in North and Northwest China in recent decades [11, 68]. Compared to the northwestward expansion of R. norvegicus in China, R. tanezumi shows a low speed in spreading northward in recent years. This phenomenon suggests that R. tanezumi is not as fit as R. norvegicus for long-distance dispersal using modern transportation systems.
R. tanezumi populations in mainland China originated from Yunan and adjacent Southeast Asia areas. The ancient trade in Southwest China based on caravans and the Tea Horse Ancient Road did not affect the range expansion of R. tanezumi in this area. The early dispersal of this species depended on shipping transportation over the sea and then expanded from coastal areas into Central China along the Yangzi River. The populations of Tibet were imported from Sichuan in the modern era via the Chuan-Zang highway rather than along the Tea Horse Ancient Road.
Infinite Allele Model
Isolation by distance
Markov chain Monte Carlo
Stepwise Mutation Model
Htwe NM, et al. Breeding ecology of rice field rats, Rattus argentiventer and R. tanezumi in lowland irrigated rice systems in the Philippines. Agric Ecosyst Environ. 2012;161:39–45.
Singleton GR, Brown PR, Jacob J. Ecologically-based rodent management: its effectiveness in cropping systems in South-East Asia. Njas-wageningen Journal of Life Sciences. 2004;52:163–71.
Miller RW, et al. Biology and management of rodent communities in complex agroecosystems - rice terraces. In: Singleton G, Joshi R, Sebastian L, editors. Philippine rats: ecology and management. Luzon: Philippine Rice research institute (PhilRice); 2008. p. 25–36.
Barnett SA. The story of rats. Their impact on us, and our impact on them. Crows Nest: Allen & Unwin; 2001.
Blasdell K, et al. Rodent-borne hantaviruses in Cambodia, Lao PDR, and Thailand. EcoHealth. 2011;8:432–43.
Huang L, et al. Analysis of gamasid mites (Acari: Mesostigmata) associated with the Asian house rat, Rattus tanezumi (Rodentia: Muridae) in Yunnan Province, Southwest China. Parasitol Res. 2013;112:1967–72.
Plyusnina A, Ibrahim IN, Plyusnin A. A newly recognized hantavirus in the Asian house rat (Rattus tanezumi) in Indonesia. J Gen Virol. 2009;90:205–9.
Huang WJ, Chen YX, Wen YX. Glires of China. Shanghai: Publishing House of Fudan University; 1995.
Hou Y, Jiang X. The investigation of Rattus tanezumi in Shijiazhuang during 2004- 2007. Chinese Journal of Vector Biology and Control. 2008;19:125.
Yang X, et al. The distribution characteristics and development tendency of R. falvipectus as an invasive species in Shanxi. Journal of Shanxi. Agric Sci. 2011;39:462–4.
Ma Y, et al. Spatial niche analysis of the small mammals in Haidong region of Qinghai Province. China Chinese Journal of Zoology. 2012;46:126–30.
Robins JH, et al. Identifying Rattus species using mitochondrial DNA. Mol Ecol Notes. 2007;7:717–29.
Bastos ADS, et al. Genetic monitoring detects an overlooked cryptic species and reveals the diversity and distribution of three invasive Rattus congeners in South Africa. BMC Genet. 2011;12:26.
Conroy C, et al. Cryptic genetic diversity in Rattus of the San Francisco Bay region, California. Biol Invasions. 2013;15:741–58.
Aplin KP, et al. Multiple geographic origins of commensalism and complex dispersal history of black rats. PLoS One. 2011;6:e26357.
Tollenaere C, et al. Phylogeography of the introduced species Rattus rattus in the western Indian Ocean, with special emphasis on the colonization history of Madagascar. J Biogeogr. 2010;37:398–410.
Holdaway RN. A spatio-temporal model for the invasion of the New Zealand archipelago by the Pacific rat Rattus exulans. J R Soc N Z. 2010;29:91–105.
Wilmshurst JM, et al. Dating the late prehistoric dispersal of Polynesians to New Zealand using the commensal Pacific rat. Proc Natl Acad Sci U S A. 2008;105:7676–80.
Yu X, et al. Notes on rodent composition and its changes both along the railway line and on the trains in Xinjiang, the discovery of Rattus norvegicus in this region. Zool Res. 1980;1:135–8.
Freeman M, Ahmed S. Tea Horse Road: China’s ancient trade road to Tibet, ed. S Ahmed. Bangkok: River Books; 2011.
Peakall R, Smouse PE. Genalex 6: genetic analysis in excel. Population genetic software for teaching and research. Mol Ecol Resour. 2006;6:288–95.
Spss I. SPSS 16.0 Base User's Guide. 2007: Prentice Hall Press. 741–743.
Van Oosterhout C, et al. MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes. 2004;4:535–8.
Jensen JL, Bohonak AJ, Kelley ST. Isolation by distance. web service BMC Genetics. 2005;6:13.
Garza JC, Williamson EG. Detection of reduction in population size using data from microsatellite loci. Mol Ecol. 2001;10:305–18.
Excoffier L, Lischer H. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and windows. Mol Ecol Resour. 2010;10:564–7.
Piry S, Luikart G, Cornuet J. Computer note. BOTTLENECK: a computer program for detecting recent reductions in the effective size using allele frequency data. The Journal of Heredity. 1999;90:502–3.
Crow JF, Kimura M. An introduction to population genetics. New York, NY: Harper & Row; 1970.
Ohta T, Kimura M. A model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a finite population. Genet Res. 1973;22:201.
Di Rienzo A, et al. Mutational processes of simple-sequence repeat loci in human populations. Proc Natl Acad Sci U S A. 1994;91:3166.
Peery MZ, et al. Reliability of genetic bottleneck tests for detecting recent population declines. Mol Ecol. 2012;21:3403–18.
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.
Guillot G, Estoup A. GENELAND: a computer package for landscape genetics. Mol Ecol Notes. 2005;5:712–5.
Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol. 2005;14:2611–20.
Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23:1801.
Rosenberg NA. Distruct: a program for the graphical display of population structure. Mol Ecol Resour. 2004;4:137–8.
Wright S. Evolution and the genetics of populations, Vol. 4: variability within and among natural populations. Chicago: University of Chicago Press; 1978.
Weir BS, Cockerham CC, Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population-structure. Evolution. 1984;38:1358–70.
Slatkin M. A measure of population subdivision based on microsatellite allele frequencies. Genetics. 1995;139:457–62.
Holm S, Simple Sequentially A. Rejective multiple test procedure. Scand J Stat. 1979;6:65–70.
Guillot G. Inference of structure in subdivided populations at low levels of genetic differentiation - the correlated allele frequencies model revisited. Bioinformatics. 2008;24:2222–8.
Liu K, Muse SV. PowerMaker: An integrated analysis environment for genetic maker analysis 2005; 21: 2128–9.
Nei M. Genetic polymorphism and the role of mutation in evolution. In: Koehn PK, Nei M, editors. Evolution of genes and proteins. Sunderland: Sinauer Assoc; 1983. p. 165–90.
Plotree D, Plotgram D. PHYLIP-phylogeny inference package (version 3.2). Cladistics-the international journal of the Willi Hennig. Society. 1989;5:163–6.
Page RDM. Visualizing phylogenetic trees using TreeView. Curr Protoc Bioinformatics, 2002. Chapter 6: Unit 6.2.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.
Tamura K, et al. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2.
Ronquist F, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–42.
Silvestro D, Michalak I. raxmlGUI: a graphical front-end for RAxML. Organisms Diversity & Evolution. 2012;12:335–7.
Bandelt HJ, Forster P, Röhl A. Median-joining networks for inferring intraspecific phylogenies. Molecular Biology & Evolution. 1999;16:37–48.
Balakirev A, Rozhnov V. Contribution to the species composition and taxonomic status of some Rattus inhabiting southern Vietnam and Sundaland. Russian Journal of Theriology. 2012;11:33–45.
Nakamura I, et al. Cross-reactivity of secondary antibodies against African rodents and application for sero-surveillance. J Vet Med Sci. 2013;75:819–25.
Latinne A, et al. Diversity and endemism of Murinae rodents in Thai limestone karsts. Syst Biodivers. 2013;11:323–44.
Pagès M, et al. Revisiting the taxonomy of the Rattini tribe: a phylogeny-based delimitation of species boundaries. BMC Evol Biol. 2010;10:184.
Fu J, Zeng X. How many species are in the genus Batrachuperus? A phylogeographical analysis of the stream salamanders (family Hynobiidae) from southwestern China. Mol Ecol. 2008;17:1469–88.
Lu L, et al. Molecular phylogeny and the underestimated species diversity of the endemic white-bellied rat (Rodentia: Muridae: Niviventer) in Southeast Asia and China. Zool Scr. 2015;44:475–94.
Qu Y, et al. Long-term isolation and stability explain high genetic diversity in the eastern Himalaya. Mol Ecol. 2014;23:705.
Zeng L, et al. Out of southern East Asia of the Brown rat revealed by large-scale genome sequencing. Molecular Biology & Evolution. 2018;35:149–58.
Myers N, et al. Biodiversity hotspots for conservation priorities. Nature. 2000;403:853–8.
Jenkins PD, et al. Morphological and molecular investigations of a new family, genus and species of rodent (Mammalia: Rodentia: Hystricognatha) from Lao PDR. Syst Biodivers. 2005;2:419–54.
Liu X, et al. Molecular phylogeny and taxonomy of wood mice (genus Apodemus Kaup, 1829) based on complete mtDNA cytochrome b sequences, with emphasis on Chinese species. Molecular Phylogenetics & Evolution. 2004;33:1–15.
Chen W, et al. Complex topographic configuration in the Hengduan Mountains shaped the phylogeographic structure of Chinese white-bellied rats. J Zool. 2011;284:215–23.
Ge D, et al. An endemic rat species complex is evidence of moderate environmental changes in the terrestrial biodiversity Centre of China through the late Quaternary. Sci Rep. 2017;7:46127.
Musser GG, Carleton MD. Superfamily Muroidea. In: Wilson DE, Reeder DM, editors. Mammal species of the World a taxonomic and geographic reference. Baltimore, Maryland: Johns Hopkins University Press; 2005.
Aplin KP, et al. Multiple geographic origins of commensalism and complex dispersal history of black rats. PLoS One. 2011;e26357:6.
Li B, et al. Investigation of rodents in rural areas along Qinghai-Tibet railway inside Qinghai province, China. Chinese Journal of Vector Biology & Control. 2013;24:117–20.
Lu J, Wang T. Studies on the glires (Rodentia and Lagomorpha) fauna and its division of Henan Province. Acta Theriologica Sinica. 1996;16:119–28.
Lu Y, et al. Analysis of imported rodents at Jiangsu ports from 2004-2011. Chinese Journal of Hygienic Insecticides & Equipments. 2015:288–90.
Varudkar AS, Ramakrishnan U. Commensalism facilitates gene flow in mountains: a comparison between two Rattus species. Heredity. 2015;115:253.
Lack JB, et al. Comparative phylogeography of invasive Rattus rattus and Rattus norvegicus in the U.S. reveals distinct colonization histories and dispersal. Biol Invasions. 2013;15:1067–87.
Wang YX. A complete checklist of mammal species and subspecies in China: a taxonomic and geographic reference. Beijing: China Forestry Publishing House; 2003.
Liao L, Bate W, Yan S. Migration patterns of Rattus norvegicus in desert area. Chinese Journal of Vector Biology and Control. 2014;25:162–4.
We thank Linlan Wang of Fujian CDC, Yi Huang of Hunan CDC, Wenfeng Cai of Yunnan IEDC, Jianrong Yi of Guangdong CDC, Zhaobing Liu of Guizhou CDC, and Pei’en Leng of Shanghai CDC for their help in sample collection. We appreciate the help of Dr. Shi Chengming in data analysis. The manuscript has been improved by comments from the editor and anonymous reviewers.
This work was supported by Ministry of Science and Technology of the People’s Republic of China through National Key R&D Program of China (2016YFC1200502 and 2016YFC1200503 for LL) and the Special Infectious Disease Program (2017ZX10303404005 for QL). The role of the funding bodies is limited to direct funding of the project activities in the field and laboratory that results in this manuscript.
Availability of data and materials
The STR datasets generated during this study are included in this published article (Additional file 6: Figure S1). All mtDNA sequences generated in this study have been deposited in GenBank under accession numbers: MH303212 - MH304211.
Ethics approval and consent to participate
Ethical approval for this study was obtained from the Ethical Committee of National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention.
All rats were live trapped in rodents control in residential areas, which were not privately owned or protected. No specific permits were required for the described field studies.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Microsatellite loci of R. tanezumi and primers used in this research. (XLSX 10 kb)
Table S2. List of haplotypes used in the phylogram of Fig. 3, haplotype 39–129 were found in downloaded sequences from Genbank. (XLSX 14 kb)
Table S3. Genetic variability at nine microsatellite loci in R. tanezumi populations from China. (XLSX 17 kb)
Table S4. Pairwise genetic distance (FST) and gene flow (Nm) for 15 populations based on microsatellite data. (XLSX 11 kb)
Table S5. The haplotype networks of COI and Dloop sequences. (XLSX 64 kb)
Figure S1. STR data. (TIF 6056 kb)