
Mining of favorable alleles for seed reserve utilization efficiency in Oryza sativa by means of association mapping



Wet direct-seeded rice is a possible alternative to conventional puddled transplanted rice; the former uses less water and reduces labor requirements. Improving seed reserve utilization efficiency (SRUE) is a key factor in facilitating the application of this technology. However, the QTLs controlling this trait are poorly investigated. In this study, a genome-wide association study (GWAS) was conducted using a natural population composed of 542 accessions of rice (Oryza sativa L.) which were genotyped using 266 SSR markers. Large phenotypic variations in SRUE were found in the studied population.


The average SRUE over 542 accessions across two years (2016 and 2017) was 0.52 mg mg−1, ranging from 0.22 to 0.93 mg mg−1, with a coefficient of variation of 22.66%. Overall, 2879 marker alleles were detected in the population by 266 pairs of SSR markers, indicating large genetic variation in the population. Using the general linear model (GLM) method, 13 SSR marker loci associated with SRUE were detected, and two of the 13 loci (RM7309 and RM434) were also detected by mixed linear model (MLM) analysis, with percentage of phenotypic variation explained (PVE) greater than 5% in both years. The 13 association loci (P < 0.01) were located on all chromosomes except chromosome 11, with PVE ranging from 5.05% (RM5158 on chromosome 5) to 12% (RM297 on chromosome 1). Association loci RM7309 on chromosome 6 and RM434 on chromosome 9, revealed by both models, were detected in both years. Twenty-three favorable alleles were identified, with phenotypic effect values (PEV) ranging from 0.10 mg mg−1 (RM7309–135 bp on chromosome 6) to 0.45 mg mg−1 (RM297–180 bp on chromosome 1). RM297–180 bp showed the largest PEV (0.44 mg mg−1 in 2016 and 0.45 mg mg−1 in 2017); 6.72% of the accessions carried this allele, and the typical carrier accession was Manyedao. It was followed by RM297–175 bp (0.43 mg mg−1 in 2016 and 0.44 mg mg−1 in 2017).


Nine novel association loci for SRUE were identified, compared with previous studies. The optimal parental combinations for pyramiding more favorable alleles for SRUE were selected and could be used for breeding rice accessions suitable for wet direct seeding in the future.


Rice (Oryza sativa L.) is the basic daily food for billions of people worldwide. It is considered to be the oldest domesticated grain (~10,000 years), and its cultivation represents the largest single use of land, covering 9% of the earth’s arable land (158.8 million hectares). Asia accounts for over 90% of the world’s rice production, with China (208.6 million metric tons), India (109.15 million metric tons) and Indonesia (74.2 million metric tons) producing the bulk of the continental production [1].

To keep up with the accelerated development of the economy, labor force migration, the decline in fresh water quality and volume, and changing crop cultivation practices and mechanization, adopting direct seeding technology in rice cultivation has become a necessary transformation. Wet direct seeding involves sowing pre-germinated seeds, with a radicle 1 to 3 mm long, on or into puddled soil, and is proving to be a promising technology. The essence of this technology is seedling vigor, which can be considered the product of three components: (1) initial seed weight, (2) the fraction of seed reserves that is mobilized, and (3) the conversion efficiency of mobilized seed reserves to seedling tissues [2, 3]. Seed reserve utilization efficiency (SRUE) is an important characteristic of seedling vigor, since seedling growth can be limited by decreased mobilization of seed reserves and/or by decreased conversion efficiency of the mobilized reserves.

The physiological characteristics of SRUE have been evaluated in different species such as Lithocarpus densiflora [4], wheat [2, 5], maize [6, 7], soybean [3] and sorghum [8]. In rice, Cheng et al. (2013) identified thirteen additive QTLs (on chromosomes 2, 4, 8 and 12) and two pairs of epistatic QTLs (on chromosomes 7, 8 and 12) for SRUE using recombinant inbred lines (RILs) derived from Jiucaiqing and IR26, and found that qSRUE4.3 explained more than 20% of the total phenotypic variance [9]. Cheng et al. (2015) found that α-amylase (OsAmy3B, OsAmy3C, and OsAmy3E) and sucrose synthase (OsSus2, OsSus3, and OsSus4) genes might be involved in seed reserve utilization [10]. However, linkage mapping is limited by the fact that only two alleles can be studied at any given locus in bi-parental crosses of inbred lines.

Association mapping based on linkage disequilibrium (LD) in natural populations is widely used in the plant kingdom as a method to discover favorable alleles for many traits, including agronomic traits [11,12,13,14,15,16,17,18,19,20,21,22] and seed vigor traits [23,24,25]. However, no studies have been undertaken to discover favorable alleles for SRUE in natural rice populations. The aims of this study were (1) to investigate the phenotypic variation of SRUE in a natural population composed of 542 accessions of Oryza sativa, (2) to mine favorable alleles for SRUE for improving accessions suitable for mechanized wet direct seeding, and (3) to provide optimal parental combinations for pyramiding excellent alleles into a single plant.


Phenotypic variations of SRUE in the natural population

The mean value, standard deviation, skewness, and kurtosis for SRUE measured in 542 rice accessions in 2016 are shown in Table 1. Analysis of variance showed significant genetic differences among the 542 rice accessions at the probability level of α = 0.01. The average SRUE over the 542 accessions was 0.52 mg mg−1, ranging from 0.21 mg mg−1 to 0.96 mg mg−1, with a coefficient of variation of 23.80%. Of the total accessions, 31.55% had SRUE values larger than 0.55 mg mg−1 and 30.44% had SRUE values greater than 0.65 mg mg−1. The generalized heritability of SRUE was 99.72%, indicating that the variation in SRUE was little affected by the environment. The mean, range of phenotypic values, generalized heritability and coefficient of variation of SRUE in 2017 were similar to those of 2016 (Table 1). These results indicate that abundant genetic variation in SRUE exists in this natural population.

Table 1 Descriptive statistics of SRUE* (mg mg−1) in 542 rice accessions across 2 years

Molecular marker allele diversity of SSR loci in the natural population

The genetic diversity of all 542 rice accessions was evaluated using 266 SSR markers distributed across the whole genome. Different sizes of DNA fragments (Additional file 1: Figure S1) amplified by the same pair of SSR primers among the 542 accessions were regarded as allelic variants of that primer pair. In total, 2879 alleles were detected in the 542 rice accessions. The average number of alleles per SSR locus was 10.82, ranging from 2 (RM437 on chromosome 5, RM7163 on chromosome 11) to 38 (RM3428 on chromosome 11) (Additional file 5: Table S1). The average genetic diversity per locus over all 266 SSR loci was 0.74, ranging from 0.08 (RM7163 on chromosome 11) to 0.95 (RM3428 on chromosome 11), with most values between 0.75 and 0.95. The average polymorphism information content (PIC) value was 0.71, ranging from 0.08 (RM7163 on chromosome 11) to 0.95 (RM3428 on chromosome 11). PIC is an indicator of the degree of microsatellite DNA variation, reflecting the level of microsatellite DNA polymorphism. Two hundred and thirty-one SSR loci (86.84% of all SSR loci used) were highly informative (PIC > 0.5), 29 loci (10.90%) were moderately informative (0.5 > PIC > 0.25), and 6 loci (2.26%) were slightly informative (PIC < 0.25) (Additional file 5: Table S1).
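The per-locus statistics above can be illustrated with a short sketch. It assumes the uncorrected textbook definitions (gene diversity as 1 − Σp², PIC as defined by Botstein et al.); the software used in the study may apply sample-size corrections, and the allele frequencies below are hypothetical, not taken from the data set.

```python
from itertools import combinations


def gene_diversity(freqs):
    """Expected heterozygosity at one locus: 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2."""
    cross = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return gene_diversity(freqs) - cross


def informativeness(pic_value):
    """Classes as used in the text; boundary handling at exactly 0.5 or 0.25 is an assumption."""
    if pic_value > 0.5:
        return "highly informative"
    if pic_value > 0.25:
        return "moderately informative"
    return "slightly informative"


# Hypothetical loci: two equally frequent alleles, and ten equally frequent alleles.
two_alleles = [0.5, 0.5]
ten_alleles = [0.1] * 10

for name, freqs in (("two alleles", two_alleles), ("ten alleles", ten_alleles)):
    value = pic(freqs)
    print(name, round(gene_diversity(freqs), 3), round(value, 3), informativeness(value))
```

With two equally frequent alleles the PIC is 0.375 (moderately informative), which shows why loci with only two alleles, such as those at the lower end of the range reported above, cannot reach the highly informative class.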

Genetic structure of the population used

The genetic structure of the total population of rice accessions was analyzed using the SSR marker data and STRUCTURE 2.2 software. The log-likelihood function values increased with the number of subpopulations (Fig. 1a). The number of subpopulations (K) was therefore determined by the ∆K value (the rate of change of the log-likelihood values between successive K values), calculated using the method of Evanno et al. (2005) [26]. Fig. 1b shows that the ∆K value reached its maximum at K = 6. Therefore, the entire population can be divided into 6 subpopulations. Each accession was assigned to the corresponding subpopulation according to its Q value (Q > 0.9) (Additional file 6: Table S2). Based on the Q value, the 542 rice accessions were grouped into six subpopulations, that is, POP1 (94 accessions), POP2 (89 accessions), POP3 (81 accessions), POP4 (68 accessions), POP5 (83 accessions) and POP6 (91 accessions), plus an admixed group (36 accessions). The posterior probability of each accession belonging to the six subpopulations is shown in Fig. 2.
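As an illustration of the ∆K criterion, the following minimal sketch computes ∆K from replicate log-likelihood values. It assumes the commonly used form of the Evanno statistic (absolute second difference of the mean LnP(D), divided by the standard deviation of LnP(D) across runs at that K); the function name and the log-likelihood values are hypothetical.

```python
from statistics import mean, stdev


def delta_k(lnp_by_k):
    """Return {K: deltaK} for every K that has both neighbours K-1 and K+1.

    lnp_by_k maps K to the list of LnP(D) values from replicate runs.
    deltaK = |mean L(K+1) - 2 * mean L(K) + mean L(K-1)| / sd(L(K)).
    A standard deviation of zero at some K would need special handling.
    """
    means = {k: mean(values) for k, values in lnp_by_k.items()}
    result = {}
    for k in sorted(lnp_by_k):
        if k - 1 in means and k + 1 in means:
            second_difference = abs(means[k + 1] - 2 * means[k] + means[k - 1])
            result[k] = second_difference / stdev(lnp_by_k[k])
    return result


# Hypothetical replicate log-likelihoods for K = 2..5 (three runs each).
runs = {
    2: [-1000.0, -1002.0, -998.0],
    3: [-900.0, -901.0, -899.0],
    4: [-880.0, -882.0, -878.0],
    5: [-870.0, -875.0, -865.0],
}

dk = delta_k(runs)
best_k = max(dk, key=dk.get)
print(dk, best_k)
```

In this toy data set the log-likelihood keeps rising with K, as in Fig. 1a, but the rate of change peaks at K = 3, so ∆K selects 3 rather than the largest K tested.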

Fig. 1

Changes with the number of subpopulations in a the log-likelihood function value and b the ∆K value

Fig. 2

Assignment of all 542 rice accessions to the six subpopulations defined by STRUCTURE software. Identified subpopulations are POP1 (red), POP2 (green), POP3 (navy blue), POP4 (yellow), POP5 (purple) and POP6 (light blue)

Furthermore, each subpopulation was found to consist mainly of accessions with the same geographic origin. For example, POP1 accessions were from Jiangsu Province, China, and from Vietnam (Tej and Indica); POP2 consisted mostly of modern cultivars bred in north-central Jiangsu (Tej); POP3 consisted mostly of high-quality accessions from Jiangsu Province (Tej); POP4 contained tall, late-maturing accessions from the Taihu Lake Basin and a small number of northeastern accessions (Tej); POP5 accessions were mainly from Vietnam (Indica); and POP6 contained tall, early-maturing Taihu accessions (Tej).

To verify the reliability of the population structure partitioning, a neighbor-joining (NJ) tree was constructed for the total population of 542 rice accessions using Nei’s (1983) genetic distance [27], calculated with PowerMarker 3.25 and viewed with MEGA 4.0. The NJ tree (Fig. 3) shows that the 542 rice accessions clearly cluster into 6 subpopulations. This is consistent with the analysis based on the STRUCTURE model, indicating that the division of the total population into 6 subpopulations is reliable.

Fig. 3

Neighbor-joining tree for the 542 accessions generated using Nei’s genetic distance

Genetic differentiation among subpopulations

The average genetic differentiation index Fst among the six subpopulations was 0.36, with the Fst for each locus ranging from 0.008 for RM5479 on chromosome 12 to 0.88 for RM218 on chromosome 3. Pairwise comparisons based on Fst values can reflect the standard genetic distance between two populations [28]. Pairwise Fst values ranged from 0.26 (POP1 and POP5) to 0.42 (POP3 and POP4), and the corresponding standard genetic distance between two subpopulations ranged from 0.45 (POP1 and POP5) to 0.69 (POP3 and POP4) (Table 2). AMOVA indicated that 64.42% of the total genetic variation occurred among the subpopulations, whereas 35.58% occurred among individuals within the subpopulations (Additional file 7: Table S3). These results indicate a high degree of genetic differentiation across the six subpopulations.

Table 2 Pairwise estimates of Fst and Nei’s genetic distance among the 6 subpopulations

Linkage disequilibrium analysis

Among the 35,245 pairs of loci generated by the 266 SSR loci, 23,081 pairs showed significant LD (based on D′ value, P < 0.01), of which 1919 pairs (5.44% of all locus pairs) were intra-chromosomal pairs of SSR loci. Table 3 shows the percentage of significant LD locus pairs relative to the total number of pairwise loci in each subpopulation; POP1 was the highest (4.78%) and POP6 the lowest (3.13%). In terms of average D′ value, POP1 was the highest (0.83), followed by POP5 (0.81), while POP3 was the lowest (0.58). Further regression analysis of D′ values against the genetic distances of syntenic (intra-chromosomal) marker pairs revealed that the decay of D′ in each subpopulation followed the equation y = b ln x + c (Additional file 2: Figure S2). On this basis, the minimum distances of LD decay (D′ < 0.5) in each subpopulation were determined to be 58.08 cM (POP1), 27.75 cM (POP2), 17.57 cM (POP3), 19.23 cM (POP4), 34.05 cM (POP5) and 30.36 cM (POP6). POP3 thus exhibited the fastest decay, with the shortest decay distance, while POP1 showed the slowest decay among the six subpopulations.
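The LD decay distance can be sketched as follows: fit y = b ln x + c by least squares on ln x, then solve for the distance at which the fitted curve crosses 0.5, which gives x = exp((0.5 − c) / b). This is a minimal illustration under those assumptions; the function names and the data points are hypothetical and were generated from a known curve so the fit can be checked.

```python
import math


def fit_log_decay(distances_cm, ld_values):
    """Least-squares fit of y = b * ln(x) + c; returns (b, c)."""
    xs = [math.log(x) for x in distances_cm]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ld_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ld_values))
    b = sxy / sxx
    c = mean_y - b * mean_x
    return b, c


def decay_distance(b, c, threshold=0.5):
    """Distance (cM) at which the fitted curve falls to the threshold."""
    return math.exp((threshold - c) / b)


# Number of locus pairs from 266 markers, matching the count quoted in the text.
n_pairs = math.comb(266, 2)

# Hypothetical syntenic marker pairs lying exactly on y = -0.1 * ln(x) + 0.9.
distances = [1.0, 5.0, 10.0, 20.0, 40.0]
ld = [0.9 - 0.1 * math.log(x) for x in distances]

b, c = fit_log_decay(distances, ld)
print(n_pairs, round(b, 3), round(c, 3), round(decay_distance(b, c), 2))
```

A steeper (more negative) slope b gives a shorter decay distance, which is the sense in which POP3 decays fastest and POP1 slowest.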

Table 3 D′ of LD for pairwise SSR loci in each subpopulation

Detection of association loci

In total, thirteen SSR marker loci (with PVE > 5%) associated with SRUE were detected in both 2016 and 2017 by GLM, and two of them were also detected by MLM in both years. The 13 marker loci were distributed on all chromosomes except chromosome 11. The percentage of phenotypic variation explained by a single locus ranged from 5.03 to 12.01% in 2016 and from 5.07 to 11.98% in 2017 (Table 4). RM297 on chromosome 1 explained the most phenotypic variation (12.01% in 2016 and 11.98% in 2017), followed by RM184 on chromosome 10 at 41.6 cM (7.2% in 2016 and 7.32% in 2017); the lowest was RM5158 on chromosome 5 at 144.9 cM (5.03% in 2016 and 5.07% in 2017) (Table 4).

Table 4 SSR marker loci associated with SRUE (PVE > 5%) and percentage of phenotypic variation explained by the locus derived from 266 markers and 506 rice accessions

Among the 13 SSR association loci detected by the GLM method, RM7309 on chromosome 6 and RM434 on chromosome 9 were also detected by the MLM method (Table 4). RM7309 had a higher contribution rate (7.18% in 2016 and 7.10% in 2017) than RM434 (5.51% in 2016 and 5.52% in 2017). Compared with previous studies, 9 of the 13 loci (including RM434, detected by both GLM and MLM) are novel for SRUE (Additional file 8: Table S4).

Discovery of favorable alleles

In this study, alleles with positive effects are considered favorable alleles for SRUE. Table 5 summarizes the favorable alleles of the significant association loci and their typical carriers. In total, 23 favorable alleles with a phenotypic effect value (PEV) of more than 0.1 mg mg−1 for SRUE were detected across 506 rice accessions (Table 5). The RM297–180 bp allele on chromosome 1 showed the largest phenotypic effect (0.44 mg mg−1 in 2016 and 0.45 mg mg−1 in 2017), and 34 accessions (6.72%) carried this excellent allele, with Manyedao as the typical carrier. Fifty-eight accessions (11.46%) carried the excellent allele RM297–175 bp, with Daniaodao as a typical carrier (Additional file 6: Table S2, Table 5). The excellent allele RM184–225 bp was carried by 30 accessions (5.93%), with Yandao6 as a typical carrier. The excellent allele RM184–215 bp was carried by 51 accessions (10.08%), with Daniaoda as a typical carrier. Thirty accessions (5.93%) possessed the excellent allele RM184–205 bp, with Manyedao as a typical carrier. Nineteen accessions (3.75%) possessed the excellent allele RM7309–135 bp, which showed the smallest phenotypic effect (0.11 mg mg−1 in 2016 and 0.10 mg mg−1 in 2017), with Manyedao as a typical carrier.

Table 5 Favorable alleles, their effects and typical carriers for SRUE of the 13 loci detected across 506 rice accessions in 2016 and 2017 (listed in descending order of phenotypic effect values)

Excellent combination designs for improving SRUE

Favorable alleles carried by the superior parents for SRUE and the corresponding phenotypic effects are summarized in Table 6. According to the phenotypic values and the number of favorable alleles that could be substituted or pyramided into an individual plant, the top 5 cross combinations predicted for SRUE and the corresponding phenotypic increments (%) are listed in Table 7. For example, crossing Yue40 × Manyedao is predicted to pyramid thirteen favorable alleles into a single genotype, leading to a 0.16 mg mg−1 increase in SRUE (Table 7). Certain accessions (for example, Daniaodao) were found repeatedly in the proposed parental combinations, indicating that these accessions possess unique favorable alleles. Fig. 4 shows the seed phenotypes of the superior parents and Fig. 5 shows the 10-day-old etiolated seedlings of the superior parents (Daniaodao, Manyedao, Suwujing, Yue 40 and Baimangnuo).

Table 6 Favorable alleles carried by the superior parents for SRUE and corresponding phenotypic effect
Table 7 Prediction of optimal parental combinations, favorable allele number and increment for SRUE after pyramiding
Fig. 4

Unhulled grains (top) and brown rice (bottom) of the favorable parents for improving SRUE. Bar, 10 mm

Fig. 5

Ten-day-old etiolated seedlings of the favorable parents for improving SRUE. Bar, 10 mm

Difference of seedling establishment rates between accessions with high and low SRUE in soil condition

An experiment in soil was conducted to confirm that accessions with higher SRUE in the growth chamber also have a higher seedling establishment rate (SER) under soil cultivation. In the soil trial, 42 selected accessions were divided into two groups: accessions with high SRUE values (n = 22) and accessions with low SRUE values (n = 20). The seeds were sown and kept under close observation for 15 days. The number of established seedlings was recorded at the end of the trial period and SER (%) was calculated. The high SRUE group had a numerically higher SER (%) than the low SRUE group. To determine whether the effect of SRUE on SER was significant, an independent-samples t-test was conducted. Table 8 shows that there was a significant difference (P < 0.01) in SER between the high SRUE group (71.28 ± 4.22) and the low SRUE group (43.15 ± 1.54); t(27) = 29.23, P < 0.001. Therefore, accessions with high SRUE had significantly higher SER than accessions with low SRUE, indicating that higher SRUE improved SER. Fig. 6 shows the mean and the 95% confidence intervals for SER.
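The reported test statistic can be checked against the group summaries with a short sketch. It rests on two assumptions that the text does not state: that the ± values are standard deviations, and that an unequal-variance (Welch) t-test was used. Under those assumptions the summary statistics give a t value of about 29.2 with about 27 degrees of freedom, close to the reported t(27) = 29.23.

```python
import math


def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom
    computed from group means, standard deviations and sample sizes."""
    v1 = sd1 ** 2 / n1  # squared standard error of group 1
    v2 = sd2 ** 2 / n2  # squared standard error of group 2
    t = (mean1 - mean2) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


# Group summaries from the text; treating "±" as a standard deviation is an assumption.
t_stat, df = welch_t(71.28, 4.22, 22, 43.15, 1.54, 20)
print(round(t_stat, 2), round(df, 1))
```

The small difference from the published value is what rounding of the reported means and standard deviations would be expected to produce.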

Table 8 Comparison of SER (%) between high and low SRUE (mg mg−1) groups in the soil experiment
Fig. 6

Bar graph of SER (%) with 95% confidence intervals


There were large variations in SRUE in the natural rice population used in this study. This is related to the wide geographic distribution of the accessions, which were selected from 17° N in Vietnam to 54° N in northeast China, spanning 37° of latitude. The large variations in SRUE are also related to the range of accession types, which included local varieties, modern bred varieties, tall early-maturing varieties, and high-quality late-maturing varieties. In addition, the two-year generalized heritability for SRUE was greater than 95%, indicating that variation in the trait was mainly controlled by genes and little affected by the environment. Therefore, molecular marker-assisted selection can be used to improve SRUE for wet direct seeding.

In the soil trial, there was a significant difference in SER (%) between the high and low SRUE groups at P = 0.01 (Table 8). The results indicate that accessions with high SRUE in the growth chamber experiment had higher SER (%) under soil conditions than accessions with low SRUE. This suggests that SRUE is an important trait for seedling establishment. Although the soil trial is vital in confirming the ability of accessions to emerge in the field, the growth chamber trial is a simpler and more direct method for crop breeders to screen germplasm for SRUE.

Population genetic structure is a substantive element in association studies that focus on traits important in local adaptation or under diversifying selection with recent co-ancestry [29]. Using STRUCTURE software and the neighbor-joining method, the population used was divided into six subpopulations tied to geographical origin. For example, POP1 accessions were from Jiangsu Province, China, and Vietnam, while POP2 accessions were mainly modern cultivars bred in north-central Jiangsu. This agreement between the genetic background and predefined clusters suggests that knowledge of the ancestral background can facilitate the choice of parental lines in rice breeding programs [11, 13].

The accessions in the natural population have experienced geographical isolation, so subpopulations with their own genetic composition, and genetic differentiation within the total population, are to be expected. Fst, the fixation index, measures how far the genotype frequencies in a population deviate from those expected under genetic equilibrium. Fst can therefore be used to compare the genetic differentiation between two subpopulations and to identify genetic differences among varieties. In this study, the Fst value and the genetic distance between POP3 and POP4 were the largest among all pairs of subpopulations. Agrama et al. (2007) [13] confirmed that markers with higher Fst values have greater resolving power and produce more consistent genetic distance estimates, and that a significant Fst among subpopulations represents a real difference between them. Therefore, hybridization between genetically differentiated subpopulations could be used to improve the trait value.

Genome-wide analysis of the genetic diversity of 506 rice accessions using 266 SSR markers showed that 74% of the marker loci had a genetic diversity value larger than 0.7, with an average of 0.74. This is higher than the 0.64 of Borba et al. (2009) [30], the 0.73 of Dang et al. (2014, 2015) [20, 24] and the 0.53 of Liu et al. (2015) [21], but lower than the 0.75 of Li et al. (2012) [16]. The average polymorphism information content was 0.71, which is higher than the 0.37 of Ordonez et al. (2010) [31], the 0.48 of Liu et al. (2015) [21] and the 0.70 of Dang et al. (2015) [20]; similar to Dang et al. (2014) [24] and Li et al. (2012) [16]; and lower than the 0.75 of Borba et al. (2009) [30]. More than 56% of the marker loci showed more than 10 alleles, with the average number of alleles per locus equal to 10.82, ranging from 2 (RM437 on chromosome 5, RM7163 on chromosome 11) to 38 (RM3428 on chromosome 11). The number of alleles per locus in our study was higher than the 2.5 reported by Vanniarajan et al. (2012) [17], the 9.93 of Liu et al. (2015) [21], the 10.52 of Dang et al. (2014) [24] and the 10.40 of Dang et al. (2015) [20], and lower than the 12.86 reported by Borba et al. (2009) [30]. This variation may be due to the fact that the materials in the present study span a wide geographical area stretching from north-central Vietnam to the northeastern part of China. Under different climates and geographical conditions, the natural population has experienced long-term natural selection and evolution, as well as different cultivation and management practices, and has accumulated a high degree of genetic variation and a rich genetic background.

Linkage disequilibrium (LD) is the basis of association analysis. The decay distances of POP2, POP3 and POP4 (27.75 cM, 17.57 cM and 19.23 cM, respectively) were consistent with the decay distances of 10–30 cM reported by Vanniarajan et al. (2012) [17]. The decay distances of the other subpopulations ranged from 30 cM to 60 cM. The extent of LD decay has been reported in rice [13, 17, 24, 32,33,34,35], but the results differ considerably. For example, Olsen et al. (2006) [35] and Mather et al. (2007) [36] detected LD decay distances of less than 1 cM using DNA sequence data, whereas Jin et al. (2010) [37] detected LD decay distances of 25–50 cM using SSR markers. This difference is believed to be related to different genetic regions, different rice varieties and different markers [34, 36]. The factors that affect the decay rate of LD thus include population size, population source, number of loci and artificial selection. Based on the LD decay range in this population, genome-wide LD mapping is possible. In this study, the distances of LD decay in the 6 subpopulations ranged from 17.57 cM to 58.08 cM (Additional file 2: Figure S2). This suggests that 266 SSR markers are enough to detect significant loci associated with phenotypic variation of SRUE in GWAS. However, to detect more significant loci for SRUE with higher reliability, it would be important to increase marker density and population size in future experiments.

Association mapping helps to utilize the genetic variation in natural populations [38]. However, population genetic structure and unequal relatedness among individuals can increase false discoveries and lead to spurious associations. GLM considers only the Q matrix generated from the population structure analysis, whereas MLM accounts for both population structure and kinship (genetic relatedness among individuals); GLM therefore generally detects a higher number of significant marker-trait associations than MLM [39]. MLM, in turn, is more accurate in declaring associations; it has a statistical advantage and has been reported to detect more true associations than GLM [40]. In the current study, thirteen loci were found to be significantly associated with SRUE (PVE > 5%) and 23 favorable alleles (PEV > 0.1 mg mg−1) were detected in two years (Table 4 and Table 5).
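For orientation, the two models are commonly written in the following form. This is the usual Q and Q + K formulation, given here as an assumption about the general approach rather than as the exact specification used in the analysis; all symbols are introduced for illustration only.

$$ y= X\beta + Qv+ e $$

$$ y= X\beta + Qv+ Zu+ e,\kern1em \mathrm{Var}(u)=2{K}_{kin}{\sigma}_g^2,\kern1em \mathrm{Var}(e)=I{\sigma}_e^2 $$

Here y is the vector of phenotypes (SRUE), β the marker effects, v the effects of subpopulation membership (columns of the Q matrix), u the polygenic background effects with covariance proportional to the kinship matrix K_kin, and e the residual. The first equation is the GLM with Q as a covariate; the second adds the random term Zu, which absorbs relatedness among individuals and is the reason the MLM typically declares fewer associations.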

RM297 on chromosome 1 explained the most phenotypic variation. Of the 506 accessions, 34 (6.72%) carried the excellent allele RM297–180 bp, which had the largest phenotypic effect (0.44 mg mg−1 in 2016 and 0.45 mg mg−1 in 2017), with Manyedao as the typical carrier. Fifty-eight accessions (11.46%) carried the excellent allele RM297–175 bp, with Daniaodao as a typical carrier. RM184 on chromosome 10 (41.6 cM) followed: 30 (5.93%) and 51 (10.08%) accessions carried the excellent alleles RM184–225 bp and RM184–215 bp, respectively, with Yandao6 and Daniaoda as the typical carriers. In previous studies, Cheng et al. (2013) detected the qSRUE1 interval (41,166,774–43,043,114 bp), about 10 Mb from RM128 (30,737,705–30,737,861 bp). The interval of qSRUE4.1 (688,353–2,030,305 bp) is about 4 Mb from RM3471 (6,310,055–6,310,203 bp), and the interval of qSRUE4.2 (2,030,135–8,067,386 bp) includes RM3471. The interval of qSRUE6 (28,149,879 bp) is about 2 Mb from RM7309 (26,297,238–26,297,595 bp) on chromosome 6 [9]. RM297 (32,099,566–32,099,760 bp) on chromosome 1 was identified by Cairns et al. (2009) as related to shoot length [41]. RM525 on chromosome 2 is located in a region (28,292,005–28,292,040 bp) in which a QTL for seedling dry weight was detected by Han et al. (2007) [42]. RM232 on chromosome 3 is located in a region (15,644,275–15,646,800 bp) in which QTLs for germination rate, seed weight, shoot length and root length have been detected in different studies [43,44,45]. RM434 on chromosome 9 is located in a region (15,662,573–15,662,838 bp) in which QTLs for seedling dry weight have been detected in different studies [9, 43]. These results confirm the close relationship of seed and seedling traits with SRUE.

In addition, SRUE could be enhanced by the crosses listed in Table 7, which shows combinations of accessions with complementary allelic variation at different loci that can be selected as hybridization parents. The results of the current study provide basic marker and accession information for breeding cultivars suitable for mechanized wet direct seeding.


There is abundant phenotypic variation for SRUE and abundant molecular marker allele diversity among the 542 accessions used. Twenty-three favorable alleles for SRUE were detected across 2 years. Daniaodao, Manyedao, Suwujing, Yue 40 and Baimangnuo are the 5 typical carrier accessions possessing the favorable alleles. These accessions could be used to improve SRUE for mechanized direct seeding.


Plant materials

The tested materials were 542 rice accessions [46], 121 of which were from Vietnam (Indica), while the remaining accessions were from China (Tej). The accessions originate from 17° N to 54° N and 102° E to 135° E, spanning 37° of latitude from north to south and 33° of longitude from east to west (Additional file 6: Table S2).

Field planting

All seeds of the tested materials were sown in the seedling nursery of the paddy fields at Jiangpu Experiment Station, Nanjing Agricultural University, in mid-May 2016 and transplanted in mid-June. For each variety, four rows were transplanted. Each row had 8 hills with a spacing of 17 cm × 20 cm. Conventional field management practices were applied as recommended. In 2017, the dates of sowing and transplanting and the field management practices were identical to those in 2016.

Phenotypic data collection (the growth chamber test)

Seeds of the natural population were harvested from the middle row of the plot at maturity stage and placed in a 50 °C oven for 72 h to break dormancy. The SRUE experiment was conducted in two replications for each season.

Fifty healthy seeds of equal size, fullness and color were weighed to obtain the fresh weight (FW), then dried at 104 °C for 24 h to obtain the dry weight (DW). The water content (WC) was calculated using the following formula:

$$ WC=\frac{FW- DW}{DW} $$

The initial seed dry weight (ISDW) was then calculated using the following formula

$$ ISDW=\frac{FW}{1+ WC} $$

SRUE was determined following the method described by Soltani et al. (2006) [2] and Cheng et al. (2013) [9] with minor modifications. Fifty seeds of each accession were lined up on a sheet of filter paper 30 cm × 45 cm in size (Additional file 3: Figure S3a). The seeds were covered with two layers of moist filter paper, and the papers were rolled up and secured with a rubber band (Additional file 3: Figure S3b). One end of the paper roll was covered with a self-sealing plastic bag and the other end was placed vertically in a plastic box (45.5 cm × 31.5 cm × 15 cm) containing tap water to a depth of 10 cm (Additional file 3: Figure S3c). The plastic boxes were put in a growth chamber (GXZ and RXZ intelligent light incubator, Ningbo science and technology park, new Jiangnan instrument Co., Ltd., Ningbo, China) for germination in complete darkness at 30 °C for 10 days. During germination, tap water was added to the plastic boxes to keep the paper rolls moist. After 10 days, the etiolated seedlings (Additional file 3: Figure S3d) were separated into two parts, one comprising the shoot and root and the other the seed remnant (Additional file 3: Figure S3e). Each part was dried at 105 °C for at least 24 h to obtain the constant seedling dry weight (SDW) and the remnant seed dry weight (RSDW) (Additional file 3: Figure S3f). The following parameters were calculated based on the formulas described by Cheng et al. (2013) [9].

The weight of mobilized seed reserve (WMSR)

$$ WMSR= ISDW- RSDW $$

where ISDW is the initial seed dry weight and RSDW is the remnant seed dry weight.

Seed reserve utilization efficiency (SRUE)

$$ SRUE=\frac{SDW}{WMSR} $$
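The calculation chain can be sketched in a few lines. The sketch assumes that the weight of mobilized seed reserve is the initial seed dry weight minus the remnant seed dry weight, as in the Soltani et al. method cited above, and that the fresh weight used for ISDW is that of the seeds actually sown; the function name, argument names and all weights are hypothetical.

```python
def srue_from_weights(fw_sample, dw_sample, fw_sown, sdw, rsdw):
    """Seed reserve utilization efficiency from measured weights (same unit throughout).

    fw_sample, dw_sample: fresh and oven-dry weight of the seed sample used for water content
    fw_sown: fresh weight of the seeds placed in the paper roll (assumed input)
    sdw: seedling (shoot + root) dry weight after germination
    rsdw: remnant seed dry weight after germination
    """
    wc = (fw_sample - dw_sample) / dw_sample   # water content on a dry-weight basis
    isdw = fw_sown / (1.0 + wc)                # initial seed dry weight
    wmsr = isdw - rsdw                         # mobilized seed reserve (assumed definition)
    if wmsr <= 0:
        raise ValueError("no seed reserve was mobilized; SRUE is undefined")
    return sdw / wmsr


# Hypothetical weights in grams for one accession.
example = srue_from_weights(fw_sample=1.13, dw_sample=1.00,
                            fw_sown=1.13, sdw=0.30, rsdw=0.40)
print(round(example, 2))
```

With these illustrative weights, 0.6 g of reserve is mobilized and 0.3 g ends up in seedling tissue, giving an SRUE of 0.5, that is, 0.5 mg of seedling dry matter per mg of mobilized reserve.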

Marker genotype identification

Leaves of each accession in the natural population were collected 3 months after germination, and total DNA was extracted using the method described by Murray and Thompson (1980) [47]. The marker genotype of each accession was identified using 266 pairs of SSR markers covering the 12 chromosomes of rice. The DNA sequence information of the 266 primer pairs was obtained from the rice genome database, and the primers were synthesized by Shanghai Jierui Biology Co., Shanghai, China.

Each 10 μL PCR reaction contained 1 μL template DNA (20 ng μL−1), 0.7 μL forward primer (2 pmol μL−1), 0.7 μL reverse primer (2 pmol μL−1), 1 μL 10× buffer (MgCl2-free), 0.2 μL dNTPs (2.5 mmol L−1), 0.6 μL MgCl2 (25 mmol L−1), 0.1 μL Taq polymerase (5 U μL−1) and 6.4 μL ddH2O. Reactions were run on a PTC-100 Peltier Thermal Cycler (MJ Research Inc., USA) with the following program: (1) initial denaturation at 94 °C for 5 min; (2) 34 cycles of denaturation at 94 °C for 0.5 min, annealing at 55–61 °C (depending on the primer used) for 1 min, and extension at 72 °C for 1 min; and (3) a final extension at 72 °C for 10 min. The PCR products were separated on an 8.0% polyacrylamide gel (PAG), with a 100 bp DNA ladder as the size standard. Electrophoresis was carried out in 0.5× TBE buffer at a constant 180 V, and the bands were visualized by silver staining. DNA fragments of different sizes amplified by the same pair of SSR primers were regarded as allelic variants of that primer pair and were sized using Quantity One software.

Population genetic structure and phylogenesis

The genetic clusters of the 542 accessions were identified using STRUCTURE version 2.2 [48]. Five independent runs were performed for each K (K from 2 to 10). The burn-in period was set to 50,000 iterations, followed by 100,000 Markov Chain Monte Carlo (MCMC) replicates. The mean log-likelihood value over the five runs at each K was used. If the mean log-likelihood value was positively correlated with the model parameter K, the optimal K value was determined through an ad hoc statistic (∆K) based on the rate of change in LnP(D) between successive K values [26]. Non-admixed individuals in each genetic group were identified using a Q-matrix assignment greater than 0.9. PowerMarker version 3.25 [49] was used to determine the number of alleles per locus, major allele frequency, genetic diversity per locus, and polymorphism information content (PIC) per locus. Genetic distance was calculated from the 266 molecular markers using Nei’s distance [27], and phylogenetic reconstruction was performed using the neighbor-joining method as implemented in PowerMarker, with the tree viewed in MEGA 4.0 [50]. Locus-by-locus analysis of molecular variance (AMOVA) [51], based on the genetic groups delimited by the Bayesian clustering method, was performed in Arlequin 3.5 [52] to statistically verify the structure using SSR and standard multi-locus frequency data. The genetic differentiation coefficient (Fst) between subpopulations was calculated in Arlequin 3.5 using the method proposed by Weir and Hill (2002) [53].
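The ∆K statistic mentioned above can be illustrated with a short Python sketch. It follows one common way of computing the statistic from STRUCTURE output: the absolute second-order difference of the mean LnP(D) across successive K values, divided by the standard deviation of LnP(D) among runs at K. The function name and the log-likelihood values are hypothetical; they are not results from this study, and this is not a reproduction of the exact procedure the authors ran.

```python
from statistics import mean, stdev


def evanno_delta_k(lnp_runs):
    """Return {K: delta_K} for every K that has both neighbours K-1 and K+1.

    lnp_runs maps each K to the list of LnP(D) values from independent runs.
    delta_K = |mean L(K+1) - 2 * mean L(K) + mean L(K-1)| / sd(L(K)).
    """
    means = {k: mean(values) for k, values in lnp_runs.items()}
    delta = {}
    for k in sorted(lnp_runs):
        if (k - 1) not in means or (k + 1) not in means:
            continue  # the second-order difference is undefined at the ends
        sd = stdev(lnp_runs[k])
        if sd == 0:
            continue  # avoid division by zero when all runs agree exactly
        delta[k] = abs(means[k + 1] - 2 * means[k] + means[k - 1]) / sd
    return delta


# Hypothetical LnP(D) values from three runs per K (illustrative only).
runs = {
    2: [-1000, -1002, -998],
    3: [-900, -901, -899],
    4: [-890, -892, -888],
    5: [-885, -886, -884],
}
delta_k = evanno_delta_k(runs)
best_k = max(delta_k, key=delta_k.get)
print(delta_k, "optimal K:", best_k)
```

The K with the largest ∆K is taken as the most likely number of clusters; the statistic cannot be evaluated for the smallest and largest K tested.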

Linkage disequilibrium

Linkage disequilibrium (LD) between loci was analyzed with TASSEL 2.1 software using 100,000 permutations [54], on all accessions and on the subpopulations generated by STRUCTURE. An LD decay plot was drawn to examine the relationship between LD and the genetic distance of syntenic (intra-chromosome) marker pairs.
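To make the LD measure concrete, the sketch below computes D′ and r² for a pair of loci in the simplest case of two alleles per locus. This is a simplifying assumption: SSR loci are multi-allelic, and the way TASSEL aggregates over multiple alleles is not reproduced here. The function name and the frequencies are hypothetical.

```python
def ld_biallelic(p_ab: float, p_a: float, p_b: float):
    """Return (D', r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A at locus 1 and B at locus 2
    p_a, p_b: frequencies of allele A and allele B
    """
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b  # raw disequilibrium coefficient
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r_squared = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d_prime, r_squared


# Hypothetical frequencies (illustrative only).
d_prime, r_squared = ld_biallelic(p_ab=0.4, p_a=0.5, p_b=0.5)
print(f"D' = {d_prime:.2f}, r^2 = {r_squared:.2f}")
```

An LD decay plot then places such pairwise values on the vertical axis against the genetic distance between the two markers on the horizontal axis.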

Phenotypic data analysis and heritability in a broad sense

Analysis of variance (ANOVA) was run to estimate the genotypic and environmental variances for the traits measured, using EXCEL 2013 software and the SAS package (SAS Institute Inc., Cary, NC, USA). Heritability in the broad sense (\( {H}_B^2 \)) was computed for the natural population using the following equation:

$$ {H}_B^2={\sigma}_g^2/\left({\sigma}_g^2+{\sigma}_e^2/n\right) $$

where \( {\sigma}_g^2 \) is the genetic variance, \( {\sigma}_e^2 \) is the error variance, and \( n \) is the number of replicates.
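The heritability equation can be sketched in Python as follows. The snippet assumes a balanced one-way ANOVA in which the genetic variance is estimated as (MS_genotype − MS_error)/n and the error variance as MS_error; this layout, the function names and the mean squares are assumptions made for illustration and are not taken from the study.

```python
def variance_components(ms_genotype: float, ms_error: float, n_reps: int):
    """Estimate (sigma_g^2, sigma_e^2) from ANOVA mean squares (balanced design assumed)."""
    sigma_g2 = max(0.0, (ms_genotype - ms_error) / n_reps)  # negative estimates are set to 0
    sigma_e2 = ms_error
    return sigma_g2, sigma_e2


def broad_sense_heritability(sigma_g2: float, sigma_e2: float, n_reps: int) -> float:
    """H_B^2 = sigma_g^2 / (sigma_g^2 + sigma_e^2 / n)."""
    return sigma_g2 / (sigma_g2 + sigma_e2 / n_reps)


# Hypothetical mean squares with three replicates (illustrative only).
sigma_g2, sigma_e2 = variance_components(ms_genotype=0.042, ms_error=0.006, n_reps=3)
h2 = broad_sense_heritability(sigma_g2, sigma_e2, n_reps=3)
print(f"sigma_g^2 = {sigma_g2:.3f}, sigma_e^2 = {sigma_e2:.3f}, H_B^2 = {h2:.3f}")
```

Dividing the error variance by the number of replicates reflects that heritability is expressed on an entry-mean basis.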

Association mapping

The associations between the trait and the markers were analyzed with both a general linear model (GLM) and a mixed linear model (MLM) using TASSEL 3.0 software [54]. The Q matrix obtained from the STRUCTURE 2.2 analysis was used as a covariate in the GLM analysis, while the Q and K matrices were used as covariates in the MLM analysis [24]. The K matrix (kinship matrix) was obtained from the relatedness analysis performed with SPAGeDi software [55]. A false discovery rate (FDR) of 0.01 was used as the threshold for significant associations, according to the correction method published by Benjamini and Hochberg (1995) [56]. For each association locus identified, the “null allele” (non-amplified allele) was used as the reference to determine the phenotypic effects of the other alleles [12]. The formula used for calculating the phenotypic effect of a single allele was

$$ {a}_i=\sum_j {x}_{ij}/{n}_i-\sum_k {N}_k/{n}_k $$

where \( {a}_i \) is the phenotypic effect of allele i; \( {x}_{ij} \) is the phenotypic value of variety j carrying allele i; \( {n}_i \) is the number of varieties carrying allele i; \( {N}_k \) is the phenotypic value of variety k carrying the null allele; and \( {n}_k \) is the number of varieties carrying the null allele. In the present study, marker loci with PVE > 5% were considered for further analysis. Varieties with higher phenotypic values together with the selected marker loci were analyzed to determine favorable alleles and their carrier accessions.
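Two calculations in this subsection lend themselves to a short illustration: the Benjamini and Hochberg step-up procedure used for the FDR threshold, and the allele effect defined as the mean phenotype of carriers minus the mean phenotype of null-allele carriers. The Python sketch below is a plain re-implementation under stated assumptions; the function names, p-values, allele labels and phenotypic values are hypothetical and are not data from this study.

```python
from statistics import mean


def benjamini_hochberg(p_values, fdr=0.01):
    """Return a list of booleans marking the tests declared significant at the given FDR."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        # keep the largest rank whose p-value lies under the step-up threshold
        if p_values[idx] <= rank / m * fdr:
            cutoff_rank = rank
    significant = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            significant[idx] = True
    return significant


def allele_effects(records, null_allele="null"):
    """Return {allele: a_i}, where a_i = mean(carriers of allele i) - mean(null-allele carriers).

    records is a list of (allele, phenotypic value) pairs for one marker locus.
    """
    by_allele = {}
    for allele, value in records:
        by_allele.setdefault(allele, []).append(value)
    if null_allele not in by_allele:
        raise ValueError("no variety carries the null allele at this locus")
    null_mean = mean(by_allele[null_allele])
    return {a: mean(v) - null_mean for a, v in by_allele.items() if a != null_allele}


# Hypothetical marker p-values (illustrative only).
print(benjamini_hochberg([0.0001, 0.004, 0.0019, 0.03, 0.5], fdr=0.01))

# Hypothetical SRUE values for varieties grouped by allele size in bp (illustrative only).
locus = [("180", 0.9), ("180", 0.8), ("175", 0.6), ("null", 0.5), ("null", 0.4)]
effects = allele_effects(locus)
favorable = [allele for allele, effect in effects.items() if effect > 0]
print(effects, favorable)
```

Under this definition, an allele with a positive effect raises SRUE relative to the null allele and would be a candidate favorable allele.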

Difference of seedling establishment rates in soil condition

Twenty-two varieties with high SRUE values and 20 varieties with low SRUE values were selected to confirm, through soil cultivation, the results obtained in the growth chamber. Fifty healthy seeds of each variety were germinated under room conditions using the paper towel method, and only sprouted seeds were used for the soil cultivation (Additional file 4: Figure S4).

The soil cultivation experiments were conducted in plastic cups (12 cm height × 9 cm diameter) with drainage holes 2 mm in diameter at the bottom. The cups were filled with 11 cm of soil, and tap water was added to saturate the soil. Thirty sprouted seeds of each variety were laid out on the surface and covered with 1 cm of soil. The cups were submerged under 2 cm of water in plastic boxes (45.5 cm × 31.5 cm × 15 cm) and the seedlings were left to grow for 15 days under these soil conditions. A plastic cover was used to protect the germinated seeds from birds and rain splash damage. The experiment was conducted with three replications.

Out of the 30 sprouted seeds, the number of established seedlings was counted, and the percentage of seedling establishment was calculated using the following formula described by Islam et al. (2014) [57]:

$$ Seedling\ establishment\ rate\left(\%\right)=\frac{Number\ of\ established\ seedlings}{Total\ number\ of\ sprouted\ seeds}\times 100 $$
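A minimal Python sketch of this calculation, averaged over the three replications, is given below. The function names and the seedling counts are hypothetical and are not data from this study.

```python
from statistics import mean


def establishment_rate(established: int, sown: int = 30) -> float:
    """Seedling establishment rate (%) = established seedlings / sprouted seeds sown x 100."""
    if sown <= 0 or not 0 <= established <= sown:
        raise ValueError("counts must satisfy 0 <= established <= sown and sown > 0")
    return 100 * established / sown


def mean_establishment_rate(established_per_replicate, sown: int = 30) -> float:
    """Average the rate over replicates of one variety."""
    return mean(establishment_rate(count, sown) for count in established_per_replicate)


# Hypothetical counts of established seedlings in three replicates of 30 seeds each.
print(f"{mean_establishment_rate([27, 24, 25]):.2f} %")
```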

Availability of data and materials

The names and geographical origins of the rice accessions are available in Additional file 6: Table S2.


  1. All the rice seeds used in this research were collected during long-term rice science studies and properly kept in our State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University. Accession numbers 1–542 were selected from our previous studies on rice grain sizes and weight (Rf.





Abbreviations

dNTP: Deoxyribonucleoside triphosphate

DW: Dry weight

EDTA: Disodium ethylenediaminetetra-acetate

FDR: False discovery rate

FW: Fresh weight

GLM: General linear model

\( {H}_B^2 \): Heritability in the broad sense

ISDW: Initial seed dry weight

LD: Linkage disequilibrium

MCMC: Markov Chain Monte Carlo

MLM: Mixed linear model

PCR: Polymerase chain reaction

PIC: Polymorphic information content

QTL: Quantitative trait loci

RSDW: Remaining seed dry weight

SDS: Sodium dodecyl sulfate

SDW: Seedling dry weight

SRUE: Seed reserve utilization efficiency

SSR: Simple sequence repeat

Taq: Thermus aquaticus DNA polymerase

TBE: Tris/borate electrophoresis buffer

Tris: Tris(hydroxymethyl)aminomethane

WC: Water content

WMSR: Weight of the mobilized seed reserve


References

  1. FAO. International Year of Rice 2004: Gender and rice fact sheet 2004. Available from:

  2. Soltani A, Gholipoor M, Zeinali E. Seed reserve utilization and seedling growth of wheat as affected by drought and salinity. Environ Exp Bot. 2006;55(1):195–200.

  3. Mohammadi H, Soltani A, Sadeghipour HR, Zeinali E. Effects of seed aging on subsequent seed reserve utilization and seedling growth in soybean. Int J Plant Prod. 2012;5(1):65–70.

  4. Kennedy PG, Hausmann NJ, Wenk EH, Dawson TE. The importance of seed reserves for seedling performance: an integrated approach using morphological, physiological, and stable isotope techniques. Oecologia. 2004;141(4):547–54.

  5. Hasan MA, Ahmed JU. Evaluation of seed reserve utilization efficiency during germination in relation to heat tolerance of wheat. Thai J Agric Sci. 2012;45(1):29–36.

  6. Sikder S, Hasan MA, Hossain MS. Germination characteristics and mobilization of seed reserves in maize varieties as influenced by temperature regimes. J Agric Rural Dev. 2010;7(1):51–8.

  7. Cheng XX, He S, Geng GH. Dynamic QTL analysis of seed reserve utilization in sh2 sweet corn germination stages. Genet Mol Res. 2016;15.

  8. Roghayyeh S, Saeede R, Omid A, Mohammad S. The effect of salicylic acid and gibberellin on seed reserve utilization, germination and enzyme activity of sorghum (Sorghum bicolor L.) seeds under drought stress. J Stress Physiol Biochem. 2014;10(1).

  9. Cheng X, Cheng J, Huang X, Lai Y, Wang L, Du W, et al. Dynamic quantitative trait loci analysis of seed reserve utilization during three germination stages in rice. PLoS One. 2013;8(11):e80002.

  10. Cheng J, Cheng X, Wang L, He Y, An C, Wang Z, et al. Physiological characteristics of seed reserve utilization during the early seedling growth in rice. Brazilian J Bot. 2015;38(4):751–9.

  11. Garris AJ, Tai TH, Coburn J, Kresovich S, McCouch S. Genetic structure and diversity in Oryza sativa. Genetics. 2005;169(3):1631.

  12. Breseghello F, Sorrells ME. Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics. 2006;172(2):1165–77.

  13. Agrama HA, Eizenga GC, Yan W. Association mapping of yield and its components in rice cultivars. Mol Breed. 2007;19(4):341–56.

  14. Huang X, Wei X, Sang T, Zhao Q, Feng Q, Zhao Y, et al. Genome-wide association studies of 14 agronomic traits in rice landraces. Nat Genet. 2010;42:961.

  15. Zhao K, Tung CW, Eizenga GC, Wright MH, Ali ML, Price AH, et al. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat Commun. 2011;2:467.

  16. Li J, Lindqvist-Kreuze H, Tian Z, Liu J, Song B, Landeo J, et al. Conditional QTL underlying resistance to late blight in a diploid potato population. Theor Appl Genet. 2012;124(7):1339–50.

  17. Vanniarajan C, Vinod KK, Pereira A. Molecular evaluation of genetic diversity and association studies in rice (Oryza sativa L.). J Genet. 2012;91(1):9–19.

  18. Morris GP, Ramu P, Deshpande SP, Hash CT, Shah T, Upadhyaya HD, et al. Population genomic and genome-wide association studies of agroclimatic traits in sorghum. Proc Natl Acad Sci U S A. 2013;110(2):453.

  19. Zhang Z, Liu Z, Cui Z, Hu Y, Wang B, Tang J. Genetic analysis of grain filling rate using conditional QTL mapping in maize. PLoS One. 2013;8(2):e56344.

  20. Dang X, Giang Tran Thi T, Mawuli Edzesi W, Liang L, Liu Q, Liu E, et al. Population genetic structure of Oryza sativa in east and southeast Asia and the discovery of elite alleles for grain traits. Sci Rep. 2015;5:11254.

  21. Liu E, Liu X, Zeng S, Zhao K, Zhu C, Liu Y, et al. Time-course association mapping of the grain-filling rate in rice (Oryza sativa L.). PLoS One. 2015;10(3):e0119959.

  22. Yang W, Guo Z, Huang C, Wang K, Jiang N, Feng H, et al. Genome-wide association study of rice (Oryza sativa L.) leaf traits with a high-throughput leaf scorer. J Exp Bot. 2015;66(18):5605–15.

  23. Cui D, Xu C, Tang C, Yang C, Yu T, Xx A, et al. Genetic structure and association mapping of cold tolerance in improved japonica rice germplasm at the booting stage. Euphytica. 2013;193(3):369–82.

  24. Dang X, Thi TGT, Dong G, Wang H, Edzesi WM, Hong D. Genetic diversity and association mapping of seed vigor in rice (Oryza sativa L.). Planta. 2014;239(6):1309–19.

  25. Rebolledo MC, Dingkuhn M, Courtois B, Gibon Y, Clément-Vidal A, Cruz DF, et al. Phenotypic and genetic dissection of component traits for early vigour in rice using plant growth modelling, sugar content analyses and association mapping. J Exp Bot. 2015;66(18):5555–66.

  26. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol. 2005;14(8):2611–20.

  27. Nei M, Tajima F, Tateno Y. Accuracy of estimated phylogenetic trees from molecular data. J Mol Evol. 1983;19(2):153–70.

  28. Li Z, Nelson R. Genetic diversity among soybean accessions from three countries measured by RAPDs. Crop Sci. 2001;41:1337–47.

  29. Nordborg M, Weigel D. Next-generation genetics in plants. Nature. 2008;456:720.

  30. de Oliveira Borba TC, Brondani RPV, Rangel PHN, Brondani C. Microsatellite marker-mediated analysis of the EMBRAPA rice core collection genetic diversity. Genetica. 2009;137(3):293–304.

  31. Ordonez SA Jr, Silva J, Oard JH. Association mapping of grain quality and flowering time in elite japonica rice germplasm. J Cereal Sci. 2010;51(3):337–43.

  32. Garris AJ, McCouch SR, Kresovich S. Population structure and its effect on haplotype diversity and linkage disequilibrium surrounding the xa5 locus of rice (Oryza sativa L.). Genetics. 2003;165(2):759–69.

  33. Agrama HA, Eizenga GC. Evaluation of linkage disequilibrium in rice and its wild relatives. Proceedings of the XIV Annual International Plant & Animal Genome Conference. 2006;14.

  34. Agrama HA, Eizenga GC. Molecular diversity and genome-wide linkage disequilibrium patterns in a worldwide collection of Oryza sativa and its wild relatives. Euphytica. 2008;160(3):339–55.

  35. Olsen KM, Caicedo AL, Polato N, McClung A, McCouch S, Purugganan MD. Selection under domestication: evidence for a sweep in the rice waxy genomic region. Genetics. 2006;173(2):975–83.

  36. Mather KA, Caicedo AL, Polato NR, Olsen KM, McCouch S, Purugganan MD. The extent of linkage disequilibrium in rice (Oryza sativa L.). Genetics. 2007;177(4):2223.

  37. Jin L, Lu Y, Xiao P, Sun M, Corke H, Bao J. Genetic diversity and population structure of a diverse set of rice germplasm for association mapping. Theor Appl Genet. 2010;121(3):475–87.

  38. Zhu C, Gore M, Buckler ES, Yu J. Status and prospects of association mapping in plants. Plant Genome. 2008;1(1):5–20.

  39. Neumann K, Kobiljski B, Denčić S, Varshney RK, Börner A. Genome-wide association mapping: a case study in bread wheat (Triticum aestivum L.). Mol Breed. 2011;27(1):37–58.

  40. Mikic S, Kondicspika A, Brbaklic L, Stanisavljevic D, Trkulja D, Tomicic M, et al. Multiple marker-traits associations for maize agronomic traits. Chilean J Agri Res. 2016;76(3):300–6.

  41. Cairns JE, Namuco OS, Torres R, Simborio FA, Courtois B, Aquino GA, et al. Investigating early vigour in upland rice (Oryza sativa L.): Part II. Identification of QTLs controlling early vigour under greenhouse and field conditions. Field Crop Res. 2009;113(3):207–17.

  42. Han Y, Xie D, Teng W, Zhang S, Chang W, Li W. Dynamic QTL analysis of linolenic acid content in different developmental stages of soybean seed. Theor Appl Genet. 2011;122(8):1481–8.

  43. Cui K, Peng S, Xing Y, Xu C, Yu S, Zhang Q. Molecular dissection of seedling-vigor and associated physiological traits in rice. Theor Appl Genet. 2002;105(5):745–53.

  44. Kanbar A, Janamatti M, Sudheer E, Vinod MS, Shashidhar HE. Mapping QTLs underlying seedling vigour traits in rice (Oryza sativa L.). Curr Sci. 2006;90(1):24–6.

  45. Manangkil OE, Vu HTT, Mori N, Yoshida S, Nakamura C. Mapping of quantitative trait loci controlling seedling vigor in rice (Oryza sativa L.) under submergence. Euphytica. 2013;192(1):63–75.

  46. Dang X, Liu E, Liang Y, Liu Q, Breria CM, Hong D. QTL detection and elite alleles mining for stigma traits in Oryza sativa by association mapping. Front Plant Sci. 2016;7:1188.

  47. Murray MG, Thompson WF. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 1980;8(19):4321–6.

  48. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003;164(4):1567.

  49. Liu K, Muse SV. PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics. 2005;21(9):2128–9.

  50. Tamura K, Dudley J, Nei M, Kumar S. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) Software Version 4.0. Mol Biol Evol. 2007;24(8):1596–9.

  51. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38(6):1358–70.

  52. Excoffier L, Lischer Heidi EL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564–7.

  53. Weir BS, Hill WG. Estimating F-statistics. Annu Rev Genet. 2002;36(1):721–50.

  54. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23(19):2633–5.

  55. Hardy OJ, Vekemans X. SPAGEDI: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Resour. 2010;2(4):618–20.

  56. Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57(1):289–300.

  57. Islam MK, Islam MS, Biswas JK, Siyoung L, Alam I, Mooryong H. Screening of rice varieties for direct seeding method. Aust J Crop Sci. 2014;8(4):536–42.



The first author is grateful to Mr. Melak Sherif for his helpful suggestions on analyzing the data. Gratitude is extended to Mrs. Zhang Yuanqing for her help in taking the images, and to the bachelor's degree students Zemin Wang, Liyu Xiao, Tai An, Jiachen Wang and Ziyi Wang for their help in collecting phenotypic data. The authors are grateful to Hossam-eldin Seddik for language help.

Data sets and supplementary materials

The datasets and figures supporting the conclusions of this article are included within the article and its additional files.


This work was supported by the National Natural Science Foundation of China (31671658 and 31571743) and by a grant from the doctoral fund of the Educational Ministry (B0201300662). The funds were used for data collection and analysis and for the manuscript processing charge.

Author information




DH designed the research; NA, LD, MSE, DA, LB, EL and XD carried out the field experiment; NA, EL and DX carried out the molecular experiment; NA and MSE analyzed the data; NA wrote the manuscript; and DH revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Delin Hong.

Ethics declarations

Ethics approval and consent to participate

All the rice seeds used in this research were collected and maintained in our laboratory during long-term rice science studies. Accession numbers 1–148 were obtained from Dr. Weidong Jin, the former PhD student guided by the corresponding author (Rf. Doi: Accession numbers 149-177 were obtained from Mr. Nguyen Phuong Tung, the former international student from Vietnam studying in Nanjing Agricultural University for MS degree guided by the corresponding author (Rf. Doi:

Consent for publication

Not applicable.

Competing interests

The authors declare they have no competing interests. All authors have read and approved the manuscript.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1. Figure S1.

Gel image displaying SSR profiles amplified by primer RM3428 using total DNA as template. 1: Yingtoudao; 2: Changdaotou; 3: Yangmiaozhong; 4: Maoguangdao; 5: Dazhongdao; 6: Sanxiadao; 7: Xiaoqingmang; 8: Hongganlizhihong; 9: Wuxidao; 10: Wanzhognqiu; 11: Fengjingdao; 12: Liuzhong; 13: Cuganlizhihong; 14: Chiguwandao; 15: Jiaobaiyeqing; 16: Chiguhong; 17: Fanluoqing; 18: Zaoyedao; 19: Baidiegu; 20: Wangjiadao; 21: Jiangyinzhong; 22: Eyingbaijingdao; 23: Tiekewanguangtou; 24: Tiekedao; 25: Dadaosuitou; 26: Aibaidao; 27: Xiepihuang; 28: Xiaobaidao; 29: Baishidao; 30: Manbaidao; 31: Guangtouluhuabai; 32: Hongmangjing; 33: Wumangyedao; 34: Luhuabai; 35: Haidongqing; 36: Shenlenuo; 37: Xiangqing; 38: Jinghui418; 39: Malaihong; 40: Jingnuo330; 41: Zaijinjing; 42: Fuyu3; 43: Dongnongjing424; 44: Dongnongjingnuo418; 45: R254; 46: Jiangyinnuo; 47: Jinggunuo; 48: Shanhonggu

Additional file 2. Figure S2.

Relationship between D′ values and genetic distances of syntenic (intra-chromosome) marker pairs in six subpopulations

Additional file 3. Figure S3.

Part of the experimental procedure for SRUE measurement. a. Rice grains were lined up on the filter papers. b. The papers were rolled up and sealed with a rubber band. c. The top of each paper roll was covered with a self-sealing plastic bag, and the rolls were placed vertically in a plastic box containing tap water (10 cm depth). d. Etiolated seedlings after 10 days of culture in complete darkness at 30 °C. e. Fresh etiolated seedlings (shoot and root) separated from the grain remnants, on aluminum foil. f. Dried etiolated seedlings (shoot and root) and grain remnants on aluminum foil.

Additional file 4. Figure S4.

Part of the soil experiment procedure for SRUE measurement. A. Priming of rice grains. B. Laying the germinated seeds in the soil. C. Seeds under the soil conditions in the boxes. D. Comparison of seedling establishment rate (SER) between high- and low-SRUE varieties under the soil conditions. E. Seedlings of the superior parents after 15 days of culture under the soil conditions.

Additional file 5. Table S1.

Summary statistics for the 266 SSR markers used in this study.

Additional file 6. Table S2.

The code, name and origin of 542 rice accessions and the Q value of each accession belonging to the 6 subpopulations in this study.

Additional file 7. Table S3.

Analysis of molecular variance (AMOVA) for six subpopulations of rice accessions.

Additional file 8. Table S4.

Comparisons of marker loci detected in this study with loci reported previously.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.


About this article


Cite this article

Ali, N., Li, D., Eltahawy, M.S. et al. Mining of favorable alleles for seed reserve utilization efficiency in Oryza sativa by means of association mapping. BMC Genet 21, 4 (2020).



  • Oryza sativa
  • Seed reserve utilization efficiency
  • Association mapping
  • Favorable alleles
  • Direct seeding