- Research article
- Open Access
Genome-wide association mapping revealed a diverse genetic basis of seed dormancy across subpopulations in rice (Oryza sativa L.)
BMC Genetics volume 17, Article number: 28 (2016)
Seed dormancy is an adaptive trait employed by flowering plants to avoid harsh environmental conditions and ensure the continuity of their next generations. In cereal crops, moderate seed dormancy could help prevent pre-harvest sprouting and improve grain yield and quality. We performed a genome-wide association study (GWAS) for dormancy, based on seed germination percentage (GP) in freshly harvested seeds (FHS) and after-ripened seeds (ARS), in 350 worldwide accessions characterized by strong population structure of indica, japonica and Aus subpopulations.
The germination tests revealed that Aus and indica rice had stronger seed dormancy than japonica rice in FHS. Association analysis revealed 16 loci significantly associated with GP in FHS and 38 in ARS. Three of the 38 loci detected in ARS were also detected in FHS, and 13 of the ARS loci were detected near previously mapped dormancy QTL. In FHS, three of the association loci were located within 100 kb of previously cloned GA/IAA inactivation genes (GA2ox3, EUI1 and GH3-2) and one near the dormancy gene Sdr4. In ARS, an association signal was detected near the ABA signaling gene ABI5. No association peaks were shared among the subpopulations in FHS, and only one association peak was detected in both indica and japonica populations in ARS. Sdr4 and GA2ox3 haplotype analysis showed that Aus and indica II (IndII) varieties had stronger dormancy alleles whereas indica I (IndI) and japonica had weak or non-dormancy alleles.
The association study and haplotype analysis together indicate that independent genes and alleles contribute to the regulation and natural variation of seed dormancy among the rice subpopulations.
Seed dormancy, a phenomenon in which mature and viable seeds fail to germinate within a specified period under conditions favorable for germination, is a complex trait controlled by both environmental and genetic factors arising from maternal and embryonic tissues [1–3]. In nature, seed dormancy is an adaptive trait that wild species use to delay germination until environmental conditions favorable for the survival of their offspring are available. In a controlled environment, seed dormancy is measured by germination percentage, rate or index; germination percentage is the number of seeds germinated out of the total number of seeds planted within a specified number of days (usually seven to fourteen) [4, 5]. Dormancy is one of the traits that has been modified during the domestication of cereal crops, and it can be desirable because it helps prevent pre-harvest sprouting and thereby improves grain yield and quality [6, 7]. Since deep dormancy prevents germination and weak dormancy exposes the seeds to pre-harvest sprouting, a moderate level of dormancy is desirable.
In cultivated rice species, mean dormancy periods vary from one cultivar to another. The depth of dormancy is affected by the seed maturity stage [10, 11] and by environmental factors such as the temperature during seed ripening, the day length, the storage temperature [14, 15] and the seed moisture content during the dry after-ripening period, among others. Besides the environmental factors, seed dormancy is also regulated by a number of plant hormones such as abscisic acid, gibberellic acid, auxin, ethylene and brassinosteroids [16, 17].
Studies conducted in Arabidopsis revealed key seed maturation regulators including FUS3, LEC1, LEC2, DAG1 and ABI3 [18–20]. Molecular studies on mutations in HISTONE MONOUBIQUITINATION (HUB1) identified decreased dormancy in Arabidopsis seeds due to transcriptional control via effects on chromatin structure. The Arabidopsis DELAY OF GERMINATION 1 (DOG1) QTL was cloned and shown to be involved in embryonic dormancy. KYP/SUVH4, a mediator of H3 lysine 9 dimethylation, was demonstrated to be a negative regulator of seed dormancy in Arabidopsis. RDO5 was found to positively regulate seed dormancy by suppressing transcript levels of APUM9 and APUM11 in Arabidopsis. Chromatin remodeling was shown to correlate positively with DOG1 expression in response to dormancy cycling in the soil seed bank in Arabidopsis. In rice, qSD7–1, a clustered QTL (qSD7–1/qPC7), was delimited to the pleiotropic Rc locus and found to control seed dormancy by regulating the ABA biosynthetic pathway. Sdr4, a global regulator of seed maturation, was cloned in rice and shown to be positively regulated by OsVP1.
A number of seed dormancy QTL have been reported in cultivated and wild rice [28–33]. The Gramene QTL database for rice has documented 165 dormancy QTL including qDOR, qSD and sdr loci (http://www.gramene.org). The QTL, mapped on all 12 rice chromosomes except chromosome 10, include clustered QTL such as qSD7/qPC7, qSD1–2/qPH1 and qSD7–2/qPH7 [34, 35]. The successful use of QTL linkage mapping promoted studies of the genetic architecture of various traits in rice; however, its major limitation is that allelic diversity is restricted to the two parents, leading to low resolution [36, 37].
GWAS addresses the shortcomings of QTL linkage mapping since it does not require the development of a specific segregating population to detect QTL. The larger gene pool and the millions of genome-wide SNPs from next-generation sequencing used in GWAS can narrow the confidence intervals of the detected loci and so give higher genomic resolution. GWAS nevertheless has limitations, for example when the trait is controlled either by rare variants with large phenotypic effects or by common variants with small phenotypic effects [39, 40]. The likelihood of false positive associations due to strong or complete linkage of rare variants with other non-causative rare variants further reduces GWAS success [41, 42]. A large and geographically diverse sample, which maximizes genetic variation, or a large sample from a local population with high phenotypic diversity, which minimizes genetic heterogeneity, offers a solution to these shortcomings. Combining several SNPs in a region into a single indicator variable as a composite genotype can reduce the detection of rare variants. The use of mixed models has also minimized the detection of false positive associations by accounting for the phenotypic covariance that is due to genetic relatedness [45, 46]. The success of GWAS in detecting genes for traits of agronomic importance in rice, such as grain quality, grain yield, morphology, stress tolerance and nutritional quality, suggests that it can be useful for identifying more genes across the genome that contribute to seed dormancy [47–50]. In Arabidopsis, an integration of GWAS and transcriptomic analysis identified HD2B as a negative regulator of seed dormancy during cold-induced dormancy cycling.
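For orientation, a mixed model of the kind mentioned above is commonly written as follows. This is a generic textbook form in standard notation, offered as a sketch; the symbols are assumptions introduced here and the exact parameterization used in this study may differ.

```latex
\mathbf{y} = \mathbf{X}\boldsymbol{\beta} + \mathbf{u} + \boldsymbol{\varepsilon},
\qquad
\mathbf{u} \sim \mathcal{N}\!\left(\mathbf{0},\, \sigma_g^{2}\mathbf{K}\right),
\qquad
\boldsymbol{\varepsilon} \sim \mathcal{N}\!\left(\mathbf{0},\, \sigma_e^{2}\mathbf{I}\right)
```

Here y is the vector of phenotypes (GP in this context), X holds the tested SNP and any fixed covariates with effects β, u is a random genetic effect whose covariance is proportional to the kinship matrix K, and ε is the residual. Relatedness among accessions is absorbed by u, which is why such models reduce false positives caused by population structure.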
In the present study, we used GWAS in a global collection of 350 rice accessions to evaluate seed dormancy variation, based on seed GP, within and among the Aus, indica and japonica subpopulations. We identified 16 and 38 significant loci associated with seed dormancy in freshly harvested seeds and after-ripened seeds, respectively. The detection of the previously identified dormancy gene Sdr4, qSD7–1, ABI5, GA/IAA catabolic genes and previously mapped QTL near the association loci supported the reliability of our association mapping. This study also revealed the influence of different alleles in controlling dormancy among the cultivated rice groups. The detected association loci could be mined and used to improve pre-harvest sprouting tolerance through marker-assisted selection (MAS).
Phenotypic evaluations and heritability
A collection of 350 accessions of O. sativa collected from various parts of the world was used in this study. The germplasm consisted of indica, japonica, Aus subpopulations and intermediates (Additional file 1). The indica population was further sub-divided into indica I (IndI) and indica II (IndII) subpopulations and japonica into temperate japonica (Tej) and tropical japonica (Trj) subpopulations. In this diversity panel, FHS of Aus varieties had the lowest mean GP (38.6 %). The greatest range of GP was observed in Aus and IndII varieties (Table 1). The mean GP difference was largest between IndI (96.7 %) and IndII which had 55.5 % (Fig. 1a). No such significant difference was observed between Tej and Trj which had mean GPs of 78.1 % and 92.6 % respectively. These results signified that some genotypes could be characterized with strong seed dormancy.
For the ARS, Aus varieties had the lowest mean GP (57.9 %) and IndI the highest (98.6 %). IndII, Tej and Trj had mean GPs of 82.2, 88.2 and 96.5 % respectively (Table 1). The mean GP of each subpopulation was significantly higher in ARS than in FHS, with the exception of IndI and Trj (Fig. 1b). In addition, the range of mean GP among the five subpopulations was narrower in ARS (57.9 %–98.6 %) than in FHS (38.6 %–96.7 %) (Table 1). Seed dormancy had evidently been partly released or completely broken during the two-month after-ripening period, depending on the variety (Additional files 2 and 3). Furthermore, the heritability (H²) of GP was 92.0 % or above in every population.
Association mapping in FHS
To determine QTL associated with seed dormancy, we carried out GWAS on GP in FHS of the indica, japonica and Aus subpopulations independently and in the whole panel using a linear mixed model (LMM). The Manhattan plots and quantile-quantile plots for GP in FHS and ARS using LMM are shown in Figs. 2 and 3. Considering that linkage disequilibrium (LD) decay in cultivated rice extends over 100 kb to 200 kb [47, 52, 53], association peaks falling within a region of less than 150 kb were considered as one association peak. On this basis, a total of 16 association signals were identified for GP in FHS (Table 2).
Six signals (FHS1.1, FHS1.2, FHS4.1, FHS5.1, FHS7 and FHS11) were detected for GP in the whole population on chromosomes 1, 4, 5, 7 and 11 (Table 2). They individually explained 1.4–18.9 % of the GP variance. There were 2, 2 and 8 lead SNPs associated with GP in the Aus, indica and japonica subpopulations, respectively (Table 2). The association locus FHS2.1 detected in Aus explained the highest GP variance (71.1 %). Two associations (FHS1.3 and FHS7) were detected in indica rice. FHS7 was identified in both indica and the whole population, whereas FHS1.1 was identified in both Aus and the whole population. None of the eight signals detected in the japonica subpopulation was detected in the whole population or in any other subpopulation. The associations explained more of the GP variance within the subpopulations than in the whole population. For example, FHS7 explained 18.3 % of GP variance in the whole population, whereas it explained 44.9 % in the indica subpopulation.
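The peak-merging rule above can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: the `Snp` type, the function name, and the choice to measure the 150 kb span from the first SNP of each cluster are assumptions, and the positions and p-values are made up.

```python
from typing import List, NamedTuple


class Snp(NamedTuple):
    chrom: int
    pos: int    # base-pair position
    p: float    # association p-value


def merge_signals(snps: List[Snp], window: int = 150_000) -> List[Snp]:
    """Return one lead SNP (smallest p-value) per cluster of nearby SNPs.

    Assumption: a cluster is a run of significant SNPs on one chromosome
    spanning less than `window` bp from its first SNP.
    """
    leads: List[Snp] = []
    cluster: List[Snp] = []
    for snp in sorted(snps, key=lambda s: (s.chrom, s.pos)):
        # Close the current cluster on a new chromosome or once the span
        # from the cluster's first SNP reaches `window`.
        if cluster and (snp.chrom != cluster[0].chrom
                        or snp.pos - cluster[0].pos >= window):
            leads.append(min(cluster, key=lambda s: s.p))
            cluster = []
        cluster.append(snp)
    if cluster:
        leads.append(min(cluster, key=lambda s: s.p))
    return leads


# Made-up significant SNPs, for illustration only.
example = [
    Snp(1, 1_000_000, 1e-9),
    Snp(1, 1_080_000, 5e-10),
    Snp(1, 1_400_000, 2e-8),
    Snp(7, 500_000, 1e-12),
]
signals = merge_signals(example)
```

In this toy input the first two SNPs on chromosome 1 lie 80 kb apart and collapse into one signal, so three signals remain in total.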
Association mapping in ARS
Accordingly, we conducted GWAS on GP in ARS and a total of 38 associations were identified. Fourteen signals were detected in the whole population across the 12 chromosomes except chromosomes 3 and 4 (Table 3). They individually explained 0.1–29.3 % of GP variance. Four, 10 and 10 signals were detected in Aus, indica and japonica subpopulations respectively. The signal ARS1.1 was detected in both whole and Aus populations while ARS3 was detected in both indica and japonica. Even though the three signals ARS1.1 (whole and Aus), ARS11.2 (whole) and ARS5.2 (japonica) were detected in both FHS and ARS, their phenotype contribution was lower in ARS than in FHS, for example ARS1.1 in Aus contributed to 6 % GP variance in FHS and only 0.7 % in ARS (Table 3). The signal in Aus (ARS8.2) contributed to the highest GP variance (40.1 %). Thirteen out of the 38 signals were harbored within the regions of previously mapped dormancy QTL which probably could be the candidates for these associations.
Genes and QTL around the putative peak positions
The phytohormones ABA and GA have been implicated in the control of seed dormancy through the intrinsic balance of their biosynthesis and catabolism. Thus, a higher ratio of ABA to GA leads to dormancy and vice versa. We searched around the association peaks for dormancy related genes, including genes for ABA, GA and other hormones regulating dormancy such as auxin. Since LD decay in cultivated rice extends over 100 kb to 200 kb [47, 52, 53], genes for dormancy related hormones within 100 kb upstream or downstream of the association peaks were considered possible candidate genes for seed dormancy. Around the 16 association peaks detected in FHS, two GA related genes, one auxin related gene and one dormancy related gene were identified (Table 2). FHS7 (sf0723792996) was located 3 kb upstream of Sdr4, the first cloned seed dormancy gene in rice, in both indica and whole populations. GA2ox3, a GA catabolic gene, was located 66 kb downstream of FHS1.2 and was identified in both Aus and whole population. EUI1, a GA inactivation gene [57, 58], was located 35 kb upstream of FHS5 in the whole population. GH3–2, an IAA (the major form of auxin in rice) inactivating gene that catalyzes the formation of an IAA amino acid conjugate leading to the suppression of expansin genes, was detected 75 kb upstream of FHS1.1 in the whole population (Table 2). In ARS, the ABA related gene ABI5, which is involved in ABA signaling and in the regulation of LEA genes during seed maturation and germination, was detected 23 kb downstream of ARS1.3. OsHPL2, which plays a role in inhibition of seed germination [61, 62], was detected near ARS2.1. OsAsr1, believed to be involved in ABA signaling in response to osmotic stress, was detected 66 kb downstream of ARS11.3 (Table 3).
In addition to known cloned genes, one and eleven previously mapped QTL were detected in the regions harboring the association loci in FHS and ARS respectively. The signal FSGP5 in FHS was harbored in the region of qDGR5b on chromosome 1 (Table 2). The thirteen signals located within the regions of previously mapped QTL in ARS were spread across chromosomes 1, 2, 3, 6, 7, 11 and 12 (Table 3).
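The ±100 kb candidate-gene search described above amounts to a simple interval query. The sketch below is illustrative only: the function name, the gene names `geneA` to `geneC`, and all coordinates are hypothetical placeholders, not values from this study or from any annotation database.

```python
from typing import Dict, List, Tuple

# gene name -> (chromosome, start, end); hypothetical coordinates.
GeneTable = Dict[str, Tuple[int, int, int]]


def genes_near_peak(chrom: int, pos: int, genes: GeneTable,
                    flank: int = 100_000) -> List[Tuple[str, int]]:
    """List (gene, distance in bp) for genes within `flank` bp of a lead SNP."""
    hits = []
    for name, (g_chrom, start, end) in genes.items():
        if g_chrom != chrom:
            continue
        if end < pos:          # gene lies entirely before the SNP
            dist = pos - end
        elif start > pos:      # gene lies entirely after the SNP
            dist = start - pos
        else:                  # SNP falls inside the gene
            dist = 0
        if dist <= flank:
            hits.append((name, dist))
    return sorted(hits, key=lambda h: h[1])


annotation: GeneTable = {
    "geneA": (7, 1_003_000, 1_004_500),   # 3 kb from the SNP
    "geneB": (7, 1_250_000, 1_252_000),   # 250 kb away, outside the window
    "geneC": (1, 990_000, 995_000),       # different chromosome
}
candidates = genes_near_peak(7, 1_000_000, annotation)
```

Only `geneA` is returned here, since the other two entries fall outside the window or on another chromosome. Whether a hit counts as upstream or downstream would additionally depend on strand information, which this sketch omits.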
Sdr4 haplotypes analysis
To assess the contribution of Sdr4 to seed dormancy, we analyzed its haplotypes within the coding region. There were 4 SNPs within the coding region of Sdr4, 1 synonymous and 3 non-synonymous (Additional file 4a). The 3 non-synonymous SNPs resulted in 3 haplotypes (Hap1–Hap3) among the 350 accessions. Hap1 was the dominant haplotype, present in 70, 61.4 and 100 % of Aus, indica and japonica varieties respectively. Hap2 was present in 30 % of Aus and 30 % of indica varieties. Hap3 was found only in indica, at 8.6 % (Table 4). Comparison within the indica subpopulation revealed significant differences in GP between Hap1 and Hap2 and between Hap1 and Hap3, but no significant difference between Hap2 and Hap3. A significant difference was also observed between Hap1 and Hap2 in Aus (Table 4). In both the Aus and indica subpopulations, varieties possessing Hap2 had the lowest mean GP and those possessing Hap1 had the highest. No variation of Sdr4 was observed in japonica rice (Table 4).
GA2ox3 haplotypes analysis
GA2ox3 is a GA catabolism gene that catalyzes the oxidation of GA20 to GA29 and GA29 to GA29-catabolites. It is responsible for the homeostatic regulation of biologically active GA concentration in rice; hence its expression leads to reduced GA levels and suppressed germination or growth. Because of its direct involvement in the GA pathway and its detection near association peaks in Aus and the whole population, we searched for SNPs within its genomic DNA and found a total of 22 SNPs, including 3 non-synonymous SNPs, among the 350 accessions. The 3 non-synonymous SNPs, namely sf0131794745, sf0131794598 and sf0131795793, resulted in amino acid changes from leucine to valine, valine to isoleucine and alanine to valine, respectively (Additional file 4a). Haplotype analysis using the non-synonymous SNPs yielded 3 haplotypes (Hap1 to Hap3), and we compared GP among these 3 major haplotypes (Table 4). Hap1 was found in 66.7, 85.6 and 11.7 % of Aus, indica and japonica (only Trj) varieties respectively. Hap2 was found in 33.3 % of Aus and 13.3 % of indica (IndI) varieties and was absent in japonica. Hap3 was found in 88.3 % of japonica varieties (Additional file 4b). Comparison within the Aus subpopulation revealed a significant difference in GP between Hap1 and Hap2. In indica there was a significant difference between Hap1 and Hap2, while the difference between Hap2 and Hap3 was not significant. There was no significant difference between Hap1 and Hap3 in japonica. Except for Hap1 of Aus varieties, which had the lowest mean GP of 21.5 %, all other haplotypes showed a mean GP above 70 % across the various subpopulations (Table 4), indicating that the Hap1 allele could probably be functioning only in Aus and not in other subpopulations.
Diverse genetic basis of seed dormancy in indica, japonica and Aus subpopulations
Seed dormancy is a complex trait controlled by genetic and environmental conditions during seed development and storage [2, 9]. Thus temperature during grain filling in rice is an important determinant of levels of seed dormancy. The harvest time in relation to stage of ripening as well as the levels of temperature and humidity during storage is equally important in dormancy maintenance and release. Thus in order to minimize environmental effects experiment-wise, only 350 accessions whose seed development stages experienced similar temperature and humidity conditions in the field were kept for testing seed dormancy. In addition, the panicles for each accession that emerged within 2–3 days were uniformly harvested 32 days after heading, which minimized the environmental noise within accessions.
Our results indicated a lower GP of FHS in Aus and IndII, at about 39 and 55 % respectively, whereas IndI and tropical japonica had very high GPs of more than 90 %. The temperate japonica subpopulation had a GP of about 78 %. On average, most Aus accessions and a number of IndII varieties had strong seed dormancy compared to the IndI subpopulation, which had no seed dormancy, and the japonica subpopulation, which had weak dormancy. Whereas seed dormancy was diverse within the indica subpopulation, no large difference in GP was observed within the japonica subpopulation. It is believed that Tej and Trj have a close genetic relationship with lower genetic diversity and that Tej was derived from Trj [65, 66]. Thus the minor differences in GP between the two japonica subpopulations could be a result of this low genetic diversity. A previous study showed that Aus has a smaller geographic distribution and a very high genetic diversity coupled with adaptive traits. Therefore, the lowest GP levels and higher phenotypic contribution (up to 71 %) in Aus were probably due to this diverse genetic differentiation. In addition, there were 16 associations detected in FHS which were unique to their specific subpopulations. More signals were expected in indica and Aus than in japonica because of the lower GP levels and wider GP variance in indica and Aus. However, the results were the reverse. This could be explained by a few major QTL, like Sdr4, in the indica and Aus subpopulations and several minor QTL in japonica.
Previous findings have shown that 4–6 weeks could readily release dormancy in rice seeds stored at 20–30 °C at 11 % moisture content [15, 43]. In ARS there was a sharp GP increase in IndII and Aus: the indica and japonica subpopulations had mean GPs above 80 %, while the GP of Aus increased to 57.9 % from 38.6 %. This indicates that two months of after-ripening completely broke or significantly released the seed dormancy of many accessions in our study. There were 38 signals detected in ARS, of which 10, 4 and 10 were detected in indica, Aus and japonica respectively. Of these, only one signal (ARS3) was detected in both indica and japonica. Together, these results indicate that different genes/alleles control seed dormancy in the various rice subgroups, probably because of their divergent evolution and domestication processes.
Early and late detectable signals controlling seed dormancy
Dormancy QTL have been categorized into three groups based on when their main effect is detected during the after-ripening period [30, 68]: QTL with a constant effect, which are detectable in FHS and throughout after-ripening; QTL with early detectable effects, which influence germination of FHS and become less effective after a few weeks of after-ripening; and late detectable QTL, whose effect on germination is detectable later in the after-ripening period. In this categorization, the genetic interactions and the dormancy allele background have to be considered. Our GWAS identified a total of 16 and 38 association peaks in FHS and ARS respectively. Only three of the 16 signals in FHS (FHS1.1, FHS11 and FHS5.2) could be detected in ARS, while the remaining 13 associations disappeared. One signal in FHS and 13 signals in ARS were detected within the regions of previously mapped QTL, indicating that these QTL could be the candidates for these associations.
It was also interesting that 35 of the 38 signals detected in ARS were not detected in FHS, which raises the question of why more signals were detected in ARS than in FHS when seed dormancy had been released to a larger extent. The dormancy QTL categorization may provide an answer. The three signals common to FHS and ARS probably kept functioning from freshly harvested seeds through to after-ripened seeds, but their genetic effect decreased with time, as for Sdr4. The FHS signals lost in ARS could be related to early detectable dormancy effects and/or weak dormancy alleles that influenced the germination of FHS and became less effective after the two months of after-ripening. The 35 association signals newly detected in ARS, including the 13 signals harbored in the regions of previously mapped QTL, were probably related to late detectable QTL. A transcriptomic study in A. thaliana revealed a separate genetic mechanism underlying dormancy establishment and after-ripening (AR) in seeds, and showed that AR genes were down-regulated in freshly harvested seeds and up-regulated in stored seeds. Thus, we may conclude that independent genes control seed dormancy in FHS and ARS.
Candidate genes for seed dormancy
Dormancy in seeds has been studied in relation to failure of seeds to germinate in a specified period of time and by examining the levels of ABA, GA and other growth related phytohormones in the wild type and mutants [3, 18, 19]. Hundreds of seed dormancy QTL have been detected by linkage mapping (http://www.gramene.org), but only the dormancy genes Sdr4 and qSD7-1 and the endosperm-imposed seed dormancy QTL qSD1–2 have been molecularly cloned in rice. Our association mapping yielded 16 lead SNPs in FHS, two of which were located less than 100 kb from GA inactivation genes (GA2ox3 and EUI1), one near an auxin inactivation gene (GH3–2) and one near Sdr4. The GA genes were reported to regulate rice growth and panicle architecture by regulating the concentration of biologically active GA [56–58, 71]. The auxin related gene GH3–2 was reported to inactivate IAA by catalyzing the formation of an IAA amino acid conjugate, resulting in the suppression of expansins. It is most likely that these genes affect seed dormancy and could be the candidates for these associations. Although the signal ARS1.2 was detected more than 200 kb away from qSD1–2/GA20ox-2, we propose GA20ox-2 as the possible candidate for ARS1.2. Loss-of-function mutations of OsGA20ox2 resulted in reduced GA levels, which slowed down tissue morphogenesis and delayed ABA accumulation and subsequent maturation programs, hence decreasing dormancy at harvest. The failure to detect genes directly involved in the ABA pathway near the associated loci in FHS, together with the detection of a few ABA related genes such as ABI5, OsAsr1 and OsHPL2 (implicated in the ABA signaling pathway) in ARS, could indicate that dormancy maintenance and release in rice is independent of ABA levels, although ABA plays a significant role in these mechanisms. In Arabidopsis it was demonstrated that ABA is not a dormancy-specific factor in imbibed rice seeds but rather a growth regulator of seed dormancy and germination.
It was of interest that GA2ox3 was detected near association loci in both FHS and ARS in Aus, an indication that GA2ox3 plays a crucial role in dormancy maintenance and has a stable effect. In barley, after-ripening was found to promote expression of HvGA2ox3 in imbibed after-ripened seeds. In sorghum, a transcriptional study of imbibed dormant seed harvested 30 days after pollination (DAP) revealed early GA synthesis activity that was suppressed by an increased deactivation rate due to SbGA2ox3 and SbGA2ox1, which were highly expressed. The expression of these two catabolic genes, however, disappeared in imbibed dormant seed harvested 42 DAP. Thus the direct involvement of GA catabolic genes in seed dormancy maintenance should be studied more closely, using targeted mutagenesis by genome editing or transcriptomic techniques. Past studies have been directed mostly towards ABA and, to some extent, GA synthesis genes, leaving aside the GA catabolic genes, which could be as important as ABA for seed dormancy.
Sdr4 and GA2ox3 haplotypes for breeding pre-harvest sprouting resistance variety
The haplotype analysis within the Sdr4 and GA2ox3 genes revealed that different alleles controlled seed dormancy among different rice populations. For example, Sdr4 Hap1 conferred low dormancy; Hap3, unique to indica, conferred moderate dormancy; and Hap2 conferred strong dormancy, especially in Aus. All japonica varieties possessed Hap1 alleles. These results support the previous study on Sdr4, which reported three different alleles (Sdr4-n, Sdr4-k and Sdr4-k’), with all japonica varieties carrying the Sdr4-n allele, which conferred low dormancy, whereas Sdr4-n and Sdr4-k were widely distributed in indica. Even though Hap2 conferred strong dormancy, some accessions with Hap2 had higher GP and some accessions with Hap1 had lower GP. This was concluded to result from the genetic interaction between Sdr4 and a modifier gene, OsVP1. We noted that all but four IndI varieties had Hap1 alleles while only 24 % of IndII had Hap1 alleles (Additional file 4b), an indication that IndI was strongly selected for reduced dormancy during domestication, eventually rendering it non-dormant. Likewise, the genetic interactions among the GA2ox3 haplotypes can be researched further in order to understand how they confer seed dormancy. Upon validation, GA2ox3 can be considered for breeding pre-harvest sprouting resistant rice varieties, since GA2ox3 was only detected near association peaks in Aus, which had the lowest GP of all subpopulations, and its inhibiting effect on germination persisted throughout the after-ripening period.
In conclusion, this study revealed that different genes/alleles conferred dormancy in the various subpopulations of rice in FHS and ARS. The association loci may provide a rich source of information about the natural genetic variations underlying the evolution, domestication and breeding of indica, Aus and japonica rice in relation to seed dormancy and other adaptive traits. The major association signals could be useful in improving the non-dormant IndI and Trj varieties to possess moderate seed dormancy by crossing them with strong dormant varieties from Aus and IndII using marker assisted selection (MAS) breeding approach.
A worldwide rice collection consisting of 529 rice accessions was grown at the experimental station of Huazhong Agricultural University, Wuhan, in May 2014 for seed dormancy evaluation. Seven plants were planted in each row, with a spacing of 16.5 cm between plants within a row and 26.4 cm between rows. Field management followed standard agronomic practices. The five middle plants in each row were tagged for heading dates, harvested and used for examining seed dormancy. In order to minimize environmental noise, 350 accessions whose seed development was completed under high temperature and high humidity conditions were selected for seed dormancy evaluation.
Phenotype assessment for seed dormancy
The accessions were grown to maturity in the field. The heading dates of two early heading panicles from each of the five middle plants were individually recorded, and the panicles were tagged every day. The tagged panicles of each accession that headed on the same dates (2–3 day heading date interval) were harvested 32 days after heading, pooled together and used to score the germination percentages. Heading date varied widely across the whole population, from June 15th to August 30th. In order to minimize the environmental noise, only the accessions that flowered between 1st July 2014 and 5th August 2014 were used for germination analysis. The average daily temperature during this period ranged from 23 to 28 °C, with an average humidity of 70–95 %.
The panicles that flowered on the same date, or within a one to two day heading date interval, were harvested from the five middle plants of each accession and dried at 30 °C for 24 h; the seeds were then removed from the panicles, pooled together and separated into two batches. The first batch of seeds for each accession was surface sterilized with 0.6 % sodium hypochlorite solution for 15 min, rinsed five times with distilled water and pre-germinated by soaking in distilled water for 48 h at 30 °C, with the water changed every day. For each accession, 100 imbibed seeds were transferred into 9 cm Petri plates lined with wet filter paper, in three replicates, and placed in a growth chamber set at 28 °C for 14 h light and 22 °C for 10 h dark with 100 % relative humidity for 7 days. A seed was considered germinated when the radicle or coleoptile reached a length of ≥2 mm. GP was scored as the number of seeds germinated within the first 7 days as a percentage of the total number of seeds in the plate. The seeds from the second batch were stored at room temperature (~25 °C) for two months to break dormancy by after-ripening, after which they were used for germination tests as described above. The germination percentage results were presented as the mean of the germination percentages obtained from the three replicates of 100 seeds ± standard deviation (SD).
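The GP scoring described above reduces to a ratio per plate followed by a mean and SD over the three replicates. The sketch below illustrates this; the function names are hypothetical, the counts are made up, and the use of the sample standard deviation (n − 1 denominator) is an assumption, since the text does not say which SD estimator was used.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def germination_percentage(germinated: int, total: int = 100) -> float:
    """Seeds germinated by day 7 as a percentage of the seeds in the plate."""
    return 100.0 * germinated / total


def summarize_replicates(counts: Sequence[int],
                         total: int = 100) -> Tuple[float, float]:
    """Mean GP and sample SD across replicate plates."""
    gps = [germination_percentage(c, total) for c in counts]
    return mean(gps), stdev(gps)


# Made-up counts of germinated seeds in three plates of 100 seeds each.
gp_mean, gp_sd = summarize_replicates([40, 35, 45])
print(f"GP = {gp_mean:.1f} ± {gp_sd:.1f} %")
```

With these invented counts the three plate GPs are 40, 35 and 45 %, giving a mean of 40.0 % and a sample SD of 5.0 %.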
Next-generation sequencing of the accessions was conducted in a previous study. Population structure was modeled as a random effect in a linear mixed model (LMM) using the kinship (K) matrix, and GWAS was performed using the LMM provided by the FaST-LMM program. The numbers of SNPs used for GWAS were 3,916,415 for the whole population, 1,925,362 for Aus, 2,767,159 for indica and 1,857,845 for japonica, considering only SNPs with a minor allele frequency of ≥0.05 and with the minor allele present in ≥6 varieties in a population. However, some SNPs were completely linked, causing redundancy in GWAS. Thus the number of informative SNPs (M) was used to calculate the effective number of independent SNPs (Me) for a modified Bonferroni correction. The effective numbers of independent SNPs (Additional file 5) were then used to calculate the genome-wide significance thresholds for LMM based on a nominal level of 0.05, resulting in stringent thresholds of 6.6 × 10^−8, 8.7 × 10^−8, 2.0 × 10^−7 and 2.0 × 10^−7 for the whole population and the indica, japonica and Aus subpopulations, respectively.
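As a worked illustration of the threshold calculation, the sketch below assumes the modified Bonferroni threshold is simply the nominal level divided by the effective number of independent SNPs (threshold = 0.05 / Me). The function name is hypothetical, and the Me value is back-calculated here from the reported whole-population threshold; it is not taken from Additional file 5.

```python
import math


def genome_wide_threshold(effective_snps: float, alpha: float = 0.05) -> float:
    """Bonferroni-style threshold using the effective number of independent SNPs."""
    return alpha / effective_snps


# Assumed Me, chosen so that 0.05 / Me reproduces the reported 6.6e-8.
me_whole = 757_576
threshold = genome_wide_threshold(me_whole)
# The same cutoff on the -log10(p) scale used on Manhattan plot axes.
neg_log10 = -math.log10(threshold)
print(f"threshold = {threshold:.2e}, -log10(p) = {neg_log10:.2f}")
```

Using Me rather than the raw SNP count M makes the correction less conservative, because completely linked SNPs do not represent independent tests.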
Sdr4 and GA2ox3 haplotype analysis
Analysis of the whole genomic DNA sequences of the Sdr4 and GA2ox3 genes among the 350 accessions identified 4 and 22 SNPs, respectively (http://ricevarmap.ncpgr.cn). Only the non-synonymous SNPs within the coding regions of these genes were used for haplotype analysis.
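One simple way to group accessions into haplotypes from a handful of non-synonymous SNPs is to join each accession's alleles in SNP order and tabulate the resulting strings per sub-population. The sketch below shows that idea only; it is not necessarily the procedure used in this study, and the accession names, alleles and function names are all hypothetical toy values.

```python
from collections import Counter, defaultdict


def call_haplotypes(genotypes):
    """Map each accession to a haplotype string by joining alleles in SNP order."""
    return {accession: "".join(alleles) for accession, alleles in genotypes.items()}


def count_by_subpopulation(haplotypes, subpopulation):
    """Count how many accessions of each sub-population carry each haplotype."""
    counts = defaultdict(Counter)
    for accession, haplotype in haplotypes.items():
        counts[subpopulation[accession]][haplotype] += 1
    return {group: dict(counter) for group, counter in counts.items()}


# Hypothetical toy data: alleles at two non-synonymous SNPs per accession.
genotypes = {
    "acc1": ("A", "G"),
    "acc2": ("A", "G"),
    "acc3": ("T", "G"),
    "acc4": ("T", "C"),
}
subpopulation = {
    "acc1": "Aus",
    "acc2": "indica",
    "acc3": "japonica",
    "acc4": "japonica",
}

haplotypes = call_haplotypes(genotypes)
table = count_by_subpopulation(haplotypes, subpopulation)
print(table)
```

The resulting table has the same shape as a per-sub-population haplotype count: one row per sub-population, one count per haplotype.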
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files. The original data sets used in this study are available upon request, because part of the data is not publicly available.
Bewley JD. Seed germination and dormancy. Plant Cell. 1997;9:1055–66.
Li B, Foley ME. Genetic and molecular control of Seed Dormancy. Trends Plant Sci. 1997;2:384–9.
Baskin JM, Baskin CC. Classification system for seed dormancy. Seed Sci Res. 2004;14:1–16.
Baskin CC, Baskin JM. Seeds: ecology, biogeography, and evolution of dormancy and germination. San Diego: Academic; 1998.
Veasey EA, Karasawa MGM, Santos PP, Rosa MS, Mamanie E, Oliveira GCX. Variation in the loss of seed dormancy during after-ripening of wild and cultivated Rice Species. Ann Bot. 2004;94:875–82.
Harlan JR, de Wet JMJ, Price EG. Comparative evolution of cereals. Evolution. 1973;27:311–25.
Gubler F, Millar AA, Jacobsen JV. Dormancy release, ABA and pre-harvest sprouting. Curr Opin Plant Biol. 2005;8:183–7.
Bewley JD, Black M. Seeds: physiology of development and germination. 2nd ed. New York: Plenum Press; 1994.
Roberts EH. Dormancy of rice seed. I. The distribution of dormancy periods. J Exp Bot. 1961;13:319–39.
Roberts EH. Dormancy in rice seed. III. The influence of temperature, moisture and gaseous environment. J Exp Bot. 1962;13:75–94.
Anderson JA, Sorrells ME, Tanksley SD. RFLP analysis of genomic regions associated with resistance to pre-harvest sprouting in wheat. Crop Sci. 1993;33:453–9.
Ikehashi H. Induction and test of dormancy of rice seeds by temperature condition during maturation. Japan J Breed. 1972;22:209–16.
Takahashi N. Inheritance of seed germination and dormancy. In: Science of rice plant: genetics. Tokyo: Food and Agric Policy Res Center; 1997. p. 348–59.
Roberts EH. Dormancy in rice seed. IV. Varietal responses to storage and germination temperatures. J Exp Bot. 1965;16:341–9.
Cohn MA, Hughes JA. Seed dormancy in red rice (Oryza sativa). I. Effect of temperature on dry-afterripening. Weed Sci. 1981;29:402–4.
Koornneef M, Bentsink L, Hilhorst H. Seed dormancy and germination. Curr Opin Plant Biol. 2002;5:33–6.
Finkelstein RR. The role of hormones during seed development and germination. In: Davies PJ, editor. Plant Hormones – biosynthesis, signal transduction, action! Dordrecht, The Netherlands: Kluwer Academic Publishers; 2004. p. 513–37.
Rohde A, Kurup S, Holdsworth M. ABI3 emerges from seed. Trends Plant Sci. 2000;5:418–9.
Monke G, Altschmied L, Tewes A, Reidt W, Mock HP, Baumlein H, et al. Seed-specific transcription factors ABI3 and FUS3: molecular interaction with DNA. Planta. 2004;219:158–66.
Gualberti G, Papi M, Bellucci L, Ricci I, Bouchez D, Camilleri C, et al. Mutations in the Dof zinc finger genes DAG2 and DAG1 influence with opposite effects germination of Arabidopsis seeds. Plant Cell. 2002;14:1253–63.
Liu Y, Koornneef M, Soppe WJ. The absence of histone H2B monoubiquitination in the Arabidopsis hub1 (rdo4) mutant reveals a role for chromatin remodeling in seed dormancy. Plant Cell. 2007;19:433–44.
Bentsink L, Jowett J, Hanhart CJ, Koornneef M. Cloning of DOG1, a quantitative trait locus controlling seed dormancy in Arabidopsis. Proc Natl Acad Sci U S A. 2006;103:17042–7.
Zheng J, Chen FY, Wang Z, Cao H, Li X, Deng X, et al. A novel role for histone methyltransferase KYP/SUVH4 in the control of Arabidopsis primary seed dormancy. New Phytol. 2012;193:605–16.
Xiang Y, Nakabayashi K, Ding J, He F, Bentsink L, Soppe WJJ. REDUCED DORMANCY5 Encodes a Protein Phosphatase 2C that Is Required for Seed Dormancy in Arabidopsis. Plant Cell. 2014;26:4362–75.
Footitt S, Müller K, Kermode AR, Finch-Savage WE. Seed dormancy cycling in Arabidopsis: chromatin remodeling and regulation of DOG1 in response to seasonal environmental signals. Plant J. 2015;81:413–25.
Gu XY, Foley ME, Horvath DP, Anderson JV, Feng J, Zhang L, et al. Association between seed dormancy and pericarp color is controlled by a pleiotropic gene that regulates abscisic acid and flavonoid synthesis in weedy Red rice. Genet. 2011;189:1515–24.
Sugimoto K, Takeuchi Y, Ebana K, Miyao A, Hirochika H, Hara N, et al. Molecular cloning of Sdr4, a regulator involved in seed dormancy and domestication of rice. Proc Natl Acad Sci U S A. 2010;107:5792–7.
Lin SY, Sasaki T, Yano M. Mapping quantitative trait loci controlling seed dormancy and heading date in rice, Oryza sativa L., using backcross inbred lines. Theor Appl Genet. 1998;96:997–1003.
Dong Y, Tsuzuki E, Kamiunten H, Terao H, Lin D, Matsuo M, et al. Identification of quantitative trait loci associated with pre-harvest sprouting resistance in rice (Oryza sativa L.). Field crops Res. 2003;81:133–9.
Gu XY, Kianian SF, Foley ME. Multiple loci and epistases control genetic variation for seed dormancy in weedy rice (Oryza sativa). Genet. 2004;166:1503–16.
Wan JM, Cao YJ, Wang CM, Ikehashi H. Quantitative trait loci associated with seed dormancy in rice. Crop Sci. 2005;45:712–6.
Jiang L, Cao YJ, Wang CM, Zhai HQ, Wan JM, Yoshimura A. Detection and analysis of QTL for seed dormancy in rice (Oryza sativa L.) using RIL and CSSL population. Acta Genet Sin. 2003;30:453–8.
Li W, Xu L, Bai X, Xing Y. Quantitative trait loci for seed dormancy in rice. Euphytica. 2010;178:427–35.
Gu XY, Turnipseed EB, Foley ME. The qSD12 locus controls offspring tissue-imposed seed dormancy in rice. Genet. 2008;179:2263–73.
Ye H, Beighley DH, Feng J, Gu XY. Genetic and physiological characterization of two clusters of quantitative trait loci associated with seed dormancy and plant height in rice. G3 (Bethesda). 2013;3:323–31.
Borevitz JO, Nordborg M. The impact of genomics on the study of natural variation in Arabidopsis. Plant Physiol. 2003;132:718–25.
Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods. 2013;9:29.
Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009;106:9362–7.
Asimit J, Zeggini E. Rare variant association analysis methods for complex traits. Annu Rev Genet. 2010;44:293–308.
Gibson G. Rare and common variants: twenty arguments. Nat Rev Genet. 2011;13:135–45.
Dickson SP, Wang K, Krantz I, Hakonarson H, Goldstein DB. Rare variants create synthetic genome-wide associations. PLoS Biol. 2010;8(1):e1000294.
Wray NR, Purcell SM, Visscher PM. Synthetic associations created by rare variants do not explain most GWAS results. PLoS Biol. 2011;9(1):e1000579.
Li Y, Huang Y, Bergelson J, Nordborg M, Borevitz JO. Association mapping of local climate sensitive quantitative trait loci in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2010;107:21199–204.
Feng T, Zhu X. Detecting rare variants. Methods Mol Biol. 2012;850:453–64.
Yu J, Pressoir G, Briggs WH, Vroh BI, Yamasaki M, Doebley JF, et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38:203–8.
Listgarten J, Lippert C, Kadie CM, Davidson RI, Eskin E, Heckerman D. Improved linear mixed models for genome-wide association studies. Nat Methods. 2012;9:525–6.
Huang X, Wei X, Sang T, Zhao Q, Feng Q, Zhao Y, et al. Genome-wide association studies of 14 agronomic traits in rice landraces. Nat Genet. 2010;42:961–7.
Zhao K, Tung CW, Eizenga GC, Wright MH, Ali ML, Price AH, et al. Genome-wide association mapping reveals a rich genetic architecture of complex traits in oryza sativa. Nat Commun. 2011;2:267. doi:10.1038/ncomms1467.
Norton GJ, Douglas A, Lahner B, Yakubova E, Guerinot ML, Pinson SRM, et al. Genome wide association mapping of grain arsenic, copper, molybdenum and zinc in rice (Oryza sativa L.) grown at four international field sites. PLoS One. 2014;9(2):e89685.
Eizenga GC, Ali ML, Bryant RJ, Yeater KM, McClung AM, McCouch SR. Registration of the ‘Rice Diversity Panel 1’ for genome-wide association studies. J Plant Registrations. 2014;8:109–16.
Yano R, Takebayashi Y, Nambara E, Kamiya Y, Seo M. Combining association mapping and transcriptomics identify HD2B histone deacetylase as a genetic factor associated with seed dormancy in Arabidopsis thaliana. Plant J. 2013;74:815–28.
Mather KA, Caicedo AL, Polato NR, Olsen KM, McCouch S, Purugganan MD. The extent of linkage disequilibrium in rice (Oryza sativa L.). Genet. 2007;177:2223–32.
McNally KL, Childs KL, Bohnert R, Davidson RM, Zhao K, Ulat VJ, et al. Genome-wide SNP variation reveals relationships among landraces and modern varieties of rice. Proc Natl Acad Sci U S A. 2009;106:12273–8.
Ali-Rachedi S, Bouinot D, Wagner MH, Bonnet M, Sotta B, Grappin P, et al. Changes in endogenous abscisic acid levels during dormancy release and maintenance of mature seeds: studies with the Cape Verde Islands ecotype, the dormant model of Arabidopsis thaliana. Planta. 2004;219:479–88.
Cadman CS, Toorop PE, Hilhorst HW, Finch-Savage WE. Gene expression profiles of Arabidopsis Cvi seeds during dormancy cycling indicate a common underlying dormancy control mechanism. Plant J. 2006;46:805–22.
Sakai M, Sakamoto T, Saito T, Matsuoka M, Tanaka H, Kobayashi M. Expression of novel rice gibberellin 2-oxidase gene is under homeostatic regulation by biologically active gibberellins. J Plant Res. 2003;116:161–4.
Zhu Y, Nomura T, Xu Y, Zhang Y, Peng Y, Mao B, et al. Elongated uppermost internode encodes a cytochrome P450 monooxygenase that epoxidizes gibberellins in a novel deactivation reaction in rice. Plant Cell. 2006;18:442–56.
Luo A, Qian Q, Yin HF, Liu XQ, Yin CX, Lan Y, et al. EUI1, encoding a putative cytochrome P450 monooxygenase, regulates internode elongation by modulating gibberellin response in rice. Plant Cell Physiol. 2006;47:181–91.
Fu J, Liu H, Li Y, Yu H, Li X, Xiao J, et al. Manipulating broad-spectrum disease resistance by suppressing pathogen-induced auxin accumulation in rice. Plant Physiol. 2011;155:589–602.
Finkelstein R, Lynch T. The Arabidopsis abscisic acid response gene ABI5 encodes a basic leucine zipper transcription factor. Plant Cell. 2000;12:599–609.
Gardner HW, Dornbos DLJ, Desjardins A. Hexanal, trans-2-hexenal, and trans-2-nonenal inhibit soybean, Glycine max, seed germination. J Agric Food Chem. 1990;38:1316–20.
Chehab EW, Raman G, Walley JW, Perea JV, Banu G, Theg S, et al. Rice HYDROPEROXIDE LYASES with unique expression patterns generate distinct aldehyde signatures in Arabidopsis. Plant Physiol. 2006;141:121–34.
Vaidyanathan R, Kuruvilla S, Thomas G. Characterization and expression pattern of an abscisic acid and osmotic stress responsive gene from rice. Plant Sci. 1998;140:21–30.
Ni J, Colowit P, Mackill D. Evaluation of genetic diversity in rice subspecies using microsatellite markers. Crop Sci. 2002;42:601–7.
Glaszmann JC. Isozymes and classification of Asian rice varieties. Theor Appl Genet. 1987;74:21–30.
Zhang Q, Maroof M, Lu T, Shen B. Genetic diversity and differentiation of Indica and Japonica rice detected by RFLP analysis. Theor Appl Genet. 1992;83:495–9.
Garris AJ, Tai TH, Coburn J, Kresovich S, McCouch S. Genetic structure and diversity in Oryza sativa L. Genet. 2005;169:1631–8.
Alonso-Blanco C, Bentsink L, Hanhart CJ, Vries HBE, Koornneef M. Analysis of natural allelic variation at seed dormancy loci of Arabidopsis thaliana. Genet. 2003;164:711–29.
Carrera E, Holman T, Medhurst A, Dietrich D, Footitt S, Theodoulou FL, et al. Seed after-ripening is a discrete developmental pathway associated with specific gene networks in Arabidopsis. Plant J. 2008;53:214–24.
Ye H, Feng JH, Zhang LH, Zhang JF, Mispan MS, Cao ZQ, et al. Map-based cloning of seed dormancy1–2 identified a gibberellin synthesis gene regulating the development of endosperm-imposed dormancy in rice. Plant Physiol. 2015;169:2152–65.
Miura K, Ikeda M, Matsubara A, Song XJ, Ito M, Asano K, et al. OsSPL14 promotes panicle branching and higher productivity in rice. Nat Genet. 2010;42:545–50.
Gianinetti A, Vernier P. On the role of abscisic acid in seed dormancy of red rice. J Exp Bot. 2007;58:3449–62.
Gubler F, Hughes T, Waterhouse P, Jacobsen J. Regulation of dormancy in barley by blue light and after-ripening: effects on abscisic acid and gibberellin metabolism. Plant Physiol. 2008;147:886–96.
Rodriguez MV, Mendiondo GM, Cantoro R, Auge GA, Luna V, Masciarelli O, et al. Expression of seed dormancy in grain sorghum lines with contrasting pre-harvest sprouting behavior involves differential regulation of gibberellin metabolism genes. Plant Cell Physiol. 2012;53:64–80.
Chen W, Gao Y, Xie W, Gong L, Lu K, Wang W, et al. Genome-wide association analyses provide genetic and biochemical insights into natural variation in rice metabolism. Nat Genet. 2014;46:714–21.
Zhao H, Yao W, Ouyang Y, Yang W, Gong W, Wang GW, et al. RiceVarMap: a comprehensive database of rice genomic variations. Nucleic Acids Res. 2014;43. doi:10.1093/nar/gku894.
Lippert C, Listgarten J, Liu Y, Kadie CM, Davidson RI, Heckerman D, et al. FaST linear mixed models for genome-wide association studies. Nat Methods. 2011;8:833–5.
Li MX, Yeung JM, Cherny SS, Sham PC. Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets. Hum Genet. 2012;131:747–56.
Gu XY, Kianian SF, Foley ME. Phenotypic selection for seed dormancy introduced a set of adaptive haplotypes from weedy into cultivated rice. Genet. 2005;171:695–704.
Cai HW, Morishima H. Genomic regions affecting seed shattering and seed dormancy in rice. Theor Appl Genet. 2000;100:840–6.
We deeply thank Mr. J.B. Wang for his excellent work in field management. This work was supported by grants from the National Special Program for Research of Transgenic Plant of China (2011ZX08009–001–002) and the Natural Science Foundation of Hubei Province, China.
The authors declare that they have no competing interests.
YZX designed the work. RAM collected the phenotype and genotype data. RAM and HZ analyzed the dataset. RAM and YZX wrote the manuscript. All authors read and approved the final manuscript.
Neighbor-joining tree of the 350 rice accessions with reference to GP: A neighbor-joining tree showing the divergent groups of the 350 rice accessions used in this study with reference to germination percentage (GP). (PDF 123 kb)
List of the twenty most dormant accessions that retained their dormancy in after-ripened seeds. This table contains the names of the twenty most dormant accessions, their sub-population, country of origin and germination percentages in FHS and ARS. (PDF 95 kb)
List of the most dormant accessions that lost their dormancy in after-ripened seeds. This table contains the names of the most dormant accessions, their sub-population, country of origin and germination percentages in FHS and ARS. (PDF 105 kb)
Non-synonymous SNPs in the Sdr4 and GA2ox3 genes in the 350 accessions used in our study. This file contains two tables, (a) and (b). Table (a) contains the non-synonymous SNPs in the Sdr4 and GA2ox3 genes, the positions of the SNPs within the chromosomes, the minor and major alleles of the SNPs and the amino acid changes resulting from the nucleotide change from the major to the minor allele. The table also shows the haplotype diversity within these two genes. Table (b) shows the number of accessions in each sub-population possessing each of the Sdr4 and GA2ox3 haplotypes. (PDF 101 kb)
Estimated effective number of SNPs and significance thresholds in populations: This table shows the effective number of independent SNPs (Me) after a modified Bonferroni correction, calculated using informative SNPs (M) in the whole population and in the Aus, indica and japonica populations. (PDF 85 kb)
Genome-wide association mapping results for germination percentage (GP) of the freshly harvested seeds (FHS) in the whole, Aus, indica and japonica populations. The figure shows the neighbor-joining tree, a histogram of the phenotypes (GP), quantile-quantile plots of the expected null distribution against the observed P-values, and Manhattan plots of GP of freshly harvested seeds in the populations using the LMM and LR methods. (PDF 795 kb)
Genome-wide association mapping results for germination percentage (GP) of the after-ripened seeds (ARS) in the whole, Aus, indica and japonica populations. The figure shows the neighbor-joining tree, a histogram of the phenotypes (GP), quantile-quantile plots of the expected null distribution against the observed P-values, and Manhattan plots of GP of after-ripened seeds in the populations using the LMM and LR methods. (PDF 823 kb)
About this article
Cite this article
Magwa, R.A., Zhao, H. & Xing, Y. Genome-wide association mapping revealed a diverse genetic basis of seed dormancy across subpopulations in rice (Oryza sativa L.). BMC Genet 17, 28 (2016) doi:10.1186/s12863-016-0340-2
- Seed dormancy
- Germination percentage
- Association mapping
- Haplotype analysis