Comparative mapping combined with homology-based cloning of the rice genome reveals candidate genes for grain zinc and iron concentration in maize

Background Grain zinc and iron concentration is a complex trait that is controlled by quantitative trait loci (QTL) and is important for maintaining body health. Despite the substantial effort that has been put into identifying QTL for grain zinc and iron concentration, the integration of independent QTL is useful for understanding the genetic foundation of traits. The number of QTL for grain zinc and iron concentration is relatively low in a single species. Therefore, combined analysis of different genomes may help overcome this challenge. Results As a continuation of our work on maize, meta-analysis of QTL for grain zinc and iron concentration in rice was performed to identify meta-QTL (MQTL). Based on MQTL in rice and maize, comparative mapping combined with homology-based cloning was performed to identify candidate genes for grain zinc and iron concentration in maize. In total, 22 MQTL in rice, 4 syntenic MQTL-related regions, and 3 MQTL-containing candidate genes in maize (ortho-mMQTL) were detected. Two maize orthologs of rice, GRMZM2G366919 and GRMZM2G178190, were characterized as natural resistance-associated macrophage protein (NRAMP) genes and considered to be candidate genes. Phylogenetic analysis of NRAMP genes among maize, rice, and Arabidopsis thaliana further demonstrated that they are likely responsible for the natural variation of maize grain zinc and iron concentration. Conclusions Syntenic MQTL-related regions and ortho-mMQTL are prime areas for future investigation as well as for marker-assisted selection breeding programs. Furthermore, the combined method using the rice genome that was used in this study can shed light on other species and help direct future quantitative trait research. In conclusion, these results help elucidate the molecular mechanism that underlies grain zinc and iron concentration in maize.


Background
Zinc and iron are essential micronutrients for all living organisms and play important roles in maintaining life. Zinc and iron deficiencies lead to serious diseases such as low immunity, stunted growth, and iron-deficiency anemia [1]. According to the World Health Organization (2002), zinc and iron deficiencies are the top-ranked health risk factors in developing countries [2]. It is estimated that about 30% and 60% of the world's population suffers from diseases that are caused by zinc deficiency and iron deficiency, respectively [3][4][5]. Biofortification is the improvement of the concentration of essential minerals and vitamins in major staple crops through conventional plant breeding and modern biotechnology. This, combined with increasing the daily intake of such crops, has proven to be the most economical and sustainable approach for relieving micronutrient deficiency in the last decade worldwide [6][7][8].
Understanding the genetic mechanisms behind biofortified traits is the first step in biofortification. Over the past few years, some loci that are responsible for zinc and iron concentration-related traits have been detected through quantitative trait loci (QTL) mapping in various kinds of crops, in particular in grains of major staple foods such as rice (Oryza sativa L.) [9][10][11][12][13][14][15][16] and maize (Zea mays L.) [17][18][19][20], which have been shown to contain low levels of micronutrients. However, previous results that pertained to the genomic location, confidence intervals or total variance explained by QTL were inconsistent because of different genetic backgrounds, environments, and/or mapping methods. Therefore, comparative analysis of QTL that are revealed by independent experiments has become a popular research topic with substantial challenges.
Instead of manually compiling a large amount of QTL information, meta-analysis has been shown to be an effective tool for integrating and re-analyzing such data [21]. Using this method, the number of "real" QTL that were represented by QTL detected in different studies could be calculated and the refined position and the reduced confidence interval of the "real" QTL could be estimated. Meta-analysis has been used in different species to analyze a wide variety of traits, including grain yield and its related traits, flowering time and photoperiod sensitivity, drought tolerance, disease resistance, cold stress, nitrogen use efficiency, grain moisture, root and leaf architecture traits, fiber quality, oil content, and plant maturity traits [22][23][24][25][26][27][28][29][30][31][32][33][34][35][36][37][38][39]. We previously performed a meta-analysis on zinc and iron concentration in maize grains, and 10 meta-QTL (MQTL) were found [17]. MQTL could increase the accuracy and pace of genetic improvement of crops.
In the meta-analysis of grain zinc and iron concentration in maize, we found that the number of QTL is far less than those that are related to easily available traits such as plant height, because the phenotypic values of such traits are difficult to quantify. Fortunately, previous studies have shown that there is an extensive synteny between maize and rice genomes [40]. Therefore, combined analysis of the two species is an alternative way to use limited resources. Comparative mapping that uses common genetic markers to reveal synteny among different species is an ideal way to integrate the genetic information of independent genomes [41]. Conserved chromosome regions for important agronomic traits of maize and rice have been reported by comparative mapping of QTL in maize and rice [42,43]. Comparative mapping of MQTL with higher reliability could accurately uncover the conserved synteny for traits of interest. However, to our knowledge, no published study has compared MQTL.
Therefore, in this study, we combined comparative mapping with homology-based cloning using MQTL for grain zinc and iron concentration in maize (mMQTL) and rice (rMQTL) to predict candidate genes for maize. First, a meta-analysis on published QTL that control grain zinc and iron concentration-related traits in rice was performed to detect MQTL in this species. Then, these were compared with grain zinc and iron concentration MQTL in maize, which was previously reported by us through comparative mapping to identify the conserved synteny. Furthermore, positions of MQTL for maize zinc and iron concentration in grains and maize orthologs of rice zinc and iron metabolism-related genes were compared to reveal the relationship between these genes and the natural variation of this trait. Finally, phylogenetic degeneration of maize orthologs of the rice natural resistance-associated macrophage protein (NRAMP) gene family was elucidated to provide a foundation for further functional characterization.

QTL meta-analysis for zinc and iron concentration in rice grains
Meta-analysis was conducted to integrate and refine QTL for grain zinc and iron concentrations in rice when 74 of the 90 collected QTL were projected onto the consensus map. According to the definition of metaanalysis, chromosome regions that contained only one QTL were ignored during the analysis, which resulted in 63 QTL that were involved in integration. In total, 22 rMQTL were distributed across all rice chromosomes except chromosomes 10 and 11: three rMQTL on chromosomes 1, 2, 3, 7, and 8; two rMQTL on chromosomes 5 and 6; and one each on chromosomes 4, 9, and 12 ( Figure 1).
Detailed information about rMQTL is provided in   narrower than the mean confidence intervals of their respective original QTL. At three rMQTL, rMQTL3.3, rMQTL7.1, and rMQTL8.2, the confidence intervals were less than 10 cM. The phenotypic variance of the rMQTL varied from 6.72% (rMQTL1.1) to 24.30% (rMQTL5.2), and at 12 of the 22 rMQTL, the phenotypic variance was greater than 10%. In general, the rMQTL were represented by several original QTL that were associated with both grain zinc concentration and grain iron concentration.

Syntenic MQTL-related regions between maize and rice
Comparative mapping of MQTL for grain zinc and iron concentration between maize and rice was performed to study the conserved synteny for such traits when respective MQTL data were available through metaanalysis. In total, four syntenic MQTL-related regions with more than two common markers were received: mMQTL2.1 on maize chromosome 2 was co-linear with rMQTL7.1 on rice chromosome 7 (Figure 2a), mMQTL3 on maize chromosome 3 was co-linear with rMQTL1.1 and rMQTL1.3 on rice chromosome 1 (Figure 2b), mMQTL5 on maize chromosome 5 was co-linear with rMQTL2.2 on rice chromosome 2 (Figure 2c), and mMQTL9.2 on maize chromosome 9 was co-linear with rMQTL3.1 on rice chromosome 3 ( Figure 2d). Extensive database searching for common markers that were associated with maize and rice MQTL maps was carried out to seek the functional annotation information. An overgo probe, pco110312/AY107242, which is located in the intervals of mMQTL9.2 and rMQTL3.1, was able to anchor on the following metal transport protein-coding genes: GRMZM2G178190 in maize and OsNRAMP2, which belongs to the NRAMP gene family in rice (Figure 2d). Sequence alignment indicated that the protein sequence of the two genes showed very high identity (92%). Other common markers, however, had no functional information that was related to the target trait we studied.

Characterization of the ortho-mMQTL
A total of 38 maize orthologs of rice zinc and iron metabolism-related genes were obtained through a homology-based cloning method, and their detailed information is listed in Table 2. After comparing the positions of mMQTL and maize orthologs of wellcharacterized rice genes, three ortho-mMQTLs that contained orthologs were discovered. The genomic region of ortho-mMQTL2.1 possessed the following maize orthologs: GRMZM2G085833 of the rice-cloned gene, OsYSL6, which belongs to the yellow stripe1-like (YSL) gene family; GRMZM2G366919 of the rice-cloned gene, OsNRAMP1, which belongs to the NRAMP gene family; and GRMZM2G175576 of the rice clone-gene, OsHMA3, which belongs to the heavy metal ATPase (HMA) gene family. The genomic region of ortho-mMQTL3 possessed the following maize orthologs: GRMZM2G063306 (ZmTOM1) of the rice-cloned gene OsTOM1 and GRMZM2G057413 of the rice-cloned gene OsIRO2, which is a basic helix-loop-helix transcription factor. Additionally, the genomic region of ortho-mMQTL10 that possessed the maize ortholog GRMZM2G026391 of the rice-cloned gene OsYSL16 also belonged to the rice YSL gene family.
In comparison, ortho-mMQTL2.1 has attracted a substantial amount of attention because it is a "hot spot" of maize orthologs of rice genes and also because of the synteny between mMQTL2.1 and rMQTL7.1 that was revealed by comparative mapping. Additionally, the rice gene OsNRAMP1, which is located in the interval of MQTL7.1, is homologous with GRMZM2G366919, which is a maize ortholog that is located in the region of mMQTL2.1. Therefore, mMQTL2.1 and rMQTL7.1 were co-linear and contained a pair of homologous genes, GRMZM2G366919/OsNRAMP1.

Identification and analysis of maize NRAMP genes
Because of the homology of the two pairs of genes in maize and rice, GRMZM2G366919/OsNRAMP1 and GRMZM2G178190/OsNRAMP2, and their significant association with the natural variance of grain zinc and iron concentration, members of the NRAMP gene family in maize were searched, and a phylogenetic tree was built to elucidate the relationship between the gene function and genome evolution as well as provide a foundation for further functional characterization.
Eight putative genes in the maize genome were identified using reported NRAMP proteins from Arabidopsis thaliana as database queries. The phylogenetic tree was then constructed when all of the maize NRAMP proteins were aligned with the A. thaliana and rice NRAMP proteins ( Figure 3). The NRAMP genes were divided into two groups based on the phylogenetic relationships: Class I and Class II. Most of the maize (5 of 8) and rice (5 of 7) NRAMP genes were categorized into Class I. A few were categorized into Class II. For A. thaliana, a model eudicot, the opposite occurred. A phylogenetic analysis showed that GRMZM2G366919, which is closely related to OsNRAMP1, was placed into Class I, a class which also contained AtNRAMP1, 6 and OsNRAMP3, 4, 5, 6. GRMZM2G178190, which is closely related to OsN-RAMP2, was categorized into Class II, a class which also contained AtNRAMP2, 3, 4, 5 and OsNRAMP2, 7.

Meta-analysis for QTL integration
Grain zinc and iron concentration is a polygenic trait that is controlled by QTL. Quantifying this trait is time   consuming, laborious, and expensive. Consequently, comparing QTL for traits that are identified by independent experiments is important. Meta-analysis has been shown to be effective for QTL integration, and consensus QTL, with more accurate positions and reduced confidence intervals, could be provided [23]. In this study, a total of 90 collected QTL for zinc and iron concentration in rice grains were integrated into 22 rMQTL with a 65% decrease in total QTL through meta-analysis. The confidence intervals of rMQTL decreased by 29% to 75% compared with corresponding mean confidence intervals of several initial QTL. We have previously conducted a meta-analysis on this trait in maize. Similarly, the 64% decrease in total QTL and 29% to 83% decreases in confidence intervals of mMQTL were achieved [19]. The genetic and physical intervals of MQTL could even be reduced to approximately 2 cM and 500 kb, respectively, in the meta-analysis for grain yield QTL that were detected in grasses during agricultural drought [25]. Therefore, meta-analysis can effectively synthesize and refine multiple independent QTL that are detected under different genetic backgrounds, population types and sizes, mapping statistics, and even phenotypic methodologies. The precise position and reduced confidence intervals for MQTL will pave the way for further QTL fine mapping and map-based cloning.
In addition to integrating independent QTL, metaanalyses can also reveal the genetic correlations among different traits. In a meta-analysis of QTL for leaf architecture traits, four MQTL were identified for three or four traits [38]. In accordance with previous knowledge that plant digestibility is associated with cell wall composition in maize, meta-analysis of QTL for the two traits showed that 42% of MQTL for digestibility had confidence intervals that overlapped with MQTL for cell wall composition traits [93].
In the current study, most rMQTL for grain zinc and iron concentration in rice were found to include QTL of both traits. Furthermore, in maize, meta-analysis of QTL for the same traits also showed that 8 of 10 mMQTL involved the two QTL traits, simultaneously. The correlation of grain zinc concentration and grain iron concentration at the molecular level strongly indicates that the variation loci responsible for the two traits were co-localized in both maize and rice genomes, or even in other species. MQTL for multiple traits could facilitate the genetic improvement through markerassisted selection breeding programs.

Synteny of grain zinc and iron concentration between maize and rice
There is a well-known evolutionary relationship between maize and rice, which are two major Gramineae species. Comparative mapping of QTL is useful for revealing the syntenic relationships of target traits among different species. For example, comparative analysis revealed that QTL for important agronomic traits, including plant height, number of rows, and kernels per row, are extensively conserved in the syntenic genomic regions of maize and rice [44,45]. In this study, comparative mapping for MQTL that control grain zinc and iron concentration in maize and rice was performed, and four syntenic MQTL-related regions were found. Moreover, the pco110312 overgo probe linked mMQTL9.2 and rMQTL3.1, which are syntenic MQTL-related regions, can anchor onto metal transport protein-coding genes, GRMZM2G178190 and OsNRAMP2. Although no candidate gene was found in other syntenic MQTL-related regions, they provided a foundation for future candidate gene mining. Therefore, the results here illustrate that grain zinc and iron concentration are syntenic between maize and rice, and the syntenic MQTL-related regions are reliable for subsequent analysis.
Based on the comparative mapping results, the four syntenic MQTL-related regions discussed aboved all had relatively broad intervals, which indicating that it was easier to find the respective syntenic region in the other species when MQTL had large confidence intervals. These results could provide a foundation for future research on these MQTL. Because of the narrowed intervals, no syntenic regions were found in MQTL with small confidence intervals. However, some of those MQTL, such as mMQTL2.2 and rMQTL8.2, integrated multiple initial QTL and explained a large percent of phenotypic variation, could provide insight into detection of new functional genes that underlie grain zinc and iron concentration.

Homology-based cloning of maize grain zinc and iron concentration-related genes
Only one candidate gene for grain zinc and iron concentration in maize was discovered in the four conserved genomic regions. Only one gene may have been discovered because the online comparison is limited by the data that are available in public databases. Nevertheless, some rice functionally-characterized zinc and iron metabolism-related genes can be used for homologybased cloning of maize genes. Therefore, the positions of mMQTL and maize orthologs of rice-cloned genes were compared to validate the function of those genes for grain zinc and iron concentration variation in maize. Three ortho-mMQTLs with candidate genes were found. In particular, ortho-mMQTL2.1, which contained GRMZM2G366919, was co-linear with rMQTL7.1, and the corresponding orthologous gene, OsNRAMP1, was located in the genomic region of rMQTL7.1.
In a similar comparison of locations between maize orthologs of rice yield genes and MQTL, three candidate loci for maize yield were successfully predicted [94]. By mapping maize orthologs of rice-and A. thaliana-cloned genes that are associated with leaf architecture traits on the consensus map before OTL meta-analysis, Ku et al. also discovered candidate genes for the traits that they studied [38]. Overall, functionally-characterized genes in rice, which is a model species of Gramineae, could be used to identify and analyze candidate genes in maize or other grasses.
Characterization of the maize NRAMP gene family NRAMP was first identified in rat macrophages as a resistance gene to intracellular pathogens that transport iron [95]. Subsequently, many homologues of rat NRAMP that transport various cations, not merely iron, were characterized in plants. NRAMP genes are, in general, associated with membrane-spanning proteins [96] and widely distributed both in graminaceous and non-graminaceous species. To date, a total of 6 and at least 7 NRAMP genes have been cloned and some of them have been wellcharacterized in A. thaliana and rice, respectively.
In this study, two candidate genes in maize, GRMZM2G366919 and GRMZM2G178190, were identified as being associated with the natural variation of grain zinc and iron concentration through comparative mapping of MQTL combined with a homology-based cloning method with the rice genome. Based on their homology with rice NRAMP genes, members of the maize NRAMP gene family were mined, and a phylogenetic analysis of NRAMP genes in A. thaliana, rice, and maize was carried out to determine the evolutionary relationships among the genes. GRMZM2G366919, which is included in Class I, is closely related to OsNRAMP1, which participates in the control of iron, cadmium, and aluminum homoeostasis in rice [72,73,97]. OsNRAMP5, similar to OsNRAMP1, is relatively closely related to GRMZM2G366919, which contributes to iron, cadmium, and manganese transport in rice [75,76,98]. Interestingly, AtNRAMP1, which is also contained in Class I, is an iron transporter in A. thaliana and is able to rescue both low and high ironsensitive phenotypes of the yeast mutant fet3fet4 [97]. GRMZM2G178190 and OsNRAMP2 are classified into Class II and are most closely related to each other, and OsNRAMP2 was predicted to be a metal homeostasis gene in rice, although its specific function has not yet been clarified [99,100]. It is also worth noting that, with in Class II, AtNRAMP3 and AtNRAMP4 are capable of transporting iron, cadmium, and manganese in A. thaliana [101,102], and AtNRAMP3 disruption can increase the accumulation of zinc in roots under iron starvation [103]. Therefore, the phylogenetic analysis demonstrated that GRMZM2G366919 and GRMZM2G178190 might be responsible for zinc and iron metabolism in maize and might be more likely to regulate their accumulation in grains.

Implications for quantitative trait genetic research
Zinc and iron concentration in grains is undoubtedly a complex agronomic trait and plays a vital role in maintaining human health. However, the genetic basis of grain zinc and iron concentration remains obscure, despite many studies that have been conducted to identify QTL or genes that underlie this trait. We performed meta-analysis of QTL for grain zinc and iron concentration in rice in the present study and maize in a previous study [17] to detect the respective MQTL. However, in this study, to eliminate the limitation imposed by the lack of genetic information from one genome, we combined comparative mapping and homology-based cloning with the rice genome.
The MQTL allowed mining of candidate genes for grain zinc and iron concentration in maize. Two maize orthologs of rice NRAMP genes validated the power and effectiveness of the combined method that we adopted. Additionally, the combined method, as well as the wellstudied rice genome employed here, can be extended to research on other species or complex traits.

Conclusion
Enriching the concentration of zinc and iron in edible parts of major crops is an effective way to relieve malnutrition that is caused by zinc and iron deficiencies, and determining the molecular basis of grain zinc and iron concentration is a prerequisite for biofortification. Metaanalysis of QTL for very complicated traits such as grain zinc and iron concentration is important and useful. MQTL that are the integration of multiple independent QTL, with more precise locations and reduced confidence intervals, are useful for facilitating subsequent research. Candidate genes that were retrieved from the combination of comparative mapping of MQTL and homology-based cloning techniques could be used to reveal the molecular mechanisms that underlie zinc and iron concentration in maize grains. Syntenic MQTLrelated regions and ortho-mMQTLs that contain candidate genes could be used for further fine mapping and map-based cloning.

QTL meta-analysis
Three steps were required for conducting the metaanalysis to identify MQTL. First, a bibliographic review on the mapping of QTL for zinc and iron concentrationrelated traits in rice grains was performed. The QTL information was collected from published reports including journal articles and dissertations. In all, eight reports involving nine mapping populations and 90 QTL were compiled. The details of those studies are provided in Table 3. Second, a consensus map that was integrated from multiple independent genetic linkage maps was built. The rice genetic linkage map Cornell SSR 2001 was selected as a reference map on which the maps of 8  [104]. Third, a meta-analysis of QTL clusters on each chromosome was launched to detect MQTL. The modified Akaike's information criterion (AIC) was used to select the QTL model; the model with the lowest AIC value was chosen as the best model, indicating the most likely number of "real" QTL on each chromosome [21]. Biomercator v2.1 was used to construct the consensus map with the "map projection" function and to conduct meta-analysis with the "meta-analysis" function [105].

MQTL comparative mapping
Integrated MQTL for grain zinc and iron concentrationrelated traits in rice were compared with MQTL for the same traits in maize. The CMap program on the Gramene (http://www.gramene.org/) was used to investigate the synteny of grain zinc and iron concentration in the two species. Maize was selected as the reference species using IBM2 2008 Neighbors as the reference map and then the rice physical map, Gramene Annotated Nipponbare Sequence 2009, was added as a comparative map with rMQTL anchored first. In this study, comparative maps with fewer than three common markers were discarded. To facilitate the description, MQTL for grain zinc and iron concentration in maize, which we have previously reported [17], were renamed (Table 4). Common markers that linked the two genomes were searched for (primarily in GeneBank, http:// www.ncbi.nlm.nih.gov/genbank/, and Gramene) to identify their genomic annotation information.

Ortho-mMQTL mining
Detailed information on 33 cloned rice zinc or iron metabolism-related genes, including NAS, NAAT1, DMAS1, TOM1, YSL, ZIP, NRAMP, HMA, MTP, FRDL, VIT, and IRO was retrieved from the Rice Genome Annotation Project database (http://rice.plantbiology.msu. edu/). Maize orthologs of the 33 rice genes were identified by searching the databases of the Rice Genome Annotation Project, NCBI (http://www.ncbi.nlm.nih.gov/), B73 maize sequence (http://www.maizesequence.org) and Phytozome (http://phytozome.jgi.doe.gov/pz/portal.html) using the BLAST program. Their physical locations were identified using the maize genome browser, MaizeGDB (http://www.maizegdb.org/). Subsequently, the positions of mMQTL and maize orthologs were specifically compared to reveal the relationship between maize orthologs of rice zinc or iron-metabolism related genes and the natural variance of zinc and iron concentration in maize grains. In this study, mMQTL-possessing maize orthologs of rice zinc or iron metabolism-related genes were temporarily called ortho-mMQTL.

Maize NRAMP genes identification and phylogenetic analysis
Members of the maize NRAMP gene family were identified using the BLASTP program in the Phytozome database by employing the protein sequence of previously identified A. thaliana NRAMP genes as queries. The threshold of e-value and identity for the BLASTP program were set at 1e-80 and >75%, respectively. In addition, protein motifs were searched for in the Pfam database (http://pfam.sanger.ac.uk) to confirm the candidate sequence that encodes NRAMP proteins. Multiple alignments of NRAMP proteins from maize, rice and A. thaliana were performed using the ClustalX program [106]. The phylogenetic tree was constructed using MEGA v4.0 software with the neighbor-joining (NJ) method and 1,000 bootstrap replicates [107].