- Research article
Molecular ecology and selection in the drought-related Asr gene polymorphisms in wild and cultivated common bean (Phaseolus vulgaris L.)
BMC Geneticsvolume 13, Article number: 58 (2012)
The abscisic acid (ABA) pathway plays an important role in the plants’ reaction to drought stress and ABA-stress response (Asr) genes are important in controlling this process. In this sense, we accessed nucleotide diversity at two candidate genes for drought tolerance (Asr1 and Asr2), involved in an ABA signaling pathway, in the reference collection of cultivated common bean (Phaseolus vulgaris L.) and a core collection of wild common bean accessions.
Our wild population samples covered a range of mesic (semi-arid) to very dry (desert) habitats, while our cultivated samples presented a wide spectrum of drought tolerance. Both genes showed very different patterns of nucleotide variation. Asr1 exhibited very low nucleotide diversity relative to the neutral reference loci that were previously surveyed in these populations. This suggests that strong purifying selection has been acting on this gene. In contrast, Asr2 exhibited higher levels of nucleotide diversity, which is indicative of adaptive selection. These patterns were more notable in wild beans than in cultivated common beans indicting that natural selection has played a role over long time periods compared to farmer selection since domestication.
Together these results suggested the importance of Asr1 in the context of drought tolerance, and constitute the first steps towards an association study between genetic polymorphism of this gene family and variation in drought tolerance traits. Furthermore, one of our major successes was to find that wild common bean is a reservoir of genetic variation and selection signatures at Asr genes, which may be useful for breeding drought tolerance in cultivated common bean.
Common bean (Phaseolus vulgaris L.) is the most important food legume in terms of providing directly consumed nutrients and dietary protein in developing countries of Latin America, Africa as well as in traditional diets of the Middle East and the Mediterranean with over 23 M tons grown around the tropics, sub-tropics and temperate zones for anywhere from on-farm to local market, and within-country consumption or exports . Common bean is usually grown in areas with sufficient rainfall but has also extended to regions where drought is endemic and supplemental irrigation is scarce such as in northeastern Brazil, coastal Peru, the central and northern highlands of Mexico, and in lower elevations of East Africa  as well as the western plains of the United States and Canada . Therefore, increasing drought tolerance through common bean breeding has become a common goal of national and international breeding programs.
Relatively few sources of drought tolerance have been identified in common bean compared to other legumes. Most studies have concentrated on advanced lines from a few commercial classes  and not from the wild and landrace collections which are considerable reservoirs of naturally-adapted genotypes for drought-stress environments. Therefore, searching wild and cultivated collections of common bean is another goal of plant breeding programs and has recently been assigned a high level of funding within the context of the food security programs of the International Agricultural Research Centers. Initial testing has shown high variability for drought tolerance traits and certainly common bean contains a lot of allele richness [5, 6].
Drought tolerance is a genetically, physiological and mechanistic complex trait. In terms of genetics, the multiple individual traits that make up drought tolerance are usually inherited quantitatively with very few major genes for drought tolerance mechanisms known, although Blair et al. did find some quantitative trait loci for drought tolerance. Epigenetic and environmental components of drought stress exist, as well. One transcription factor that is often involved in signaling of drought stress is abscisic acid whose levels are often correlated with plant parts and whole plants that are suffering from drought stress . Some of the mechanisms of drought tolerance are controlled through an ABA responsive pathway , while others are independent of ABA [8, 10]. In particular, the transcription factors of the Asr (abscisic acid, stress, ripening induced) family of genes are plant-specific and stress-regulated components of the ABA- dependent pathway, with further proof of their role found in their interaction with ABRE elements [11, 12]. Sucrose synthase genes are thought to be downstream Asr genes . The number of Asr genes found in plant genome databases varies from one in Vitis vitifera, four in Brachypodium distachyon, six in Oryza sativa, and up to seven in Sorghum bicolor. Expression analysis have demonstrated their explicit role in conferring increased drought and salt tolerance in tomato, rice and lilies [14–16] but to date no analysis of their role in the legumes has been put forward.
In this regards, diversity analysis of the ASR family has been illustrative of adaptive selection of crop plants to help deal with environmental conditions. For example, studies of the extent of nucleotide diversity in Asr genes in Solanaceae species and in wild and cultivated rice provided some evidence of non-neutral evolution, adaptive, or demographic events in dry areas [14, 17, 18]. Moreover, genetic mapping of Asr1 co-localized this gene with QTLs for xylem sap ABA content, for anthesis–silking interval responsive to mild water deprivation and for leaf senescence in maize [19, 20]. Finally, Maskin et al. showed a DNA-binding activity, Konrad & Bar-Zvi  revealed that the unstructured form of tomato ASR1 proteins presented a chaperone-like activity that stabilized other proteins against denaturation caused by heat and freeze–thaw cycles, and Cakir et al. described an association with a grape hexose transporter promoter.
Common bean is a good model to study drought related candidate genes and especially the less complex Asr family because of its rich evolutionary history in the wild across two continents (South and Central America) and multiple domestication process (in the Andean and Mesoamerican regions of the Americas). Wild Phaseolus vulgaris beans are diverse in the western hemisphere ranging from temperate Argentina to dry land parts of Mexico. However, wild beans are thought to have evolved from an original gene pool in Ecuador and northern Peru, after which radiation to various regions north and south of there, gave rise to an Andean, a Colombian, and a Mesoamerican gene pool . The Andean and Mesoamerican wild beans were then subjected to domestication in each region giving rise to cultivars of both gene pools [25–27].
Mesoamerican beans were domesticated in the region of Jalisco , although this does not preclude more than one domestication event in another part of Mesoamerica giving rise to a diversity of chloroplast haplotypes . For the Andean gene pool, southern Bolivia may have been the center of domestication  with introgression from the wild occurring in the extension of cultivated types northwards towards the equator. Both domestications occurred 5,000–8,000 years ago . Therefore in common bean, populations structure is divided into gene pools (Andean and Mesoamerican for cultivated beans and four or more groups in wild beans); while additional structure within each of these gene pools is then found. Within the cultivated Andean gene pool the races Nueva Granada, Peru and Chile are identifiable [5, 32–35]. Within the cultivated Mesoamerican gene pool the race Mesoamerica, the complex Durango-Jalisco and the race Guatemala are observable [5, 36].
As mentioned above, the cultivated gene pool structure contrasts with the structure obtained for wild common bean in which four main clusters are seen: the Colombo-Mesoamerican, the Mexican, the Andean and the Peruvian-Ecuadorian . This simplistic model has been further challenged by results from Kwak et al. where wild common beans from Central America were divisible into different groups including those from Guatemala and Costa Rica.
Introgression between gene pools, between the races and between cultivated and wild genotypes has been a historical, long-term and re-iterative process [34, 37, 38]. However, the structure of the wild populations is maintained by geographic barriers along the length of the Andean to Mesoamerica arc of mountains and varying terrain. In terms of habitat ecology, race Durango-Jalisco is the only one of the groups of races that has significant drought tolerance, with part of this group distributed in semi-arid areas of Mexico ; race Chile has adaptation to relative drier areas as well, but is only found in the southern Andes [35, 40]. Races Mesoamerica and Guatemala, or Nueva Granada and Peru occupy low to mid altitude or highland regions of Latin America, respectively [5, 41, 42]. Some highland and mid-elevation sites are also drought susceptible especially in the equatorial regions where bimodal rainfall events are often short. Although cultivars from the Durango-Jalisco complex have the highest level of drought tolerance, one may expect to find still higher levels in certain wild germplasm .
Nucleotide diversity surveys are powerful tools for the study of reference collections of cultivated and wild genotypes that allow population genetic tests to be made for departure from the neutral equilibrium models and to identify the diverse selective modes that shaped the evolution of specific genes . For instance, sequence information for specific genes can show their genealogy and suggest how different groups of accessions evolved from the wild and how this influenced the genotypes involved in domestication events. Therefore, the particular role of duplication, lineage sorting, sub-functionalization, sub-speciation and ecological constrains on gene evolution can be inferred. Furthermore, population structure, SNP diversity and phenotype association are other activities which can we undertaken .
The specific goals of this research were to evaluate the allele diversity of the Asr1 and Asr2 genes in wild and cultivated common bean and to determine (1) the extent of haplotype diversity, (2) the allele distribution in relation with the gene pool origins and probable drought tolerance based on geographic origin, (3) the differences at these candidate genes between wild and cultivated common beans, and (4) the patterns of nucleotide variation as related to local adaptation to ecological environments.
Structure of the ASR family
There are two members of the ASR gene (PF02496) in common bean that could be distinguished by the primers we used here. Both of them have two exons and one intron. The Asr2 gene is smaller than the Asr1 gene. Asr2 includes 100 bp upstream of the ABA domain whilst Asr1 includes 550 bp. In addition, the intron in Asr2 (179 bp) is smaller than the intron in Asr1 (565 bp) (Figure 1). This structure was confirmed with the alignment between common bean ESTs (TC2798 and CA910244) and our sequenced region.
Polymorphism and deviation from neutrality at the Asrgenes
Molecular variation in the wild bean collection was significantly structured and all analyzed regions of both genes were related to the sub-population structure except for Asr2. The G st , F st and Snn values were 0.344, 0.495 and 0.256 for Asr1, and 0.167, 0.426 and 0.350 for Asr2, showing the higher differentiation in the first gene compared to the second gene. Recombination parameters per gene  and minimum number of recombination events using the four-gamete test were higher for the intron region of each gene than for the exon regions of the genes (0.8 and 0.001 respectively). In general, Asr genes presented low polymorphism as reflected by PIC (polymorphic information content) values for SNPs, although these were higher on average for Asr2 than for Asr1, and much higher in the wild collection than in the cultivated collection as would be expected by a domestication bottleneck. Polymorphic sites were generally more common in introns and non-conserved domains than in exons or conserved regions (Table 1).
In total there were similar numbers of SNPs in each gene, however in Asr1 more were in the wild accessions compared to Asr2 where the SNPs were equally divided between cultivated and wild. Interestingly, all mutations in coding regions of Asr1 and Asr2 genes were synonymous mutations. Additionally, Asr2 polymorphic sites distinguished DOR364 from the other three cultivars used as controls, while Asr1 did not distinguish between Andean or Mesoamerican controls, although Tajima’s D varied between the gene pools (Table 2).
Deviations from Wright-Fisher neutrality were significant in the wild collection for Asr2 when the analysis was carried out on a global basis, without considering population structure, and for Asr1 when the analysis was carried out taking into account Mesoamerica and Guatemala populations (Table 3). Tajima’s D was positive in the first case and negative in the second one. The number of polymorphic sites found in Mesoamerican and Guatemala populations was higher than that found in Ecuador-Northern Peru and Andean populations. Moreover, no significant deviations were observed in other populations or gene pools. Ramos-Onsins & Rozas‘R2 values, which test for population growth, were not significant except for the Mesoamerican wild population.
π values for Asr1 and Asr2 were compared against the background distribution from Cortés et al. for non-drought related genes. Despite this selection of random genes, the background distribution was not neutral and tended to inflate π values, as would be expected where some of the random genes were important for sub-population selection. Asr2 fell within the range of random genes. However, Asr1 presented extreme lower values in relation with the evolutionary background, accounting for a p-value less than 0.001 (Figure 2). This shows that like the background distribution of SNPs, the present SNPs accounted for the gene pool structure.
Globally, Asr2 presented more haplotypes than Asr1. Moreover, haplotypes with low frequency were more common at Asr2 than at Asr1 (Table 4). A hypothetical haplotype was required to analyze variation in the exon region of Asr1. Furthermore, some patterns suggested by the previous section were revealed by total haplotype frequency for each gene pool division was clearer for Asr1, as well as for each sub-population (Figure 3). In particular, Northern Peru population had a unique fixed haplotype of Asr1. The haplotypes with the highest frequency were shared by accessions from Mesoamerican, Guatemala, Colombia and Andean populations for Asr1 and Asr2 genes. A pair of equally high-frequency haplotypes shared by more than two populations was found for Asr2. Interestingly, Ecuador-Northern Peru population did not have any of these haplotypes. A Mesoamerican-Andean gene pool division was clear for both genes, especially for Asr1, as was stated in the previous section.
A qualitative evaluation of the possible association between population structure, drought tolerance, and haplotype and nucleotide diversity of candidate genes was made by comparing the allelic diversity with ecological variation for drought tolerance (Figure 4). Median Joining Networks aided in this inspection in this way, it was possible to identify genetic variation correlated with estimated habitat drought index and drought severity index for the wild and cultivated genotypes, respectively. In particular, the combination of estimated habitat drought stress, drought severity index, population structure, and candidate gene haplotypes in a qualitative analysis revealed four main categories of relationships, as described below.
First, some haplotypes included sequences of accessions from the same wild population but with very different estimated levels of drought stress associated with their habitat, as was the case for some Asr2 haplotypes. Second, some haplotypes included accessions from dissimilar populations that presented a similar estimated habitat drought stress. Third, some accessions had distinct haplotypes but were categorized in a similar rank of estimated drought stress, irrespective of their populations. Fourth and finally, some accessions of the same population had dissimilar haplotypes and disparate drought stress ranks. Visual inspection of allelic variation for these haplotypes along different ecological regions confirmed the previous categories.
Common bean Asr genes are an example of a small gene family with a simple structure and potential role in drought tolerance. This paper is the first attempt to characterize their diversity in common bean. Furthermore, this research integrates different lines of evidence from the coalescent theory and co-evolution analysis to evaluate the role of these genes in drought tolerance. One of our major successes was to find that wild common bean is a reservoir of genetic variation at Asr genes, which may be useful for breeding drought tolerance in cultivated common bean.
Wild common bean is a reservoir of genetic variation at Asrgenes
Traditionally, wild relatives of cultivated plants were not subjected to bottlenecks or selective sweeps during the domestication process, and were not selected subsequently to improve specific traits. Hence, wild gene pools have not been genetically eroded and they conserve much of the original variation developed over millennia for the species . Moreover, they are better adapted to some abiotic stress conditions from their original habitat than cultivars and they present higher levels of exogamy than cultivated accessions. Such adaptations were lost in the transition toward domestication and to field production, and gave way to a superior allocation of plant resources and biomass to yield. Consequently, wild relatives must have higher genetic diversity and phenotypic variability than cultivated individuals, except in the case of diversifying selection that were part of the domestication syndrome (set of traits that were selected during domestication ) in the crop.
This hypothesis of higher wild versus cultivated diversity has been demonstrated in rice [14, 49], common bean [28, 50], dogs [51, 52], chickens , fishes , and many other species. The hypothesis of dissimilar grades of variation between wild and cultivated plants has been reinforced for common bean in the present research using the Asr candidate genes instead of genomic variation as previous studies have done. The candidate gene approach allows comparisons of adaptive variation to be made, and not just neutral polymorphism. This analysis of adaptive variation closes the gap between diversity analysis and functional genomics. Furthermore, the selective and population-based hypotheses used to explain the lack of neutrality are distinguished easily when adaptive and neutral variation is compared in a common framework. As we will discuss extensively in the following sections, a combination of SNP markers at Asr genes and SNP markers at a genome-wide scale, allowed us to conclude dissimilar diversity between wild and cultivated common beans. Different evolutionary imprints in each gene, and divergent correlation between candidate genes and drought tolerance, accounted in all cases for the evolutionary inertia and the confounding effects of population structure. In a first observation, we detected more genetic variation for drought tolerance in wild than in cultivated common beans. The variation found among the cultivated races was considerable, but lower than in wild common beans. Cultivated races span different regions across the Americas in relation with water stress. Therefore, drought tolerance is not expected to be uniform across the cultivated accessions, even though there was presumably not selection for drought tolerance during the domestication syndrome. In addition, some differences existed between the adaptation of wild and cultivated individuals to drought stress. The evaluation of drought physiology traits in wild populations of common bean would have been impractical due to long growth cycle and low biomass . Hence, the ecological analysis we applied was predictive of drought tolerance.
Asrparalogous have experienced different evolutionary patterns
Selective processes, such as purifying selection and local adaptation, imprint regions of the genome in different manners, causing the departure of genetic variation from neutral expectations [49, 51]. Purifying selection is associated with low values of nucleotide diversity (π) and negative values of Tajima’s D. However, recent population bottlenecks tend to achieve the same reduction in nucleotide variation. On the other hand, local adaptation tends to homogenize haplotypes within the same niche, fix polymorphisms between different populations, and eliminate low frequency polymorphism. Consequently, few haplotypes with high frequency are generated, corresponding to high values of nucleotide diversity and global Tajima’s D. Nevertheless, independent domestication events, extensive population structure and population expansions after bottlenecks can also leave the same patterns . In the case of cultivated common bean, at least two independent domestication processes, in the Andes and in Central America, generated extensive population structure and a genome-wide increase in the global nucleotide diversity [27, 33]. That is why we have observed a significant positive, bimodal distribution of π values. However, this pattern was lost when the two gene pools were considered independently. It remains to be determined whether a population expansion after the two independent domestication bottlenecks could be relevant to explaining the observed patterns of gene diversity. We did observe significant population expansion for the Mesoamerican wild population using the Ramos-Onsins & Rozas’ R2 statistic . We could not, however, find the same pattern for the Andean gene pool. We predict that an extensive survey of SNP markers will reinforce further conclusions. Particularly, a well saturated genome-wide mismatch distribution will allow us to confirm the extent of population expansion within Mesoamerican or Andean beans.
For wild common bean, the global neutrality test against the Wright-Fisher neutral model was rejected for Asr2 showing a bias toward high nucleotide diversity. This was not the case for Asr1, where the neutrality test was rejected when made against the evolutionary background, showing a bias toward low nucleotide diversity. Moreover, when the gene pools were analyzed independently a significantly negative Tajima’s D was observed from the Wright-Fisher equilibrium for Asr1 but not for Asr2. This behavior was likely to be a consequence of the fact that the Wright-Fisher neutral model did not account for the population structure or the evolutionary processes of the species, while the background distribution successfully captured the demographic complexity of common bean. Therefore, the evolutionary background of a crop like common bean is the ideal framework to make straightforward comparisons between candidate genes and genome-wide variation, and this is perfectly equivalent to applying the neutrality test to each one of the sub-populations, as well. Both sources of evidence undoubtedly suggest that Asr1 was subjected to purifying selection at least in wild common bean.
This has been a common finding in other genes and species. For example, ABA related transcription factors in wild tomatoes , genes and chromosomes in maize [58, 59], and genomic regions in chickens  and rice , have been associated with local or genomic selective sweeps. Indeed, the implications of these findings have been important because once one knows how selective processes have proceeded then one can identify this pattern elsewhere .
It is necessary to emphasize that population structure and climatic variability are partially correlated because both follow a latitudinal pattern, as was suggested previously [29, 62]. This is particularly true for Asr2 because its variation overlaps with the evolutionary background. Although population structure explains its significant high nucleotide diversity, there is not power to explain an association between Asr2 and the estimated drought tolerance, if any.
The haplotype/allelic analysis accounted for population structure. Therefore, false positive associations that are especially common in haplotype – phenotype correlations can be rejected . Local adaptation explains these discoveries because it imprints the genetic regions with global high nucleotide diversity, and is congruent with ecological (estimated drought tolerance) and genetic (population structure) characteristics as well. Neither population structure nor population expansions after bottlenecks explain per se and en plene both results. Rather it is more likely that purifying selection caused by ecological adaptation on wild sources of domesticates explains the observed variation between wild and cultivated genotypes. Similar patterns have been found in cattle . Genomic signatures of local adaptation have been revealed in Phytophthora infestans and Brugia malayi, and were proposed in trees, as well . More specifically to our study, Asr2 has been associated with adaptive variation for arid habitats in wild tomatoes [11, 18]. Particularly, Fischer et al. found that two genes of the same ASR family (associated with the ABA related drought response pathway) presented dissimilar patterns of nucleotide variation in wild tomatoes, accounting for purifying selection and local adaptation. Hence, our study is the second report of paralogous genes with the ABA domain that present quite different selection signatures in wild relatives of a crop.
Finally, we found signatures of purifying selection and local adaptation in the wild collection. This is a consequence of the enormous genetic and phenotypic variation stored in the wild populations. In addition, there were differential selection patterns between wild and cultivated beans, which explained why we did not detect selective signatures at the cultivated gene pool level, but we did at the wild gene pool level. Wild common bean is a viney annual plant that germinates among small trees and shrubs in forest clearings or in disturbed environments with the onset of seasonal rains . Specifically, the growth cycle of the wild common bean in the tropics and subtropics is from 6 to 10 months in length, where with bimodal rainfall a mid-season dry period occurs that can last 2 to 4 weeks to as long as 3 months on the equator. In response to this mid-cycle drought the wild P. vulgaris enters a survival mode of slow growth and reduced physiological activity, until rainfall resumes and flowering occurs. In contrast, cultivated beans are not subjected frequently to these environmental pressures which may explain less selection for novel variability in drought related genes such as Asr1 and Asr2.
In short, the adaptive importance of Asr1 was inferred because we detected a distortion in its genetic pattern in relation with the evolutionary background. In the future, expression analysis and QTL information will be needed to validate the importance of Asr1 in the context of drought tolerance in common bean. Overall, however, we have found a valuable gene for plant breeding in common bean and have also proven that signatures of selection are good predictors of markers and genes associated with a desired trait. The value of wild beans from dry environments to improve Andean or Mesoamerican cultivars should be explored empirically especially given that wild common beans occupy more geographical regions with extensive drought stress in temperate or high-subtropical zones than cultivated accessions. These regions include the arid parts of the Andes in Peru, Bolivia and Chile, or the infertile valleys of the southwestern Mexico. With this work we also lay the groundwork for the evaluation of wild and cultivated common bean collections to analyze other candidate genes for drought tolerance.
This research suggested the importance of Asr1 in the context of drought tolerance, and constitutes the first steps towards an association study between genetic polymorphism of this gene family and variation in drought tolerance traits. Furthermore, one of our major successes was to find that wild common bean is a reservoir of genetic variation and selection signatures at Asr genes, which may be useful for breeding drought tolerance in cultivated common bean.
Totals of 104 wild common bean accessions and 297 cultivated accessions were considered in this study. These 401 accessions from reference and core collections were selected to be representative samples of the gene pools and races in the cultivated core collection analyzed by Blair et al., and to be subsets of the wild bean core collection developed by Tohme et al..
Control genotypes included the Andean genotypes Calima (G4494) and Chaucha Chuga (G19833), as well as the Mesoamerican genotypes ICA Pijao (G5773) and Dorado (DOR364), with common name and germplasm entry or advanced line name listed in each case in parenthesis. Seed samples for wild accessions were provided by the Genetic Resource Unit (http://isa.ciat.cgiar.org/urg/main.do), while the cultivated reference collection is maintained by the bean program from original stock at the same Unit.
DNA extraction, primer design, DNA amplification and sequencing
Total DNA was extracted in the Germplasm Characterization Laboratory of CIAT . The primers used to amplify the three regions of the Asr1 and Asr2 genes were provided by the ADOC project or designed from ASR gene sequences (Table 5). Amplification conditions for genes Asr2 and Asr1 (region 1 and 2) used thermocycling conditions of a 95°C hot start for 5 min, followed by 35 cycles of 95°C denaturation for 1 min, annealing for 1 min at 62°C, 53°C or 60°C for the Asr1 region 1, Asr1 region2, and Asr2 gene, respectively. All PCR reactions were followed by a 72°C extension for 2 min.
The PCR reactions were carried out in a 25 μl final volume containing 65 ng of genomic DNA, 1 X PCR buffer (1 X: 10 mM of Tris–HCl pH 8.8, 50 mM of KCl, 0.1% of TritonX-100), 0.3 μM (for the Asr1 - region 1) or 0.5 μM (for the Asr1 - region 2 and Asr2 gene) of each of the forward and reverse primers; 2 mM of MgCl2; 0.4 mM of total dNTPs; and 1 U (for the Asr1 – region 1), 1.25 U (for Asr2), or 2 U (for the Asr1 – region 2) of Taq Polymerase (Fermentas).
Size determination of PCR products was carried out by gel electrophoresis through a 1.5% agarose gel using Tris-Borate-EDTA buffer containing SYBR-Green. PCR products were purified using Exo-Sap clean-up reactions. These clean products were used as templates for subsequent Sanger sequencing reactions, using BigDye Terminator v3.1 Cycle Sequencing Kits. The samples were run on an ABI prism 3730 automated sequencer at the Cornell University Biotechnology Resource Center. Base-pair calls, quality score assignment, and construction of contigs were carried out using Sequencher 4.7 (Gene Codes Corp., Ann Arbor, Michigan). Sequences have been deposited at GenBank under accession numbers JX082400 - JX083058.
Gene characterization, domain detection, and protein and nucleotide alignments
Exons and introns were determined through Blastn of the sequenced regions against the well characterized soybean genes and the non-redundant (nr) EST database with a gap open penalty of 5, a gap extension penalty of 2, a match score of 2, and a mismatch score of −3. Coding regions, UTRs, reading frames and conserved domains were determined through Blastx of the new sequences against nr protein database with a gap open penalty of 11, a gap extension penalty of 1 and a BLOSUM62 matrix available from http://blast.ncbi.nlm.nih.gov.
The ASR domain region was confirmed using the Pfam website (http://pfam.sanger.ac.uk/). Nucleotide and protein alignments, as well as Neighbor-Joining trees, were constructed using orthologous and paralogous genes to verify conserved regions. Nucleotide alignments were carried out with MUSCLE algorithm  and Geneious 4.0 software (Biomatters Ltd.). The sequences were also manually examined to verify the quality of the alignment. Haplotype reconstruction was carried out using PHASE software .
Patterns of nucleotide diversity
Allele assignments and frequencies for all the accessions were used to calculate the polymorphic information content (PIC) for each SNP marker according to Anderson et al. using the formula ; were pij is the frequency of the allele j for each marker i. Levels of genetic diversity within domesticated and wild common bean were quantified with measures of nucleotide diversity based on the number of segregating sites (θW)  and based on the average number of nucleotide differences per site between sequences (π) . These calculations were carried out with the program DnaSP 5.10 . G ST , F ST and Snn values were also computed with the same software. Meanwhile, the number of haplotypes and the haplotype diversity (Hd) were calculated with the same software. Median joining haplotype networks were built using Network 4.5.1 .
Neighbor Joining tree construction and nodal support evaluation using 1000 bootstrap replicates were carried out with the program Mega4 . Network trees accounted for population subdivision based on K-values for number of subpopulations as determined by Blair et al.. Drought tolerance for each genotype was based on geographic origin and estimated habitat drought index (according to the formula [PET-P]/PET, where PET and P accounts for potential evotranspiration and precipitation, respectively. PET was calculated following two different approaches based on temperature and radiation [78, 79]) or field testing and an actual drought severity index (according to the index of Rosales et al. applied on 8 × 8 and 12 x 12 lattice trials with three repetitions each and two environments - drought and irrigation, evaluated at 2008 following the same methodology reported by Blair et al.) for wild and cultivated accessions, respectively .
Tests for selection were performed to estimate whether Asr1 and Asr2 followed the Wright-Fisher model of neutral evolution in each subpopulation. Tajima’s D tests were carried out with DnaSP using 5,000 coalescent simulations  also for each division of wild vs. cultivated beans, Andean vs. Mesoamerican gene pools and race found in each group. Moreover, the nucleotide diversity of Nei was contrasted against an estimated distribution (evolutionary background) using random gene based-SNP markers unrelated to drought stress . Such comparison took into account the differences between markers in terms of number of alleles and information content, as well as the basics of population structure in common bean . The results obtained with the neutral model were compared with the ones obtained using the background distribution in order to access the power of the latter to capture the complexity of demographic processes in common bean.
Broughton WJ, Hernandez G, Blair M, Beebe S, Gepts P, Vanderleyden J: Beans (Phaseolus spp.) - model food legumes. Plant Soil. 2003, 25 (1): 55-128.
Singh SP: Common Bean (Phaseolus vulgaris L.). Genetic Resources, Chromosome Engineering, and Crop Improvement, Grain Legumes. Edited by: Singh RJ, Jauhar PP. 2005, CRC Press, London
Munoz LC, Blair MW, Duque MC, Tohme J, Roca W: Introgression in common bean x tepary bean interspecific congruity-backeross lines as measured by AFLP markers. Crop Sci. 2003, 44 (2): 637-645.
Blair MW, Galeano CH, Tovar E, Muñoz Torres MC, Castrillón AV, Beebe SE, Rao IM: Development of a Mesoamerican intra-genepool genetic map for quantitative trait loci detection in a drought tolerant × susceptible common bean (Phaseolus vulgaris L.) cross. Mol Breed. 2012, 29 (1): 71-88. 10.1007/s11032-010-9527-9.
Blair M, Diaz LM, Buendia HF, Duque MC: Genetic diversity, seed size associations and population structure of a core collection of common beans (Phaseolus vulgaris L.). Theor Appl Genet. 2009, 119 (6): 955-972. 10.1007/s00122-009-1064-8.
Cortés AJ, This D, Chavarro C, Madriñén S, Blair MW: Nucleotide diversity patterns at the drought-related DREB2 encoding genes in wild and cultivated common bean (Phaseolus vulgaris L.). Theor Applied Genetics. 2012, 125 (5): 1069-1085. 10.1007/s00122-012-1896-5.
Blair MW, Galeano CH, Tovar E, Muñoz MC, Castrillón AV, Beebe SE, Rao IM: Development of a Mesoamerican intra-genepool genetic map for quantitative trait loci detection in a drought tolerant × susceptible common bean (Phaseolus vulgaris L.) cross. Mol Breed. 2010, 29: 71-88.
Bartels D, Sunkar R: Drought and salt tolerance in plants. Crit Rev Plant Sci. 2005, 24 (1): 23-58. 10.1080/07352680590910410.
Saumonneau A, Laloi M, Lallemand M, Rabot A, Atanassova R: Dissection of the transcriptional regulation of grape ASR and response to glucose and abscisic acid. J Exp Bot. 2011, 63 (3): 1495-1510.
Taiz L, Zeiger E: Plant Physiology. 2006, Sinauer Associates, Sunderland, MA, 4
Frankel N, Carrari F, Hasson E, Lusem ND: Evolutionary history of the Asr gene family. Gene. 2006, 378: 74-83.
Rossi M, Carrari Y, Cabrera JL, Váquez C, Herrera L, Gudesblat G, Iusem ND: Analysis of an abscisic acid (ABA)-responsive gene promoter belonging to the Asr gene family from tomato in homologous and heterologous systems. Mol Gen Genet. 1998, 258: 1-8. 10.1007/s004380050700.
Hirose T, Scofield GN, Terao T: An expression analysis profile for the entire sucrose synthase gene family in rice. Plant Sci. 2008, 174: 534-543. 10.1016/j.plantsci.2008.02.009.
Philippe R, Courtois B, McNally KL, Mournet P, El-Malki R, Le Paslier MC, Fabre D, Billot C, Brunel D, Glaszmann JC, et al: Structure, allelic diversity and selection of Asr genes, candidate for drought tolerance, in Oryza sativa L. and wild relatives. Theor Appl Genet. 2010, 12 (4): 769-787.
Kalifa Y, Perlson E, Gilad A, Konrad Z, Scolnik PA, Bar-Zvi D: Over-expression of the water and salt stress-regulated Asr1 gene confers an increased salt tolerance. Plant Cell Environ. 2004, 27: 1459-1468. 10.1111/j.1365-3040.2004.01251.x.
Yang CY, Chen YC, Jauh GY, Wang CS: A Lily ASR Protein Involves Abscisic Acid Signaling and Confers Drought and Salt Resistance in Arabidopsis. Plant Physiol. 2005, 139: 836-846. 10.1104/pp.105.065458.
Giombini MI, Frankel N, Lusem ND, Hasson E: Nucleotide polymorphism in the drought responsive gene Asr2 in wild populations of tomato. Genetica. 2009, 136: 13-25. 10.1007/s10709-008-9295-1.
Frankel N, Hasson E, Iusem ND, Rossi M: Adaptive Evolution of the Water Stress-Induced Gene Asr2 in Lycopersicon Species Dwelling in Arid Habitats. Mol Biol Evol. 2003, 20 (12): 1955-1962. 10.1093/molbev/msg214.
Vienne D, Leonardi A, Damerval C, Zivy M: Genetics of proteome variation for QTL characterization: application to droughtstress responses in maize. J Exp Botany. 1999, 50: 303-309.
Jeanneaua M, Gerentesb D, Foueillassarc X, Zivyd M, Vidala J, Toppand A, Perez P: Improvement of drought tolerance in maize: towards the functional validation of the Zm-Asr1 gene and increase of water use efficiency by over-expressing C4–PEPC. Biochimie. 2002, 48 (11): 1127-1135.
Maskin L, Frankel N, Gudesblat G, Demergasso MJ, Pietrasanta LI, Iusem ND: Dimerization and DNA-binding of ASR1, a small hydrophilic protein abundant in plant tissues suffering from water loss. Biochem Biophys Res Commun. 2007, 352: 831-835. 10.1016/j.bbrc.2006.11.115.
Konrad Z, Bar-Zvi D: Synergism between the chaperone-like activity of the stress regulated ASR1 protein and the osmolyte. Planta. 2008, 227 (6): 1213-1219. 10.1007/s00425-008-0693-5.
Cakir B, Agasse A, Gaillard C, Saumonneau A, Delrot S, Atanassova R: A Grape ASR Protein Involved in Sugar and Abscisic Acid Signaling. Plant Cell. 2003, 15: 2165-2180. 10.1105/tpc.013854.
Tohme J, González O, Beebe S, Duque MC: AFLP Analysis of Gene Pools of a Wild Bean Core Collection. Crop Sci. 1996, 36 (5): 1375-1384. 10.2135/cropsci1996.0011183X003600050048x.
Gepts P: Origin and evolution of common bean: Past events and recent trends. HortSci. 1998, 33 (7): 1119-1135.
Gepts P, Debouck DG: Origin, domestication, and evolution of the common bean. Common Beans: Research for Crop Improvement. Edited by: van Schoonhaven A, Cali VO. 1991, Centro Internacional de Agricultura Tropical (CIAT), Cali, Colombia
Gepts P, Osborn TC, Rashka K, Bliss FA: Phaseolin-protein Variability in Wild Forms and Landraces of the Common Bean (Phaseolus vulgaris): Evidence for Multiple Centers of Domestication. Econ Botany. 1986, 40 (4): 451-468. 10.1007/BF02859659.
Kwak M, Kami JA, Gepts P: The Putative Mesoamerican Domestication Center of Phaseolus vulgaris Is Located in the Lerma-Santiago Basin of Mexico. Crop Sci. 2009, 49 (2): 554-563. 10.2135/cropsci2008.07.0421.
Chacon MI, Pickersgill B, Debouck DG: Domestication patterns in common bean (Phaseolus vulgaris L.) and the origin of the Mesoamerican and Andean cultivated races. Theor Appl Genet. 2005, 110 (3): 432-444. 10.1007/s00122-004-1842-2.
Beebe S, Skroch PW, Tohme J, Duque MC, Pedraza F, Nienhuis J: Structure of genetic diversity among common bean landraces of Mesoamerican origin based on Correspondence Analysis of RAPD. Crop Sci. 2000, 40: 264-273. 10.2135/cropsci2000.401264x.
Schoonhoven Av, Voysest O: Common Beans: research for Crop Improvement. 1991, Centro Internacional de Agricultura Tropical, Cali, Colombia
Benchimol LL, de Campos T, Carbonell SAM, Colombo CA, Chioratto AF, Formighieri EF, Gouvea LRL, de Souza AP: Structure of genetic diversity among common bean (Phaseolus vulgaris L.) varieties of Mesoamerican and Andean origins using new developed microsatellite markers. Gen Res Crop Evol. 2007, 54 (8): 1747-1762. 10.1007/s10722-006-9184-3.
Kwak M, Gepts P: Structure of genetic diversity in the two major gene pools of common bean (Phaseolus vulgaris L., Fabaceae). Theor Appl Genet. 2009, 118 (5): 979-992. 10.1007/s00122-008-0955-4.
Blair MW, Diaz JM, Hidalgo R, Diaz LM, Duque MC: Microsatellite characterization of Andean races of common bean (Phaseolus vulgaris L.). Theor Appl Genet. 2007, 116 (1): 29-43. 10.1007/s00122-007-0644-8.
Paredes M, Becerra V, Tay J, Blair MW, Bascur G: Selection of a Representative Core Collection from the Chilean Common Bean Germplasm. Chilean J Agri Res. 2010, 70 (1): 3-15.
Díaz LM, Blair MW: Race structure within the Mesoamerican gene pool of common bean (Phaseolus vulgaris L.) as determined by microsatellite markers. Theor Appl Genet. 2006, 114 (1): 143-154. 10.1007/s00122-006-0417-9.
Blair MW, Giraldo MC, Buendia HF, Tovar E, Duque MC, Beebe SE: Microsatellite marker diversity in common bean (Phaseolus vulgaris L.). Theor Appl Genet. 2006, 113 (1): 100-109. 10.1007/s00122-006-0276-4.
Papa R, Gepts P: Asymmetry of gene flow and differential geographical structure of molecular diversity in wild and domesticated common bean (Phaseolus vulgaris L.) from Mesoamerica. Theor Appl Genet. 2003, 106 (2): 239-250.
Acosta JA, Kelly JD, Gepts P: Prebreeding in common bean and use of genetic diversity from wild germplasm. Crop Sci. 2007, 47 (S3): S44-S59.
Becerra V, Paredes M, Rojo C, Díaz LM, Blair MW: Microsatellite marker characterization of Chilean common bean (Phaseolus vulgaris L.) germplasm. Crop Sci. 2010, 50: 1-10. 10.2135/cropsci2009.02.0086.
Singh SP, Gepts P, Debouck DG: Races of common bean (Phaseolus vulgaris, Fabaceae). Econ Botany. 1991, 45 (3): 379-396. 10.1007/BF02887079.
Beebe S, Rengifo J, Gaitan E, Duque MC, Tohme J: Diversity and origin of Andean landraces of common bean. Crop Sci. 2001, 41 (3): 854-862. 10.2135/cropsci2001.413854x.
Wakeley J: Coalescent theory: an introduction. 2008, Harvard University, Cambridge
Rafalski JA: Association genetics in crop improvement. Curr Opin Plant Biol. 2010, 13 (2): 174-180. 10.1016/j.pbi.2009.12.004.
Hudson RR, Kreitman M, Aguade M: A test of neutral molecularevolution based on nucleotide data. Genetics. 1987, 116: 153-159.
Cortés AJ, Chavarro MC, Blair MW: SNP marker diversity in common bean (Phaseolus vulgaris L.). Theor Appl Genet. 2011, 123 (5): 827-845. 10.1007/s00122-011-1630-8.
Singh SP: Broadening the genetic base of common bean cultivars: A review. Crop Sci. 2001, 41 (6): 1659-1675. 10.2135/cropsci2001.1659.
Harlan JR: Crops and Man. 1975, American Society of Agronomy, Madison, WI, USA
Zhao KY, Wright M, Kimball J, Eizenga G, McClung A, Kovach M, Tyagi W, Ali ML, Tung CW, Reynolds A, et al: Genomic Diversity and Introgression in O. sativa Reveal the Impact of Domestication and Breeding on the Rice Genome. PLoS One. 2010, 5 (5):
Papa R, Bellucci E, Rossi M, Leonardi S, Rau D, Gepts P, Nanni L, Attene G: Tagging the signatures of domestication in common bean (Phaseolus vulgaris) by means of pooled DNA samples. Ann Bot. 2007, 100 (5): 1039-1051. 10.1093/aob/mcm151.
Von Holdt BM, Pollinger JP, Lohmueller KE, Han EJ, Parker HG, Quignon P, Degenhardt JD, Boyko AR, Earl DA, Auton A, et al: Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication. Nature. 2010, 464 (7290): 898-U109. 10.1038/nature08837.
Boyko AR, Quignon P, Li L, Schoenebeck JJ, Degenhardt JD, Lohmueller KE, Zhao KY, Brisbin A, Parker HG, vonHoldt BM, et al: A Simple Genetic Architecture Underlies Morphological Variation in Dogs. PLoS Biol. 2010, 8 (8):
Rubin CJ, Zody MC, Eriksson J, Meadows JRS, Sherwood E, Webster MT, Jiang L, Ingman M, Sharpe T, Ka S, et al: Whole-genome resequencing reveals loci under selection during chicken domestication. Nature. 2010, 464 (7288): 587-U145. 10.1038/nature08832.
Karaiskou N, Triantafyllidis A, Katsares V, Abatzopoulos TJ, Triantaphyllidis C: Microsatellite variability of wild and farmed populations of Sparus aurata. J Fish Biol. 2009, 74 (8): 1816-1825. 10.1111/j.1095-8649.2009.02186.x.
Beebe S, Rao IM, Cajiao C, Grajales M: Selection for Drought Resistance in Common Bean Also Improves Yield in Phosphorus Limited and Favorable Environments. Crop Sci. 2008, 48: 582-592. 10.2135/cropsci2007.07.0404.
Librado P, Rozas J: DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25 (11): 1451-1452. 10.1093/bioinformatics/btp187.
Xia H, Camus-Kulandaivelu LT, Stephan W, Tellier AL, Zhang Z: Nucleotide diversity patterns of local adaptation at drought-related candidate genes in wild tomatoes. Mol Ecol. 2010, Online version
Tiana F, Stevensa NM, Buckler ES: Tracking footprints of maize domestication and evidence for a massive selective sweep on chromosome 10. PNAS. 2009, 106 (1): 9979-9986. 10.1073/pnas.0901122106.
Camus L, Chevin LM, Cordet CT, Charcosset A, Manicacci D, Tenaillon MI: Patterns of Molecular Evolution Associated With Two Selective Sweeps in the Tb1-Dwarf8 Region In Maize. Genetics. 2008, 180: 1107-1121. 10.1534/genetics.108.088849.
Caicedo AL, Williamson SH, Hernandez RD, Boyko A, Fledel-Alon A, York TL, Polato NR, Olsen KM, Nielsen R, McCouch SR, et al: Genome-wide patterns of nucleotide polymorphism in domesticated rice. PLoS Genet. 2007, 3 (9): 1745-1756.
Burton PR, Clayton DG, Cardon LR, Craddock N, Deloukas P, Duncanson A, Kwiatkowski DP, McCarthy MI, Ouwehand WH, Samani NJ, et al: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007, 447 (7145): 661-678. 10.1038/nature05911.
Chacon MI, Pickersgill B, Debouck DG, Arias JS: Phylogeographic analysis of the chloroplast DNA variation in wild common bean (Phaseolus vulgaris L.) in the Americas. Plant Sys Evol. 2007, 266 (3–4): 175-195.
Sahana G, Guldbrandtsen B, Janss L, Lund MS: Comparison of Association Mapping Methods in a Complex Pedigreed Population. Genet Epidemiol. 2010, 34: 455-462. 10.1002/gepi.20499.
Dayo GK, Thevenon S, Berthier D, Moazami-Goudarzi K, Denis C, Cuny G, Eggen A, Gautier M: Detection of selection signatures within candidate regions underlying trypanotolerance in outbred cattle populations. Mol Ecol. 2009, 18 (8): 1801-1813. 10.1111/j.1365-294X.2009.04141.x.
Haas BJ, Kamoun S, Zody MC, Jiang RHY, Handsaker RE, Cano LM, Grabherr M, Kodira CD, Raffaele S, Torto-Alalibo T, et al: Genome sequence and analysis of the Irish potato famine pathogen Phytophthora infestans. Nature. 2009, 461 (7262): 393-398. 10.1038/nature08358.
Ghedin E, Wang SL, Spiro D, Caler E, Zhao Q, Crabtree J, Allen JE, Delcher AL, Guiliano DB, Miranda-Saavedra D, et al: Draft genome of the filarial nematode parasite Brugia malayi. Science. 2007, 317 (5845): 1756-1760. 10.1126/science.1145406.
Kelleher CT, Wilkin J, Zhuang J, Cortés AJ, Quintero ALP, Gallagher TF, Bohlmann J, Douglas CJ, Ellis BE, Ritland K: SNP discovery, gene diversity, and linkage disequilibrium in wild populations of Populus tremuloides. Tree Gen Genomes. 2012
Fischer I, Camus-Kulandaivelu L, Allal F, Stephan W: Adaptation to drought in two wild tomato species: the evolution of the Asr gene family. New Phytol. 2011, 190 (4): 1032-1044. 10.1111/j.1469-8137.2011.03648.x.
Afanador LK, Hadley SD: Adoption of a mini-prep DNA extraction method for RAPD marker analysis in common bean. Bean Improvement Coop. 1993, 35: 10-11.
Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinforma. 2004, 5 (1): 113-10.1186/1471-2105-5-113.
Stephens M, Donnelly P: A comparison of Bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet. 2003, 73 (5): 1162-1169. 10.1086/379378.
Anderson JA, Churchill GA, Autrique JE, Tanksley SD, Sorrells ME: Optimizing parental selection for genetic linkage maps. Genome. 1993, 36: 181-186. 10.1139/g93-024.
Watterson GA: Number of segregating sites in genetic models without recombination. Theor Popul Biol. 1975, 7: 256-276. 10.1016/0040-5809(75)90020-9.
Nei M: Molecular evolutionary genetics. 1987, Wiley & Sons, New York
Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics. 2003, 19 (18): 2496-2497. 10.1093/bioinformatics/btg359.
Bandelt HJ, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999, 16 (1): 37-48. 10.1093/oxfordjournals.molbev.a026036.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.
Thornthwaite CW, Mather JR: The Water Balance. Climatology. 1955, 8 (1): 1-104.
Hamon WR: Estimating Potential Evapotranspiration. Proc Am Soc Civil Eng. 1961, 87: 107-120.
Rosales R, Ramírez P, Acosta JA, Castillo F, Kelly JD: Grain Yield And Drought Tolerance Of Common Bean Under Field Conditions. Agrociencia. 2000, 34: 153-165.
Pérez JC, Monserrate F, Beebe S, Blair M: Field evaluation of a common bean reference collection for drought tolerance. Annual Report Outcome Line SBA-1 Product 2: Beans that are more productive in smallholder systems of poor farmers. Edited by: CIAT. 2008, Palmira, Colombia
Tajima F: Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989, 123 (3): 585-595.
The authors wish to thank A Soler and L Díaz for DNA extractions and the personnel of the CIAT Genetic Resource Unit and CIAT bean program for seed multiplication. We also acknowledge the Cornell Biotechnology Center for support in sequencing and to the anonymous reviewers for their valuable comments. Sequencing at the GeneScope (Paris, France) facility by D Brunel and team is gratefully acknowledged as is primer design by P Romain and the team of D. This research was supported by the Generation Challenge Program ADOC project to DT and MWB.
AJC and CMC carried out laboratory work, genetic and statistical analyses. SM contributed with the statistical analysis. MWB and DT participated in the design and coordination of the study. AJC and MWB drafted the manuscript. All authors read and approved the final manuscript.