Skip to main content

Mapping phenotypic performance and novel SNPs for cold tolerance in tomato (Solanum lycopersicum) genotypes through GWAS and population genetics

Abstract

The cold stress susceptibility of tomato (Solanum lycopersicum) curtails its cultivation, with significant impact in temperate regions and on cropping seasons. To unravel genomic regions responsible for cold stress resilience, a diverse set of fifty genotypes encompassing cultivated, wild species, and landraces were genotyped using genotyping-by-sequencing. Over two years and six trials employing both early and late sowing, these lines were evaluated. Illumina-based next-generation sequencing produced up to 3 million reads per sample from individually sequenced library pools. The Tassel pipeline yielded 10,802 variants, subsequently filtered to 3,854 SNPs for genome-wide association analysis (GWAS). Employing clustering methods (population structure) via TASSEL, SNPhylo, and Kinship matrix, the fifty genotypes clustered into four distinct gene pools. The GWAS for cold tolerance in tomato integrated key traits including yield. Using six independent phenotypic datasets representing various environments, the study identified 4,517 significant marker-trait associations for cold tolerance traits. Notably, pivotal variations (> 10%) in cold stress tolerance, particularly proline content, were linked to marker-trait associations. Additionally, 5,727 significant marker-trait associations for yield and yield-related traits were unveiled, shedding light on fruit yield and directly associated attributes. The investigation pinpointed 685 candidate genes across all examined traits, including 60 genes associated with biological processes within these genomic regions. Remarkably, 7 out of the 60 genes were directly linked to abiotic stress tolerance, functioning as stress-responsive genes either directly or indirectly. The identified genes, particularly those associated with stress response, could hold the key to enhancing cold tolerance and overall crop productivity in tomato cultivation.

Peer Review reports

Introduction

Tomato due to its versatility and comparatively short genome (950 Mb), is commonly used as representative plant species for determining the genetic basis of complex traits, providing an ideal basis for current “-omics” research and genome guided breeding and has been employed both in conventional and molecular genetics. Nutrionally, the tomatoes are designated as ‘Protective Food’ as they are high in lycopene, ascorbic acid and beta-carotene. Consumption of lycopene-rich diets has been linked to a lower risk of numerous malignancies, including prostate cancer and coronary heart diseases (Karppi et al., 2009). Chen et al., (2015) found that increased lycopene consumption/circulating concentration is linked to a decreased possibility of prostate cancer and this red pigment (lycopene) is currently regarded as the “world’s most effective natural antioxidant”, making it a valuable ingredient in commercial therapeutic formulations [1, 2].

Tomatoes are extremely susceptible to chilling temperatures (0–12 °C), and many tomato-growing areas endure low (chilling) temperatures during the growing season, resulting in significant yield and colour reductions. Low temperature (cold stress) sensitivity of commercial varieties thus limits their geographic distribution, cultivation, growing period, or planting and harvesting times. Due to the prevalence of low temperatures, the number of cropping seasons is just limited to one only in temperate regions. Changes in ambient temperature have an unavoidable impact on biological processes. The rate of spontaneous and enzyme-catalysed chemical and physical reactions, the structure and molecular dynamics, and the strength of molecular contacts are all affected by temperature. Each of these side effects disrupts metabolism and cellular signal processing in some way. Due to direct suppression of metabolic activities as well as cold-induced osmosis (reduction of water uptake and cellular dehydration), the cold stress hinders plants from expressing their full genetic potential. Low temperature has a negative influence on tomato plant growth and development throughout its life cycle. It reduces germination [3], water status and photosynthesis affecting early phases of growth and development [4]. In addition, the reproductive development is also severely disrupted at all stages. Cold stress results in homeotic floral changes and reduced fruit set due to poor pollen germination, resulting in a significant drop in fruit production. Cultivars that are cold-tolerant can yield two or three cropping seasons each year. This would improve the efficiency with which existing processing capacity and fresh market availability could be improved and utilized [5].

In the past fruit yield, plant habit, adaptability to machine harvesting in processing cultivars and features linked to fruit appearance for the fresh market such as firmness, colour have been the focus of the majority of breeding studies. However in recent times due to climate change tomato breeding goals are more focused towards long-term productivity in changing environmental conditions. To broaden Solanum lycopersicon’s restricted genetic diversity, genetic variation from exotic germplasm collections is to be incorporated from other species like Solanum pimpinellifolium, Solanum chilense, Solanum peruvianum, Solanum habrochaites and Solanum pennellii. The first step includes studying the genetic diversity. Genomic revolution, during the last two decades, simplified understanding of the complex responses to biotic and abiotic stress in several crop plants [67]. Since tomato species have spread across wide range of habitats of and are found in cold, dry, mesic and salty ecosystems at elevations above 3000 m MSL and therefore could be valuable for identifying and understanding the natural defence mechanism in Solanum species that allows them to deal with cold but not freezing conditions, known as cold acclimation. Plants use transcriptional, post-transcriptional and post-translational processes to modify their gene expression during cold acclimatization. Cold stress activates a variety of transduction pathways in response to environmental changes. Transcriptome studies of plants’ responses to abiotic stress revealed that the plant’s response to abiotic stress is mediated by many regulatory mechanisms.

Genome-wide association studies look at genetic variants across a variety’s genome to discover if any of them are linked to a trait [8] and has evolved as an useful tool for investigating and linking the vast volumes of data on genome sequence variation with observable behavioural differences. In GWAS the sample size is always large making it inaccessible for the researchers by being expensive but one of the approaches can be to use a star-like design by including geographically distant accessions (Arthur and Ashley, 2014). This maximizes the genetic variance within the sample (Li et al., 2010). Therefore, we used a core collection of diverse accessions collected from various agroclimatic regions to generate a basic and comprehensive dataset for cold tolerance traits apart from yield and yield related traits. QTL and GWAS has been conducted in tomato for various traits [9]. QTL studies for abiotic stresses have been conducted by Arms et al. (2015), Liu et al. (2016), Diouf et al. (2018) etc. in tomato. Also, three hundred eighty-eight suggestive association loci (including 126 significant loci) for ninety-two metabolic traits including nutrition and flavor-related loci were identified by genome-wide association study from various accessions in two different environments by Ye et al. (2019) [10]. The genetic architecture underlying tomato yield-related traits has been studied through GWAS. Based on 4.4 million single nucleotide polymorphisms obtained from diverse accessions, a comprehensive genome-wide association study for twenty-seven 27 agronomic traits in tomato have also been conducted (Ye et al. (2020) [11]. GWAS analysis of a collection of landraces and vintage was performed revealing that previously uncharacterized chromosomal regions were potentially involved in the expression of variable phenotypes by Rodriquez et al. But, a complete GWAS program with prime objective of cold tolerance has never been conducted. Therefore keeping in mind the lag this study was designed [12].

Materials and methods

Plant material

The experimental material comprised of fifty genotypes, used for identifying genomic regions and candidate genes for cold tolerance, yield and yield related traits collected from different agro- climatic regions. The list of genotypes is presented in Table 1. The collection included lines released as cold tolerant varieties viz., Pusa Sheetal, varieties grown and well acclimatized in cold stressed areas viz., Polish Tomato (heirloom variety from Poland; vigorous plants set fruit well even in cool weather), Sub Arctic Plenty (these tomatoes were developed in Alberta for prairie climates and set fruit under cold conditions), Coldset (an heirloom variety from Canada; the seeds withstand low temperatures), Black Plum (Russian heirloom variety, it is a fairly hardy cultivar and produces fruits in cooler temperatures) Stupice (it is Czechoslovakian-bred vine; it can blossom at cooler temperatures and can grow in climates as cold as those found in Alaska), Glacier (very cold tolerant and may survive a light frost also)., various species viz., WIR-3957 (Solanum peruvianum), WIR-13,717 (Solanum lycopersicum var cerasiforme), WIR-13,706 (Solanum lycopersicum var cerasiforme), WIR-5032 (Solanum chilense), WIR-13,708 (Solanum lycopersicum var cerasiforme), IIHR-1939 (Solanum pimpinellifolium) and IIHR-2805 (Solanum peruvianum), local landrace viz., Local-1 and breeding lines already identified as cold tolerant viz., IARI-2 and IARI-3.

Table 1 List of Solanum genotypes used in the study

Phenotyping

In case of tomato, the optimal temperature 15°C to 30°C. Tomatoes are sensitive to cold stress and it has a negative impact on tomato plant growth and development throughout its life cycle. Cold temperatures decrease germination early in development, the reproductive development is severely disrupted at all stages, Cold stress results in homeotic floral changes and reduced fruit set due to poor pollen germination and Furthermore, the low temperature causes tomatoes to fail to ripen normally. Early and late sowing a simple and effective technique for both germination and reproductive stage was adopted for phenotyping the germplasm along with planting at optimal periods. The fifty lines were evaluated for two consecutive years 2019 and 2020 under normal sown environment (E1: Field, normal sown (2019); E2: Polyhouse sown (2019); E3: Field, early sown (2020); E4: Field, normal sown (2020); E5: Field, late sown (2020); E6: Polyhouse sown (2020).

Experiments were conducted at experimental farm of the Division of Vegetable Science, SKUAST-K, Shalimar (34o N latitude and 74.89o E longitude, 1685 m above MSL). The population was evaluated in randomized block design. The spacing followed was 60 cm × 45 cm. All the individuals of the population were apportioned into a total of three blocks along replications. The maximum, minimum, and mean temperatures were recorded daily during the entire cropping season for both years. The mapping population was phenotyped for yield and yield related traits viz., days to emergence, seedling length at transplant (cm), number of flowers per truss, number of days to first fruit set, number of fruits per truss, number of days to first harvest, average fruit weight (g), fruit shape index, number of fruits per plant, fruit yield per plant (kg), number of primary branches, plant height (cm) and duration of harvest and cold tolerance traits viz., pollen viability (% (fluorescent microscope (LMI microscope ABE-UK)), malondialdehyde content (nmol gfw−1 [13]), proline content (µmol gfw−1 [14]), total leaf chlorophyll content (mg 100−1g [15]), ascorbic acid (mg 100−1g A.O.A.C (1984)), lycopene content (mg 100−1g [16]), total phenols (mg 100−1g (Malick and Singh (1980) and total soluble sugars (mg 100−1g [17]).

Analysis of variance, best linear unbiased prediction (BLUP), and heritability

The ANOVA for the genotypes was performed using Metan: An R package [18], for individual environments using the mixed model analysis. For each trait and environment, the analysis was performed considering entry and block (nested within replication) as random effects and replication as fixed effects.

Broad-sense heritability was calculated as H2 = Vg/Vp.

In GAPIT-R, the best linear unbiased predictors (BLUPs) of each genotype were calculated for each environment. The calculated BLUPs were then used in the GWAS analysis.

DNA extraction, genotyping and single-nucleotide polymorphism calling

DNA from 50 genotypes was isolated from leaf samples by using Xcelgen Plant gDNA Isolation Kit (XG2611-01) as per the protocol described. The DNA was eluted in 50 µl Nuclease-Free Water. The samples were quantified using Qubit Fluorometer. For determining A260/280 ratio, 1 µl of sample was loaded in Nanodrop8000 spectrophotometer. The quality of samples was checked in 0.8% Agarose gel electrophoresis along with Hind III marker for the presence of intact bands. The DNA from samples were digested using ApeK1 enzyme and the GBS library was prepared from the digested DNA fragments by ligating adaptors specific to the cut-site. The library pool was analyzed in Bioanalyzer 2100 (Agilent Technologies) using High Sensitivity (HS) DNA chip and then sequenced independently on Illumina platform to generate up to ~ 3 million reads/ sample. These barcodes tagged fastq files were analysed. TASSEL5 GBSv2 pipeline was used for identification of tags at cut sites and SNPs located across reference genome. Tomato reference genome (S_lycopersicum_chromosomes.4.00.fa.gz) employed was downloaded from solgenomics /(ftp://ftp.solgenomics.net/tomato_genome/assembly/build_4.00/). The fastq files were then aligned against the tomato reference genome using the Bowtie2 tool version 2.2.9. The .sam file created from the Bowtie2 aligner program was used through SAMToGBSdbPlugin.The alignment file was then processed by using the GBSv2 analysis pipeline for SNP calling and genotyping.

Population structure, kinship and linkage decay (LD) analysis

STRUCTURE software (v.2.3.4). was used to assess the population genetic structure among 50 tomato genotypes [19] with admixture model. Population structure was estimated based on total SNP loci (3854 SNPs) and K from 1 to 10 with 10 independent runs for each K. To determine the probable number of clusters based on genotypes, the software parameters were set to 5000 burnin and 50,000 MCMC (Markov Chain Monte Carlo) iteration. Structure output was then subjected to structure harvester for identification of effective number of clusters using the Evanno test implement in STRUCTURE HARVESTER (Earl 2012). The Principal component analysis was obtained using TASSEL (5.0) for determination of percentage of variation explained by top three principal components. For phylogenetic analysis the SNP data was imported to SNPhylo with following parameters;-m 0.05 –a 478 –A-M 0.5-b-B 192; where m is the minor allele frequency; a is the total number of autosomes, b is Performs; M Missing rate. Tassel was used for generation of Kinship matrix and to know the relatedness among 50 tomato genotypes based on shared alleles among individuals. Linkage Disequilibrium (LD) was estimated using TASSEL (5.0) and LD curve was fitting using a nonlinear model as described by [20].

Genome wide association analysis (GWAS)

The Genome-wide association analysis involves regressing each SNP separately on a given trait, adjusted for various confounding variables. After removing four controls, the genotypes were processed for GWAS analysis. TASSEL and GAPIT was used for GWAS analyses. SNP-trait association analysis was performed using Mixed Linear Model (MLM) [21] implemented in TASSEL that include correction for both population structure and kinship. GAPIT was also used for GWAS analysis, which is a Genome Association and Prediction Integrated Tool. GAPIT is a package that is run in the R software environment. GAPIT’s MLM (Mixed Linear Model) was used for GWAS analysis. Six phenotypic data sets representing each of six environments were used independently for genome wide association study. After GWAS analysis, SNPs showing association with particular trait at P-value < = 0.05 were considered as significant SNPs. The qqman R package [22] was used to plot Manhattan and quantile-quantile (QQ) plots. A 5% significance level was used to identify SNPs significantly associated with trait.

Identification of candidate genes

Functional annotation of the predicted underlying genes with significantly associated SNPs was performed using the Solanum lycopersicum SL4.0 genome browser. Nearest genes located upstream or downstream of the significant markers were considered. Gene models were blasted against Tomato Genome Proteins (ITAG release 4.0) to determine the gene annotation.

Results

Phenotypic performance and genetic variability

The Analysis of variance performed on the traits using environment (Y), genotype (G) and genotype x environment (G x Y) interactions as effects of the model as presented in Table 2 revealed highly significant differences among all the genotypes under study for all traits and environments except for the trait fruit shape index thereby indicating a good amount of variability in the present material. The perusal of Table 3 revealed that mean performance of genotypes for various traits under different environments showed large variation and also the range was high for almost all the traits under study indicating that wide variation existed in the population.

Table 2 Phenotypic Performance and Genetic Variability using environment (Y), genotype (G) and genotype x environment (G x Y) interactions
Table 3 Environment wise estimates of mean and range for different traits in tomato (Solanum spp.)

Population structure, distribution, principal component analysis, kinship matrix and linkage disequilibrium (LD)

Three million reads/sample were generated by sequencing using on Illumina platform. The raw sequence data were filtered to remove low-quality bases, adapter contamination and uncalled bases to produce high-quality sequence data. Then Tassel pipeline resulted in generation of 10,802 SNPSs after SNPS calling. After applying various quality-filtering parameters (MAF > = 0.05, MAC > = 10, Missing Data < = 50%) 3854 SNPSs were retained for downstream analysis. of 10,802 variants. Linkage decay was observed after 1.0 Mb distance (Fig. 1) that has a practical significance of identifying significant trait association even with a smaller number of markers.

Fig. 1
figure 1

Linkage disequilibrium plot

Using SNPs genotype file, the population structure within 50 genotypes was investigated. The ΔK method was used to infer the correct number of subpopulations. The ΔK method takes the rate of change of the mean probability values (LnP) of each subpopulation into consideration. The structural analysis led to the identification of four (K = 4) genetically distinct subpopulations. The rate of change was maximum (159.83) at K = 4 (Fig. 2); therefore, we considered four subpopulations in our population of 50 tomato genotypes.

Fig. 2
figure 2

Plot showing the ΔK values for cluster size K = 1 to K = 10. The estimation of ΔK is performed at 10 independent runs burnin = 50,000 and MCMC = 100,000. Optimum cluster was obtained at K4

For the visualization of population structure, admixture analysis and population distribution at optimum K value of 4, bar plot was generated (Fig. 3). Then, based on the admixture coefficient obtained from STRUCTURE, number of samples falling under each population was determined. Genotypes with membership probabilities higher than 0.6 were assigned to one of the subpopulations (Table 4) whereas remaining were considered as admixed.

Table 4 Population assigned based on admixture coefficient
Fig. 3
figure 3

Bar Plots for K = 4 showing the population structure and genetic diversity present in each sample with their admixtures from 4 populations. The admixtures for each population in each sample shows the total proportion of each genotype present in the samples and their diversity

These four subpopulations (Fig. 4) possessed an uneven distribution of genotypes with P2 receiving the maximum genotypes equal to 37. P3, P1 and P4 received 5, 4, 2 genotypes respectively and the population consisted of 2 admixtures.

Fig. 4
figure 4

Number of genotypes in each subpopulation based on membership coefficient obtained using STRUCTURE software

P2 consisted of majority of germplasm including exotic collections, land races, released varieties belonging majorly to species Solanum lycopersicum (31) and some of them to other species; Solanum pimpinellifolium (2), Solanum. lycopersicum var. cerasiforme (3), Solanum peruvianum (1). Other subpopulation that is P1 consisted of 4 genotypes belonging to species Solanum peruvianum (1), Solanum habrochites (1) and Solanum lycopersicum (2). P3 consisted of 5 genotypes belonging to Solanum. lycopersicum var. cerasiforme (2), Solanum chilense (1) and Solanum lycopersicum (2). P4 and admixtures both consisted of 2 genotypes both belonging to Solanum lycopersicum. There was no clear trend of the distribution. Through principal component analysis (Fig. 5) and Kinship analysis generated using trait analysis by association, evolution and linkage (TASSEL) (Fig. 6A) and genome association and prediction integrated tool (GAPIT) (Fig. 6B) analysis presence of four populations was confirmed.

Fig. 5
figure 5

Principal component analysis using genotypic data from all the samples showed samples were scattered in different populations. Each dot represents an entry. Sub populations as defined by STRUCTURE analysis. D: admixed sub-population, P1: sub-population 1, P2: sub-population 2, P3: sub-population 3

Fig. 6
figure 6

A Kinship plot generated from TASSEL, B Kinship plot generated from GAPIT

Genome wide association study

Then Tassel pipeline resulted in generation of 10,802 variants on aligning against the tomato reference genome. These variants were filtered to retain only SNPs which were to be further used for downstream analysis GWAS analysis. After filtration, 3854 SNPs were retained. SNPs were not distributed evenly across all chromosomes. The top significant marker trait associations are presented in Table 5.

Table 5 Top significant marker trait associations under different environments

Marker trait association for cold tolerance traits

Pollen viability

A total of 265 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 119 significant SNPs were mapped. The mapping of significant SNPs was onto chromosomes 1, 2, 8, 9, 11 and 12 mostly. CH09_32765160, CH09_32765161 and CH09_32765231 were mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10% with additive effect. CH08_9512860 was mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with negative effect. CH08_9396204 was mapped across both the cold stressed environments (E3 and E5) with explaining a major phenotypic variation of > 10% with additive effect.

Malondialdehyde

A total of 728 significant SNPS were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 238 significant SNPs were mapped. The mapping of significant SNPs was mostly onto chromosomes 1, 6, 2, 8, 9 and 11. CH08_9395919 and CH02_22999374, were mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with negative effect which is desirable for the stated trait. CH08_9513555, CH08_9513565 and CH02_22989441 were mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10% with negative effect. CH08_9292940 was mapped across both the cold stressed environments with additive variance explaining a minor phenotypic variation of > 10% with negative effect.

figure a

Manhattan plots and Quantile-Quantile (QQ) plots of top significant markers for A Proline content, B No of fruits per plant, C No of days to first fruit set, D Fruit shape index under different environments

Proline content

A total of 757 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 275 significant SNPS were mapped. The mapping of significant SNPs was onto chromosome 8 and 11 mostly. CH08_9484802 and CH08_9484810 were mapped across both the cold stressed environments (E3 and E5) with explaining a major phenotypic variation of > 10% with the former having negative effect and latter having additive effect. CH08_9484802 (8.75*10 − 6), CH08_9484810 (8.75*10 − 6) and CH08_9528421 (7.65*10 − 5) were mapped across early cold environment with highly significant P-values. CH08_9456588 and CH08_9435068 were mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10% with additive effect. CH08_9435068 was mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect.

Total leaf chlorophyll content

A total of 163 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 35 significant SNPs were mapped. The mapping of significant SNPS for total leaf chlorophyll content was onto chromosomes 1, 2, 3, 8, 9, 10 and 11 mostly. CH08_9408610, CH09_32797168, CH09_32797221 and CH09_32797222 were mapped across both the normal environments (E1 and E4) in both years explaining a minor phenotypic variation of < 10% (PVE) with additive effect. CH09_32800680, CH09_32800681 and CH09_32797168 were mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10% with additive effect.

Ascorbic acid

A total of 792 significant SNPS were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 236 significant SNPS were mapped. The mapping of significant SNPs was mostly onto chromosomes 2, 3, 9, 10 and 11. Some of the top significant SNPs across environments overlapped, indicating some major effect loci. CH11_51779471 was mapped across all the six environments explaining a major phenotypic variation of > 10% (PVE) with additive effect. CH10_64514613 and CH10_64514614 were mapped across both the normal environments (E1 and E4) in both years explaining minor phenotypic variation of < 10% with additive effects. CH02_4489 and CH09_32801508 were mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10%. CH11_51768254 was mapped across both the cold stressed environments with additive variance explaining a minor phenotypic variation of < 10%.

Lycopene content

A total of 799 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 314 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 2, 5, 6, 9 and 11 mostly. CH06_10446131 and CH06_10446132 were mapped across five environments except the late sown environment explaining a major phenotypic variation of > 10% (PVE) and additive effect. CH11_13438931 was mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect. CH11_2043421 was mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10% with additive effect.

Total phenols

A total of 643 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 263 significant SNPs were mapped. The mapping of significant SNPs was onto chromosome 2, 3, 6, 7, 8, 9 and 11 mostly. CH06_8656494, CH06_8656496 and CH07_67281761 were mapped across both the normal environments (E1 and E4) in both years explaining a minor phenotypic variation of < 10% (PVE) with additive effect. CH03_542789 and CH09_32765930 were mapped across both the protected environments (E2 and E6) in both years explaining major variation of > 10% with additive effect.

Total soluble sugars

A total of 370 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 206 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 2, 6, 8, 9 and 11 mostly. CH08_9452288, CH08_9452289 and CH06_43801072 were mapped across both the normal environments (E1 and E4) in both years explaining a minor phenotypic variation of < 10% (PVE) with additive effect. CH02_22990072 and CH11_13438935 were mapped across both the protected environments (E2 and E6) in both years explaining minor phenotypic variation of < 10% with additive effect.

Marker trait association for yield and yield related traits

Days to emergence

A total of 456 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 85 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 1, 2, 6, 7, 8 9 and 11 mostly. CH09_32800889 and CH09_32800890 were mapped across both the normal environments (E1 and E4) in both years explaining a minor phenotypic variation of < 10% (PVE) with negative effect desirable for the trait. CH06_11575381, CH06_11575394, CH06_11575453 and CH06_8639002 were mapped across both the protected environments (E2 and E6) in both years explaining minor phenotypic variation of < 10% with negative effect.

Seedling length at transplant

A total of 702 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 309 significant SNPS were mapped. The mapping of significant SNPs for this trait was onto chromosomes 4, 5, 6, 8, 9 and 11 mostly. CH05_26702700, CH05_26702702 and CH08_9396204 were mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect.

Number of flowers per truss

A total of 187 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 71 significant SNPS were mapped. The mapping of significant SNPs was onto chromosome 3, 8, 9, 11 and 12 mostly. CH03_2152187, CH08_9512856, CH09_32801176, CH09_32801177 and CH09_32801178 was mapped across both the protected environments (E2 and E6) in both years explaining major phenotypic variation of > 10% with additive effect.

Number of days to first fruit set

A total of 365 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 25 significant SNPs were mapped. The mapping of significant SNPS was onto chromosomes 2, 6, 8, 9, 11 and 12 mostly. CH09_32796951, CH09 32,796,992 and CH09_32797021 were mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with negative effect desirable for the trait.

Number of fruits per truss

A total of 362 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 31 significant SNPs were mapped. The mapping of significant SNPS was onto chromosome 1, 3, 7, 9, 11 and 12 mostly. CH12_49436823 was mapped across both the protected environments (E2 and E6) in both years explaining minor phenotypic variation of < 10% with additive effect.

Number of days to first harvest

A total of 265 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 35 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 1, 2, 3, 8, 9 and 11 mostly. CH11_13438926 was mapped across both the protected environments (E2 and E6) in both years explaining major phenotypic variation of > 10%.

Average fruit weight

A total of 1085 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 346 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 7, 8, 9 and 11 mostly. CH09_32800889, CH09_32800890, CH11_51768254 and CH11_51768333 were mapped across all the environments except the late sown environment in both years explaining minor phenotypic variation of < 10% with negative effect.

Fruit shape index

A total of 793 significant SNPS were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 211 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 1, 2, 3, 4, 6, 8, 9 and 11 mostly. Different SNPs were mapped across different environments. Top SNPS with highly significant p-values were mapped across the normal environment (E4) namely CH08_9435068 (8.31*10 − 7), CH08_9452288 (1.75*10 − 5), CH08_9452289 (1.75*10 − 5), CH08_9454260 (2.31*10 − 5) and CH08_9454275 (2.31*10 − 5) explaining major phenotypic variation of > 10% with additive effect.

Number of fruits per plant

A total of 289 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 158 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 2, 3, 9, 10, 11 and 12 mostly. CH03_2152187 was mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect. CH12_49436823 was mapped across both the cold environments (E3 and E5) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect.

Fruit yield per plant

A total of 701 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 101 significant SNPS were mapped. The mapping of significant SNPs was on to chromosomes 1, 4, 7, 8, 9 and 11 mostly. CH04_58505565 was mapped across both the normal environments (E1 and E4) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect. CH04_58505565 and CH01_60016947 were mapped across both the protected environments (E2 and E6) in both years explaining major phenotypic variation of > 10% with additive effect.

Number of primary branches

A total of 469 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 114 significant SNPS were mapped. The mapping of significant SNPs was onto chromosome 1, 2, 8, 9 and 11 mostly. CH08_9792846 was mapped across both the protected environments (E2 and E6) in both years explaining major phenotypic variation of > 10% with additive effect.

Plant height

A total of 610 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 210 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 1, 2, 6, 8 and 9 mostly. CH08_9792846 was mapped across both the protected environments (E2 and E6) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect. CH06_43801076 was mapped across both the cold environments (E3 and E5) in both years explaining a major phenotypic variation of > 10% (PVE) with additive effect.

Duration of harvest

A total of 145 significant SNPs were mapped for this trait under all the six environments. Under the cold stressed environments (E3 and E5) of open field conditions 85 significant SNPS were mapped. The mapping of significant SNPs was onto chromosomes 1, 2, 8, 9 and 11 mostly. CH02_11280 and CH08_9792846 were mapped across both the normal environments (E1 and E4) in both years explaining a minor phenotypic variation of < 10% (PVE) with additive effect. CH11_13438944, was mapped across both the protected environments (E2 and E6) in both years explaining minor phenotypic variation of < 10% with additive effect.

Candidate genes

Mining of the candidate genes revealed a total of 685 genes; 40 genes from physical positions 24,506,038–86,800,901 bp on chromosome 1, 72 genes from physical positions 2344-38613228 bp on chromosome 2, 22 genes from physical positions 542789-61706899 bp on chromosome 3, 13 genes from physical positions 30,904,930–64,421,176 bp on chromosome 4, 8 genes from physical positions 1,507,699–28,259,872 bp on chromosome 5, 36 genes from physical positions 4,370,518–43,806,292 bp on chromosome 6, 38 genes from physical positions 66,235,127–67,281,761 bp on chromosome 7, 190 genes from physical positions 2,628,180–45,067,799 bp on chromosome 8, 137 genes from physical positions 23,454,319–61,706,964 bp on chromosome 9, 16 genes from physical positions 42,879,485–64,514,640 bp on chromosome 10, 108 genes from physical positions 2,043,342–52,369,820 bp on chromosome 11 and 7 genes from physical positions 5,896,180–65,817,136 bp on chromosome 12. Based on functional categorization, 60 genes were found to be associated with biological processes in these genomic regions (5 on chromosome-1, 8 on chromosome-2, 1 on chromosome-3, 4 on chromosome-4, 2 on chromosome-5, 3 on chromosome-6, 2 on chromosome-7, 15 on chromosome-8, 6 on chromosome-9, 2 on chromosome-10, 10 on chromosome-11 and 2 on chromosome-12). 7 among 60 were directly found to be related to abiotic stress tolerance and function directly or indirectly as cold stress responsive genes. First among seven genes comprised of Solyc03g007550.4 annotated LIM domain protein (AHRD V3.3 *** AT5G52950.1). Second one was Solyc11g039880.2 with annotation as Nucleoporin (DUF3414) (AHRD V3.3 *** F4JUG3_ARATH). Third one was Solyc02g022930.3 with annotation as 3-hydroxyisobutyrate dehydrogenase (AHRD V3.3 *** A0A1U8FBY0_CAPAN) and fourth one was Solyc07g065840.2 annotated as molecular chaperone Hsp90-2. Fifth one was Solyc04g082980.2 annotated as Tetratricopeptide repeat protein SKI3 (AHRD V3.3 *** A0A2G2ZSZ3_CAPAN) and sixth one was Solyc05g006830.3 annotated as Thioredoxin (AHRD V3.3 *** A0A2G2VM05_CAPBA). Seventh one was Solyc11g017345.1 annotated as F-box/FBD/LRR-repeat protein (AHRD V3.3 *-* XP_010312830.1). Other genes included essential genes for growth and developmental processes (Table 6).

Table 6 List of candidate genes identified

Discussion

Every degree increase in ambient temperature, in the context of climate change, has a significant impact on crop yield, particularly for tomato, which is exceptionally sensitive to cold temperatures. Tomato cultivar’s low temperature sensitivity limits their geographic distribution, cultivation and growing season. Therefore, understanding the nature, impact, and molecular mechanisms of cold stress tolerance will help in designing strategies to overcome production losses. Previously, no studies have been reported on GWAS for cold tolerance in tomato. Studies were more focused on understanding the nature, impact, and existing diversity in germplasm lines as well as identifying the genomic regions responsible for yield and quality traits. In this study, we reported some marker trait associations for cold stress tolerance in tomato.

The analysis of variance revealed highly significant differences among all the genotypes under study for all traits and environments thereby indicating a good amount of variability in the present material. The mean performance of genotypes for various traits under different environments showed large variation and also the range was high for almost all the traits under study indicating that wide variation existed in the population. Heritability in broad sense was found to be high for all the characters under study. High heritability for physiological traits indicated that the selection for cold tolerance relying on these traits could be effective. Its value ranged from 89 − 100%. Studies for varaiblity of germplasm were conducted by various workers viz., [23, 24] and were of the same opinion that there was presence of sufficient variability in the tomato germplasm.

Based on STRUCTURE, SNPhylo and Kinship matrix, the tomato (50 lines) germplasm belonging to different species was categorized into four distinct populations. These four subpopulations possessed an uneven distribution of genotypes with P2 receiving the maximum genotypes equal to 37. P3, P1 and P4 received 5, 4, 2 genotypes respectively and the population consisted of 2 admixtures. P2 consisted of majority of germplasm including exotic collections, land races, released varieties belonging majorly to species Solanum lycopersicum (31) and some of them to other species; Solanum pimpinellifolium (2), Solanum lycopersicum var. cerasiforme (3), Solanum peruvianum (1). The grouping of Solanum pimpinellifolium, in the same cluster with other Solanum lycopersicum lines was also reported by [25] and [26] as sister groups. Other subpopulation that is P1 consisted of 4 genotypes belonging to species Solanum peruvianum (1), a derivative of Solanum habrochites (1) and Solanum lycopersicum (2). The clustering of Solanum chilense and Solanum habrochaites together can possibly be due to their origin Southern Peru where they are mainly found [27]. P3 consisted of 5 genotypes belonging to Solanum lycopersicum var. cerasiforme (2), Solanum chilense (1) and Solanum lycopersicum (2). The grouping of Solanum chilense and Solanum lycopersicum var. cerasiforme together can be attributed to the complementary properties they posses for salt-stress resistance and smaller fruit size as proved by [28]. P4 consisted of 2 genotypes both belonging to Solanum lycopersicum. Admixtures consisted of 2 genotypes both of them being exotic collections belonging to species Solanum lycopersicum. The admixture level of first admixture obtained was higher in sub-population 1 and that of second admixture obtained was higher in sub-population 3 among 4 populations generated. The tendency of genotypes being distributed into various clusters by species indicated that the species were genetically unique. Clustering by origin region found that accessions with similar genetic similarity are also geographically similar. The distribution, on the other hand, lacked a distinct pattern. The genetic structure discovered in our study implies that selection for market specialisation has left a genetic trace in grown tomatoes. The observed variance also suggests that various patterns of genetic variation in farmed tomatoes are caused by wild species introgression and market specialised selection. Tomatoes (Solanum lycopersicum) have been subjected to extensive selection both during and after domestication.

Tomato has been a forerunner in QTL mapping for agronomic and yield-related traits on populations. Several genome wide association mapping investigations have been conducted in the previous decade, resulting in the discovery of novel loci for fruit quality, metabolites and flavor-related compounds. Efforts have also been made to explore major agronomic features in core sets of improved and wild cultivars. However, all the research was focused on yield and yield-related variables under normal conditions, resulting in the identification of several associated genomic areas, but no attempts were made to map loci under cold stress and protected environments. But this study, allowed us to identify marker trait associations for cold stress in tomato, since it was the first and one of its kind. In addition to reporting marker trait associations for cold tolerance traits, associations for yield and yield related traits under normal and cold stressed environments was also done which to the best of our knowledge, is the first comprehensive study.

Cold stress impairs the plant system and therefore there are substances which act as markers for susceptibility or resistance. Pollen viability, total leaf chlorophyll content and lycopene act as such markers and are all reduced in response to cold stress in susceptible plants as a response to cold stress. Poor winter fruit set in tomato has been reported by drop in both pollen quality and quantity [29] where in cold stress alters anther’s metabolic pathways, causing pollen sterility. Sharma KD. et al. [30] reported that unlike vulnerable plants, cold-tolerant plants produce a large amount of viable pollen. The mapping of significant SNPs was onto chromosomes 1, 2, 4, 8, 9, 11 and 12 mostly for pollen viability. Xu J. et al. [31] also mapped a single QTL for pollen viability on chromosome 11 in tomato. The gene Solyc04g082980.2 annotated as Tetratricopeptide repeat protein SKI3 was identified for the trait. Rosado A. et al [32] reported that Tetratricopeptide repeat protein regulates the transcript levels of several dehydration-responsive genes, such as the transcription factor DREB2A and genes encoding dehydration response proteins, such as ERD1 (early response to dehydration 1), ERD3 and COR15a. Furthermore Solyc02g022930.3 annotated as 3-hydroxyisobutyrate dehydrogenase was identified and was also colocalized for the trait malondialdehyde which is a marker of oxidative lipid injury. Schertl P. et al. [33] reported that under abiotic stress conditions in Arabidopsis, 3-hydroxyisobutyrate dehydrogenase acts as a NADH-generating enzyme of the branched chain amino acid degradation pathway. For total leaf chlorophyll content the mapping of significant SNPs for total leaf chlorophyll content was onto chromosomes 1, 2, 3, 8, 9, 10 and 11 mostly. Efrati A. et al. [34] mapped chlorophyllase on to chromosomes 6 (CHLASE 1) and chromosome 9 (CHLASE 2) in tomato lines. Gene Solyc11g017345.1 annotated as F-box/FBD/LRR-repeat protein was identified. F-box proteins regulate diverse cellular processes, including cell cycle transition, transcriptional regulation and signal transduction, by playing roles in Skp1p-cullin-F-box protein (SCF) complexes or non-SCF complexes [35]. At least 43 F-box protein-encoding genes have been found to be differentially expressed in rice seedlings subjected to different abiotic stress conditions [36]. It was also colocalized for the trait fruit shape index indicating that if one trait is introgressed other one will simultaneously improve which are key for achieving resilience to cold stress tolerance. Also lycopene is one of the most important carotenoid contributing for 98% of the pigment of tomato fruit colour. In rice seedlings [37] reported that carotenogenic gene mutations lead to increased oxidative stress in rice seedlings and verified the anti-oxidative function of carotenoids. The mapping of significant SNPs for lycopene was onto chromosomes 2, 5, 6, 8, 9, 10 and 11 mostly. Zhang J. et al. [38] in tomato detected significant marker trait associations for carotenoid-derived volatiles on chromosomes 6, 9 and 11 explaining > 10% phenotypic variation. Gene named Thioredoxin was identified. da Fonseca-Pereira P. et al. [39] reported that in Arabidopsis, Thioredoxin transcripts were more highly expressed under drought and produce an effect that is stronger during repetitive drought/recovery events.

Apart from reduced content of few compound’s there are certain compounds which accumulate in response to cold stress in tolerant plants and some which act as protectants. One of the most important amino acid known to impart stress tolerance is proline. A link between stress tolerance acquisition and proline accumulation has been established by [40]. Highly significant marker trait associations were discovered for proline content. For proline content the mapping of significant SNPs was onto chromosome 1, 2, 8 and 11 mostly. Sauvage C. et al. [41] also reported significant association for proline content on chromosome 2 in tomato. Similarly chilling-induced oxidative damage in tomatoes was reported to be mitigated by ascorbic acid [42]. It works by reducing electrolyte loss, lipid peroxidation and hydrogen peroxide levels. For ascorbic acid the mapping of significant SNPs was mostly onto chromosomes 2, 3, 9, 10, 11 and 12 mostly. Stevens R. et al. [43] identified common regions controlling ascorbic acid content on chromosomes, 9 and 10 in tomato and on chromosomes 9 and 11, [41] discovered significant associations for ascorbic acid content in tomato. Also, [44] in their research linked one locus to a previously identified ascorbic acid content large-effect QTL on chromosome 9 by [43] in tomato. Significant marker trait associations were detected on chromosome 3, 11 in tomato by [45]. Gene Solyc03g007550.4 annotated as LIM domain protein was identified for the trait. Park CJ et al. [46] reported that LIM genes have been involved in resistance against biotic and abiotic stresses in Brassica. It was also colocalized for the traits viz., number of fruits per plant thereby indicating an important correlation with fruit yield improvement under cold stressed conditions. Also colocalization with the traits viz., number of fruits per truss, number of fruits per plant, fruit yield per plant under normal and protected conditions indicates that simultaneous improvement can be done for many traits.

Compounds such as phenols and sugars protect plant cells against damage induced by cold stress in a variety of ways, including functioning as osmoprotectants, nutrients and interacting with the lipid bilayer [47]. Phenolic substances which are non-enzymatic antioxidants (Phenolic acids and flavonoids) can trap and scavange reactive oxygen species (ROS). For total phenols the mapping of significant SNPs was onto chromosome 1, 2, 3, 6, 7, 8, 9, 10 and 11 mostly. Ruggieri V. et al. [48] reported that phenol content was associated with markers on chromosomes 8 and chromosome 11 in tomato. The mapping of significant SNPs for total soluble sugars was onto chromosomes 1,2,3,6, 8, 9 and 11 mostly. Similar significant marker trait associations for soluble solids on chromosomes 2, 8 and 9 in tomato were detected by [45, 49] also detected significant associations for soluble solids on chromosome 2 in tomato. Significant marker trait associations for sugars on chromosomes 1, 2, 3 and 11 in tomato by were detected by [50]. Significant associations for soluble solid content on chromosomes 2, 8, 9 and 11 were detected by [50]. Sauvage C. et al. [41] also reported significant associations for sucrose content on chromosomes 2 and rhamnose on chromosome 8 in tomato.

To study yield and yield related traits in response to cold stress is very important aspect of the study because yield is the ultimate goal of a breeder along with climate resilience. For seedling length at transplant the mapping of significant SNPs was onto chromosomes 4, 5,6,8,9 and 11 mostly. Similar results were also observed by [51] in tomato mapped QTLs on chromosomes 1, 4, 6, 9 and 11 for various seedling traits viz., for hypocotyl length on chromosome 6, shoot weight on chromosome 9 and total root size on chromosome 9 and 11. For number of days to first harvest the mapping of significant SNPs was onto chromosomes 1, 2, 3, 8, 9 and 11 mostly. Significant association was observed by [52] in tomato for number of days between planting and ripening date of first fruit on chromosome 2 and for number of days between flowering and red ripe stage on chromosome 2. Significant marker trait associations in tomato for time to ripen on chromosomes 1, 2, 3, 9 and 10 were also observed by [50] in tomato.

For number of flowers per truss the mapping of significant SNPs was onto chromosome 3, 8, 9, 11 and 12 mostly. Significant associations were mapped for flower number on the third inflorescence on chromosome 9 by [44] in tomato. Also another important gene Solyc11g039880.2 annotated as Nucleoporin was identified. Dong CH et al. [53] reported that Nucleoporin (AtNUP160) in Arabidopsis was critical for the nucleocytoplasmic transport of mRNAs and it played important roles in plant growth and flowering time regulation and is required for cold stress tolerance. A defect in Nup160/SAR1 was also found to sensitize plants to chilling stress and disrupt acquired freezing tolerance. This gene is important in context of flowering under cold stressed conditions. For number of fruits per truss the mapping of significant SNPs was onto chromosome 1, 3, 7, 9, 11 and 12 mostly. Significant associations were mapped for fruit number on the third truss on chromosome 9 by [44] in tomato. Solyc11g150146.1 annotated as Elongation factor 1-alpha was identified and it was colocalized for the traits seedling vigor index and duration of harvest.

For fruit yield per plant the mapping of significant SNPs was onto chromosomes 1, 3, 4, 6, 7, 8, 9, 10, 11 and 12 mostly. Phan NT et al. [54] reported significant marker trait associations for fruit height on chromosomes 1, 3, 4, 7, 8; for fruit width on chromosomes 1, 3, 6, 10 and 11and for fruit weight on chromosomes 1, 3, 4, 6, 10 and 11 in tomato. For average fruit weight the mapping of significant SNPs was onto chromosomes 7, 8, 9 and 11 mostly. Significant associations were found for fruit weight on chromosome 11 by [49] and [45] in tomato while on chromosomes 7 and 9 by [52]. Candidate genes Solyc07g065840.2 annotated as molecular chaperone Hsp90-2 was identified it was also colocalized for the trait fruit yield per plant. Park CJ et al. [46] reported that heat shock proteins (HSPs) functioning as molecular chaperones are the key components responsible for protein folding, assembly, translocation and degradation under stress conditions and in many normal cellular processes. Shirasu K. et al. [55] also reported that HSP90s also play essential roles in plant immunity. Fruit weight is one the most important traits contributing to fruit yield and colocalization indicates a higher scope of improvement of fruit yield. Also in case of number of fruits the mapping of significant SNPs was onto chromosomes 2, 3, 9, 10, 11 and 12 mostly. Significant marker trait associations for fruit number on chromosomes 2, 10, 11 and 12 were reported by [50] in tomato. For fruit shape index the mapping of significant SNPs was onto chromosomes 1, 2, 3, 4, 6, 8, 9 and 11 mostly. Significant associations were found for fruit shape on chromosome 1, 8 and 9 by [45], on chromosome 8 by [49] and on chromosome 1, 2, 3, 4, 6, 8, 9 and 11 by [56] in tomato.

Conclusion

A total of 4517 significant marker trait associations were revealed for cold tolerance traits which hampers the growth and development of the crop throughout the season. Also total of 5727 significant marker trait associations were revealed for yield and yield related traits uncovering important associations for fruit yield and directly contributing traits. 685 candidate genes were identified among all traits under study. Based on functional categorization, 60 genes were found to be associated with biological processes in these genomic regions. 7 among 60 were directly found to be related to abiotic stress tolerance and function directly or indirectly as stress responsive genes. This study could be used for breeding programme for developing high yielding hybrids.

Availability of data and materials

The datasets generated and/or analysed during the current study are available in the NCBI repository. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE200034.

References

  1. Chen P, Zhang W, Wang X, Zhao K, Negi DS, Zhuo L, et al. Lycopene and risk of Prostate cancer: a systematic review and meta-analysis. Medicine. 2015;94(33):886.

    Article  Google Scholar 

  2. Karppi J, Kurl S, Nurmi T, Rissanen TH, Pukkala E, Nyyssönen K. Serum lycopene and the risk of cancer: the Kuopio Ischaemic Heart Disease Risk factor (KIHD) study. Ann Epidemiol. 2009;19(7):512–8.

    Article  PubMed  Google Scholar 

  3. Foolad M, Lin G. Relationship between cold tolerance during seed germination and vegetative growth in tomato: germplasm evaluation. J Am Soc Hortic Sci. 2000;125(6):679–83.

    Article  Google Scholar 

  4. Venema J, Linger P, Van Heusden A, Van Hasselt P, Brüggemann W. The inheritance of chilling tolerance in tomato (Lycopersicon spp). Plant Biol. 2005;7(02):118–30.

    Article  PubMed  CAS  Google Scholar 

  5. Foolad M, Lin G. Genetic analysis of cold tolerance during vegetative growth in tomato, Lycopersicon esculentum Mill. Euphytica. 2001;122:105–11.

    Article  Google Scholar 

  6. Roorkiwal M, Bharadwaj C, Barmukh R, Dixit GP, Thudi M, Gaur PM, et al. Integrating genomics for chickpea improvement: achievements and opportunities. Theor Appl Genet. 2020;133:1703–20.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Jha UC, Nayyar H, Palakurthi R, Jha R, Valluri V, Bajaj P, et al. Major QTLs and potential candidate genes for heat stress tolerance identified in chickpea (Cicer arietinum L). Front Plant Sci. 2021;12: 655103.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex Diseases. Nature. 2009;461(7265):747–53.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  9. Yamaguchi-Shinozaki K, Shinozaki K. Transcriptional regulatory networks in cellular responses and tolerance to dehydration and cold stresses. Annu Rev Plant Biol. 2006;57:781–803.

    Article  PubMed  CAS  Google Scholar 

  10. Stockinger EJ, Gilmour SJ, Thomashow MF. Arabidopsis thaliana CBF1 encodes an AP2 domain-containing transcriptional activator that binds to the C-repeat/DRE, a cis-acting DNA regulatory element that stimulates transcription in response to low temperature and water deficit. Proceedings of the National Academy of Sciences. 1997;94(3):1035–40.

    Article  CAS  Google Scholar 

  11. Novillo F, Medina J, Salinas J. Arabidopsis CBF1 and CBF3 have a different function than CBF2 in cold acclimation and define different gene classes in the CBF regulon. Proc Nat Acad Sci. 2007;104(52):21002–7.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. Gilmour SJ, Sebolt AM, Salazar MP, Everard JD, Thomashow MF. Overexpression of the Arabidopsis CBF3 transcriptional activator mimics multiple biochemical changes associated with cold acclimation. Plant Physiol. 2000;124(4):1854–65.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  13. Dhindsa RS. Inhibition of protein synthesis by products of lipid peroxidation. Phytochemistry. 1982;21(2):309–13.

    Article  CAS  Google Scholar 

  14. Bates L, Waldren Ra, Teare I. Rapid determination of free proline for water-stress studies. Plant Soil. 1973;39:205–7.

    Article  CAS  Google Scholar 

  15. Sadasivam S. Biochemical methods. New Age International; 1996.

  16. Adejo GO. Antioxidant, total lycopene, ascorbic acid and microbial load estimation in powdered tomato varieties sold in Dutsin-Ma market. Open Access Library Journal. 2015;2(08):1.

    Google Scholar 

  17. Bray H, Thorpe W. Standard methods of biochemical analysis. New Delhi: Kalyani Publishers; 1954. p. 287–8.

    Google Scholar 

  18. Olivoto T, Lúcio ADC. Metan: an R package for multi-environment trial analysis. Methods Ecol Evol. 2020;11(6):783–9.

    Article  Google Scholar 

  19. Pritchard JK, Stephens M, Rosenberg NA, Donnelly P. Association mapping in structured populations. Am J Hum Genet. 2000;67(1):170–81.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  20. Marroni F, Pinosio S, Di Centa E, Jurman I, Boerjan W, Felice N, et al. Large-scale detection of rare variants via pooled multiplexed next‐generation sequencing: towards next‐generation ecotilling. Plant J. 2011;67(4):736–45.

    Article  PubMed  CAS  Google Scholar 

  21. Zhang Z, Ersoz E, Lai C-Q, Todhunter RJ, Tiwari HK, Gore MA, et al. Mixed linear model approach adapted for genome-wide association studies. Nat Genet. 2010;42(4):355–60.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  22. Turner SD. Qqman: an R package for visualizing GWAS results using QQ and manhattan plots. Biorxiv. 2014;14:005165.

    Google Scholar 

  23. Reddy B, Reddy D, Reddaiah K, Sunil N. Studies on genetic variability, heritability and genetic advance for yield and quality traits in tomato (Solanum lycopersicum L). Int J Curr Microbiol Appl Sci. 2013;2(9):238–44.

    Google Scholar 

  24. Rai AK, Vikram A, Pandav A. Genetic variability studies in tomato (Solanum lycopersicum L.) for yield and quality traits. Int J Agric Environ Biotechnol. 2016;9(5):739–44.

    Article  Google Scholar 

  25. Consortium TGS, Aflitos S, Schijlen E, de Jong H, de Ridder D, Smit S, et al. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing. Plant J. 2014;80(1):136–48.

    Article  Google Scholar 

  26. Pidigam S, Thuraga V, Munnam SB, Amarapalli G, Kuraba G, Pandravada SR, et al. Genetic diversity, population structure and validation of SSR markers linked to Sw-5 and I-2 genes in tomato germplasm. Physiol Mol Biology Plants. 2021;27:1695–710.

    Article  CAS  Google Scholar 

  27. Spooner DM, Peralta IE, Knapp S. Comparison of AFLPs with other markers for phylogenetic inference in wild tomatoes [Solanum L. Section Lycopersicon. (Mill) Wettst ] Taxon. 2005;54(1):43–61.

    Article  Google Scholar 

  28. Martínez JP, Fuentes R, Farías K, Lizana C, Alfaro JF, Fuentes L, et al. Effects of salt stress on fruit antioxidant capacity of wild (Solanum chilense) and domesticated (Solanum lycopersicum var. cerasiforme) tomatoes. Agronomy. 2020;10(10): 1481.

    Article  Google Scholar 

  29. Domínguez E, Cuartero J, Fernández-Muñoz R. Breeding tomato for pollen tolerance to low temperatures by gametophytic selection. Euphytica. 2005;142:253–63.

    Article  Google Scholar 

  30. Sharma KD, Nayyar H. Regulatory networks in pollen development under cold stress. Front Plant Sci. 2016;7:402.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Xu J, Driedonks N, Rutten MJ, Vriezen WH, de Boer G-J, Rieu I. Mapping quantitative trait loci for heat tolerance of reproductive traits in tomato (Solanum lycopersicum). Mol Breeding. 2017;37:1–9.

    Article  Google Scholar 

  32. Rosado A, Schapire AL, Bressan RA, Harfouche AL, Hasegawa PM, Valpuesta V, et al. The Arabidopsis tetratricopeptide repeat-containing protein TTL1 is required for osmotic stress responses and abscisic acid sensitivity. Plant Physiol. 2006;142(3):1113–26.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  33. Schertl P, Danne L, Braun H-P. 3-Hydroxyisobutyrate dehydrogenase is involved in both, valine and isoleucine degradation in Arabidopsis thaliana. Plant Physiol. 2017;175(1):51–61.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  34. Efrati A, Eyal Y, Paran I. Molecular mapping of the chlorophyll retainer (cl) mutation in pepper (Capsicum spp.) and screening for candidate genes using tomato ESTs homologous to structural genes of the chlorophyll catabolism pathway. Genome. 2005;48(2):347–51.

    Article  PubMed  CAS  Google Scholar 

  35. Kuroda H, Takahashi N, Shimada H, Seki M, Shinozaki K, Matsui M. Classification and expression analysis of Arabidopsis F-box-containing protein genes. Plant Cell Physiol. 2002;43(10):1073–85.

    Article  PubMed  CAS  Google Scholar 

  36. Jain M, Nijhawan A, Arora R, Agarwal P, Ray S, Sharma P, et al. F-box proteins in rice. Genome-wide analysis, classification, temporal and spatial gene expression during panicle and seed development, and regulation by light and abiotic stress. Plant Physiol. 2007;143(4):1467–83.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  37. Fang J, Chai C, Qian Q, Li C, Tang J, Sun L, et al. Mutations of genes in synthesis of the carotenoid precursors of ABA lead to pre-harvest sprouting and photo‐oxidation in rice. Plant J. 2008;54(2):177–89.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  38. Zhang J, Zhao J, Xu Y, Liang J, Chang P, Yan F, et al. Genome-wide association mapping for tomato volatiles positively contributing to tomato flavor. Front Plant Sci. 2015;6:1042.

    Article  PubMed  PubMed Central  Google Scholar 

  39. da Fonseca-Pereira P, Daloso DM, Gago J, de Oliveira Silva FM, Condori-Apfata JA, Florez-Sarasa I, et al. The mitochondrial thioredoxin system contributes to the metabolic responses under drought episodes in Arabidopsis. Plant Cell Physiol. 2019;60(1):213–29.

    Article  PubMed  Google Scholar 

  40. Nanjo T, Fujita M, Seki M, Kato T, Tabata S, Shinozaki K. Toxicity of free proline revealed in an Arabidopsis T-DNA-tagged mutant deficient in proline dehydrogenase. Plant Cell Physiol. 2003;44(5):541–8.

    Article  PubMed  CAS  Google Scholar 

  41. Sauvage C, Segura V, Bauchet G, Stevens R, Do PT, Nikoloski Z, et al. Genome-wide association in tomato reveals 44 candidate loci for fruit metabolic traits. Plant Physiol. 2014;165(3):1120–32.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  42. Elkelish A, Qari SH, Mazrou YS, Abdelaal KA, Hafez YM, Abu-Elsaoud AM, et al. Exogenous ascorbic acid induced chilling tolerance in tomato plants through modulating metabolism, osmolytes, antioxidants, and transcriptional regulation of catalase and heat shock proteins. Plants. 2020;9(4): 431.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  43. Stevens R, Buret M, Duffé P, Garchery C, Baldet P, Rothan C, et al. Candidate genes and quantitative trait loci affecting fruit ascorbic acid content in three tomato populations. Plant Physiol. 2007;143(4):1943–53.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  44. Ye J, Li W, Ai G, Li C, Liu G, Chen W, et al. Genome-wide association analysis identifies a natural variation in basic helix-loop-helix transcription factor regulating ascorbate biosynthesis via D-mannose/L-galactose pathway in tomato. PLoS Genet. 2019;15(5): e1008149.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  45. Zhang J, Zhao J, Liang Y, Zou Z. Genome-wide association-mapping for fruit quality traits in tomato. Euphytica. 2016;207:439–51.

    Article  CAS  Google Scholar 

  46. Park C-J, Seo Y-S. Heat shock proteins: a review of the molecular chaperones for plant immunity. Plant Pathol J. 2015;31(4):323.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  47. Yuanyuan M, Yali Z, Jiang L, Hongbo S. Roles of plant soluble sugars and their responses to plant cold stress. Afr J Biotechnol. 2009;8(10):2004–10.

    Google Scholar 

  48. Ruggieri V, Francese G, Sacco A, D’Alessandro A, Rigano MM, Parisi M, et al. An association mapping approach to identify favourable alleles for tomato fruit quality breeding. BMC Plant Biol. 2014;14(1):1–15.

    Article  Google Scholar 

  49. Tripodi P, Soler S, Campanelli G, Díez MJ, Esposito S, Sestili S, et al. Genome wide association mapping for agronomic, fruit quality, and root architectural traits in tomato under organic farming conditions. BMC Plant Biol. 2021;21(1):1–22.

    Article  Google Scholar 

  50. Diouf IA, Derivot L, Bitton F, Pascual L, Causse M. Water deficit and salinity stress reveal many specific QTL for plant growth and fruit quality traits in tomato. Front Plant Sci. 2018;9: 279.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Khan N, Kazmi RH, Willems LA, Van Heusden AW, Ligterink W, Hilhorst HW. Exploring the natural variation for seedling traits and their link with seed dimensions in tomato. 2012.

  52. Bauchet G, Grenier S, Samson N, Bonnet J, Grivet L, Causse M. Use of modern tomato breeding germplasm for deciphering the genetic control of agronomical traits by Genome Wide Association study. Theor Appl Genet. 2017;130:875–89.

    Article  PubMed  CAS  Google Scholar 

  53. Dong C-H, Hu X, Tang W, Zheng X, Kim YS, Lee B-h, et al. A putative Arabidopsis nucleoporin, AtNUP160, is critical for RNA export and required for plant tolerance to cold stress. Mol Cell Biol. 2006;26(24):9533–43.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  54. Phan NT, Trinh LT, Rho M-Y, Park T-S, Kim O-R, Zhao J, et al. Identification of loci associated with fruit traits using genome-wide single nucleotide polymorphisms in a core collection of tomato (Solanum lycopersicum L). Sci Hort. 2019;243:567–74.

    Article  CAS  Google Scholar 

  55. Shirasu K. The HSP90-SGT1 chaperone complex for NLR immune sensors. Annu Rev Plant Biol. 2009;60:139–64.

    Article  PubMed  CAS  Google Scholar 

  56. Rodriguez M, Scintu A, Posadinu CM, Xu Y, Nguyen CV, Sun H, et al. GWAS based on RNA-Seq SNPs and high-throughput phenotyping combined with climatic data highlights the reservoir of valuable genetic diversity in regional tomato landraces. Genes. 2020;11(11): 1387.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

Download references

Funding

The work was financially supported by SERB, Govt. of India, Grant No. EMR/2016/005598 to Khalid Z. Masoodi. The bioinformatics facility was provided by NAHEP-IDP-SKUAST-K. The research was also funded by PURSE grant funding (SR/PURSE/2022/124) to Dr. Khalid Z. Masoodi.

Author information

Authors and Affiliations

Authors

Contributions

Funding acquisition K.Z.M., conceptualization K.Z.M, N.A., methodology, N.A., K.Z.M, L.R.S., Supervision, K.Z.M., N.A., formal analysis, K.Z.M, S.M., A.B.S., L.R.S., data curation, K.Z.M., I.K., S.M writing-original draft preparation, L.R.S, K.Z.M., T.K., Bioinformatic analysis, K.Z.M., B.B. Review and editing, I.M., K.Z.M., K.H., S.M., B.A., S.N. All authors have read and approved the manuscript.

Corresponding authors

Correspondence to Sheikh Mansoor or Khalid Z. Masoodi.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shah, L.R., Ahmed, N., Hussain, K. et al. Mapping phenotypic performance and novel SNPs for cold tolerance in tomato (Solanum lycopersicum) genotypes through GWAS and population genetics. BMC Genom Data 25, 9 (2024). https://doi.org/10.1186/s12863-024-01190-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12863-024-01190-5

Keywords