- Research article
- Open Access
DNA polymorphism and selection at the bindin locus in three Strongylocentrotus sp. (Echinoidea)
© Balakirev et al. 2016
- Received: 29 September 2015
- Accepted: 2 May 2016
- Published: 12 May 2016
The sperm gene bindin encodes a gamete recognition protein, which plays an important role in conspecific fertilization and reproductive isolation of sea urchins. Molecular evolution of the gene has been extensively investigated with the attention focused on the protein coding regions. Intron evolution has been investigated to a much lesser extent. We have studied nucleotide variability in the complete bindin locus, including two exons and one intron, in the sea urchin Strongylocentrotus intermedius represented by two morphological forms. We have also analyzed all available bindin sequences for two other sea urchin species, S. pallidus and S. droebachiensis.
The results show that the bindin sequences from the two forms of S. intermedius are intermingled with no evidence of genetic divergence; however, the forms exhibit slightly different patterns in bindin variability. The level of the bindin nucleotide diversity is close for S. intermedius and S. droebachiensis, but noticeably higher for S. pallidus. The distribution of variability is non-uniform along the gene; however there are striking similarities among the species, indicating similar evolutionary trends in this gene engaged in reproductive function. The patterns of nucleotide variability and divergence are radically different in the bindin coding and intron regions. Positive selection is detected in the bindin coding region. The neutrality tests as well as the maximum likelihood approaches suggest the action of diversifying selection in the bindin intron.
Significant deviation from neutrality has been detected in the bindin coding region and suggested in the intron, indicating the possible functional importance of the bindin intron variability. To clarify the question concerning possible involvement of diversifying selection in the bindin intron evolution more data combining population genetic and functional approaches are necessary.
- Intron Region
- Noncoding Region
- Neutrality Test
- Synonymous Codon
- Codon Bias
Bindin is one of the most thoroughly investigated genes in sea urchins (reviews in [1–8]). The gene is expressed in males during spermatogenesis [9, 10] and encodes a sperm protein mediating species-specific gamete adhesion and membrane fusion during sea urchin fertilization. Bindin recognizes two different egg surface species-specific sperm receptors, 350-kDa and EBR1 [1, 8]. The bindin protein is the main content of the sea urchin sperm acrosomal vesicles and it serves as molecular glue connecting the sperm to the egg [8, 11]. It has been suggested that bindin plays an important role in post-mating pre-zygotic isolation accounting for reproductive isolation between sea urchin species [3, 5–8, 12, 13]. Indeed, reproductive success has been shown to correlate with the level of bindin sequence divergence [14–16]. Zigler et al.  detected a clear correlation between nonsynonymous bindin divergence and average gametic incompatibility in 14 sea urchin species pairs; however, no such correlation was found between mitochondrial divergence and gamete compatibility [17, 18].
The mature bindin protein is highly variable in length, ranging from 193 to 418 amino acids among ten sea urchin genera from which it has been sequenced [3, 4]. Comparisons of DNA sequences from ten sea urchin species, belonging to six orders, have revealed the presence of a central region in which 55–60 amino acids are highly conserved (the “core” region), flanked by two variable regions [3, 4]. The conserved fusogenic “B18” region has remained unchanged over the entire 250 million year span of extant echinoid evolution [3, 4]. It includes a stretch of 18 amino acids involved in membrane fusion . Sea stars and sea urchins, which diverged roughly 450–500 million years ago, have just one amino acid difference within the B18 region . The flanking bindin repeats, which vary in length among different species, bestow a species recognition mechanism .
The bindin gene is variable not only structurally but also in the pattern of evolution. In some species the evolution of the gene does not deviate from neutrality; however, in other cases strong positive selection was apparent (review in [4–7]). Adaptive divergence of the bindin coding region was detected in four sea urchin genera, Echinometra [22–24], Strongylocentrotus [25, 26], Heliocidaris , and Paracentrotus [28, 29], but not in four other genera, Arbacia [30, 31], Tripneustes , Lytechinus , and Mesocentrotus . Positive selection on bindin variation was detected also in the sea stars Patiria miniata  and Pisaster ochraceus , but not in Pisaster brevispinus, which is consistent with greater polyspermy harm in P. ochraceus .
A number of hypotheses explaining specific bindin evolution in different sea urchin genera have been suggested, including reinforcement  (selection for prezygotic isolation to prevent heterospecific fertilization and avoid production of inferior hybrids), sexual selection at the cellular level [14, 38] (evolution of reproductive traits including “cryptic female choice”), sexual conflict [39, 40] (balance between sperm competition and egg polyspermy avoidance), and immunological defense  (antagonistic coevolution with pathogens). All hypotheses are based on the analysis of the bindin protein-coding region (without consideration the intron) (for details see [2, 4–7, 13]).
Since the time of bindin discovery as a critical protein mediating species-specific gamete recognition and fertilization in sea urchins (review in ), a wealth of data have been obtained concerning the molecular evolution of this protein (as well as other proteins involved in reproduction). This group of proteins was considered one of the most convincing examples of evolution driven by positive selection (review in [5, 7, 13]). However, the mechanisms that drive adaptive diversification of reproductive proteins remain largely unknown, as pointed out by Vacquier and Swanson  (page 14): “The factors responsible for such strong selection on fertilization proteins are at present difficult to define with certainty”. No noticeable divergence in bindin was detected between the sympatric sea urchin species Pseudoboletia indiana and P. maculata , or the subspecies Heliocidaris erythrogramma erythrogramma and H. e. armigera , which hybridize extensively [42, 43]. The data of Addison and Pogson  on asymmetric introgression among strongylocentrotid sea urchins demonstrates that gamete traits alone cannot be responsible for maintaining species integrities; and that genetic boundaries between strongylocentrotid sea urchin species in the northeast Pacific appear to be related to postzygotic isolating mechanisms, which were consistently associated with divergence times, but not with intrinsic gametic incompatibilities per se. These observations challenge a generally accepted doctrine that considers that gamete recognition proteins are the critical factor for developing and maintaining species boundaries, and for the evolution of gamete incompatibility in spawning marine invertebrates (see references above).
The vast majority of the bindin gene studies have focused on coding sequence variation and/or divergence. The intron sequences usually were not investigated at all or only superficially. Ignoring the intron sequences is surprising, even from the structural point of view. The bindin gene of strongylocentrotid sea urchins has a single intron about 0.9 kb in length (ranging from 908 to 965 bp in different individuals and species), and two exons of the mature bindin (around 0.5 kb; ranging from 507 to 591 bp in different individuals and species). The bindin intron may contain important regulatory sites as it has been shown for many other introns (e.g., ) and represent a rich source of adaptively important variation (e.g., [46, 47]). Consequently, we have investigated bindin nucleotide polymorphism and divergence in both, the coding region and the intron in Strongylocentrotus sea urchins, the model group used for studying the evolution of reproductive proteins .
Three congeneric sea urchin species, Strongylocentrotus intermedius (A. Agassiz, 1863), S. pallidus (G. O. Sars, 1871), and S. droebachiensis (O. F. Müller, 1776) represent a monophyletic clade within the family Strongylocentrotidae . The species are common in the North Atlantic, Arctic, and Pacific continental regions [49, 50]. The Strongylocentrotus sea urchins are important farmed and harvested species in many countries including Japan, China, North and South Korea, Norway, Russia, Canada, and U.S.A. . The most widely distributed species, S. pallidus, inhabits the North Atlantic, Arctic, and Pacific continental shelves and slopes, with highest abundances at depths of 150–300 m and up to 1600 m . S. droebachiensis has somewhat similar but not so wide geographical distribution as S. pallidus, preferring littoral waters, with highest abundances at 5–10 m of depth. These two species typically have clearly different depth distribution [49, 51, 52]. In shallow areas, however, S. pallidus and S. droebachiensis may occur sympatrically [49, 53]. The distribution of S. intermedius is limited to the northwest Pacific region, including the Sea of Japan, the Sea of Okhotsk and the East coast of Kamchatka, the Southern Kuril Islands, and the coast of Japan [49, 50]. This species mostly occurs from the littoral and upper sublittoral zone down to a depth of 25 m. The northern Primorye (Sea of Japan) populations of S. intermedius consist of two sympatric morphological forms, “usual” (U) and “gray” (G). The two forms are different in morphology and preferred bathymetric distribution. We have shown that these two forms predominantly harbored highly divergent bacterial symbiont lineages, although they were not distinguished genetically . The geographical ranges of S. intermedius, S. pallidus, and S. droebachiensis overlap in the eastern Sakhalin and Kuril islands, and in the western coast of the Sea of Japan. Thus, all three species can occur sympatrically in some parts of their geographic distribution. However, direct evidence for mixing settlements is not available, and differences in ecological preferences may make the mixing unlikely. Nevertheless, Addison and Hart  detected in mitochondrial DNA significant evidence of introgression between S. pallidus and S. droebachiensis; which was later confirmed with four nuclear loci .
The purpose of the present study is to investigate the patterns of nucleotide variability in the complete bindin locus, including the two exons and one intron of S. intermedius (represented by two morphological forms) from the North Primorye population (the Sea of Japan). For comparative purposes, we have analyzed the bindin sequences of two congeners, S. pallidus and S. droebachiensis, obtained from GenBank. We have detected very different patterns of nucleotide variability and divergence in the coding and intron regions, but with striking similarity among all three species studied. A clear signal of positive selection was detected in the coding region; neutrality tests as well as maximum likelihood analyses suggest the action of diversifying selection in the bindin intron. We have also analyzed the two morphological forms of S. intermedius separately and found only slight differences among them concerning the patterns of bindin variability.
Sea urchin samples and sequences
A sample of S. intermedius (25 individuals) was obtained from the sea urchin settlement close to Cape Zolotoi (46°15’086”N, 138°06’646”E; Sea of Japan, Pacific Ocean). The U (12 individuals) and G (13 individuals) forms were collected at depths of 5 to 10 m and 15 to 20 m, respectively. The procedures for DNA extraction, amplification, cloning, and sequencing have been described previously [47, 56, 57]. A 1,809 bp fragment of the nuclear gene bindin was amplified using primers: 5′-tctgacgattcgaaaagaggag-3′ (forward primer) and 5′-attagcgtctatatctagttag-3′ (reverse primer). The alignment of the bindin sequences of S. franciscanus (M59490; ) and S. purpuratus (M14487; ) was used to design the primers. The amplified fragments include the complete bindin coding region (285 codons without counting the terminating stop codon) consisting of exon I (237 bp), intron (951 bp), and exon II (621 bp) that encompass the complete mature bindin protein. The 273-bp repeat region of exon II containing a 21-bp repeat motif was excluded from the analysis because orthology/paralogy relationships are uncertain in these sequences. Twelve bindin sequences are from Balakirev et al.  (EU003202-EU003213). A 1,056-bp fragment of the mitochondrial gene encoding cytochrome c oxidase subunit 1 (COI) was amplified in 59 individuals of S. intermedius using the following primers: 5′-acactttatttgatttttgg-3′ (forward) and 5′-cccattgaaagaacgtagtgaaagtg-3′ (reverse) . These sequences include the mitochondrial DNA region covering 352 codons of the COI gene, corresponding to positions 5854 to 6909 in the complete S. purpuratus mitochondrial sequence . The new sequences have been deposited in GenBank under accession numbers KP774723—KP774781 (COI) and KP774782—KP774794 (bindin). (See Additional file 1: Text S1 for PCR details and Text S2 for the bindin sequences of the genus Strongylocentrotus and close species obtained from the GenBank database.)
DNA sequence analysis
The bindin sequences were assembled using the program SeqMan (Lasergene, DNASTAR, Inc.). Multiple sequence alignment was carried out using CLUSTAL W . DnaSP, v. 5  and PROSEQ, v. 2.9  were used to analyze the data by the “sliding window” method , and for most intraspecific analyses; MEGA, v. 5  was used for basic phylogenetic analyses (see [57, 67]). Departures from neutral expectations were investigated using the tests HKA , Tajima’s , McDonald and Kreitman’s , Fu and Li’s , Hudson’s et al. , McDonald’s [73, 74], Kelly’s , Depaulis and Veuille’s , and Wall’s . The HKA test was used to compare each bindin exon to the other, or compare the exons to the intron (the mitochondrial COI gene is not appropriate to use for the HKA test as a reference sequence to assess the deviation from neutrality in the nuclear bindin gene). The permutation approach of Hudson et al.  was used to estimate the significance of sequence differences between the morphological forms. Simulations based on the coalescent process with or without recombination [79–81] were performed with the DnaSP and PROSEQ programs to estimate the probabilities of the observed values of Tajima’s D, Kelly’s Z nS and Wall’s B and Q statistics and confidence intervals of the nucleotide diversity values. Simulations with 10,000 replicates were conditional on the sample size, the observed number of segregating sites, and the alignment length, with the population recombination rate parameter, ρ (or 4N 0 r) set to the gene estimates. The method of Sawyer  was used to detect gene conversion events. The population recombination rate was analyzed with the permutation-based approach . The alignments were also analyzed for evidence of recombination using various recombination detection methods implemented in the program RDP3 .
Codon-based sequence analyses
Probabilistic Markov codon-substitution models were fitted to coding alignments assuming phylogenetic trees reconstructed by the maximum likelihood (ML), under model LG + Г + F using PhyML v.3 . Model parameters were estimated using ML. These models measure selective pressure using the ratio of nonsynonymous to synonymous substitution rates ω = d N/d S, which may vary among sites. Positive or negative selection is evidenced by significant deviations of the ω-ratio from 1. We used models that assume constant synonymous rates M0, M3, M7, M8  and FMutSel0, FMutSel  as implemented in PAML v. 4 , and a model accounting for variability of synonymous rates over sites GYxHKY Dual GDD 3x3 , later referred as M3-Dual, and implemented in HYPHY . Hypotheses concerning selection, codon bias, and rate variability were tested using likelihood ratio tests (LRTs). For a review about the application of codon models, see Anisimova and Kosiol . Models combining coding and noncoding sequences were used to test for positive selection on noncoding regions, as implemented in EvoNC . The strength of selection on noncoding regions was measured by ζ, the ratio of the substitution rate in noncoding regions relative to the synonymous rate in coding regions. Under neutrality, these rates are expected to be similar (ζ ≈ 1). Significant deviations from 1 may be considered to be evidence of positive (ζ > 1) or negative (ζ < 1) selection on noncoding regions. Consequently, the null model allowed two classes of sites in noncoding regions: a neutral class with ζ = 1 and a class of sites evolving under negative selection where the average exonic synonymous rate was higher than the substitution rate in the noncoding regions (ζ < 1). The alternative model also allowed two classes of sites, but the rate ratio was estimated for both classes under constraints: ζ ≥ 1 for positive and neutral selection class, and ζ < 1 for the negatively selected class. A Bayesian approach was used to predict sites affected by positive selection in both coding and noncoding regions [86, 89, 92, 93].
Figure 1 indicates strong haplotype structure for the bindin gene in S. intermedius. There is a set of nineteen sequences, U-16, U-34, G-43, U-35, G-42, G-36, U-18, U-49, G-4, G-35, G-38, G-23, G-25, G-41, G-44, U-21, U-30, U-48, and G-45 that are very similar to each other and differ by a fixed single nucleotide deletion (position 857) and five synonymous and intronic substitutions from a second set of five sequences, G-33, U-36, G-30, U-43, and U-8. Moreover, three out of four replacement substitutions (positions 173, 179, and 1508) segregate within each set of sequences but not between them (Fig. 1). Strong haplotype structure is also detected for the two other Strongylocentrotus species, S. pallidus and S. droebachiensis (Additional file 2: Figures S2 and S3).
Highly divergent haplogroups were frequently observed in many genes of Arabidopsis (e.g.,  and references therein), Drosophila (e.g., [46, 47, 96–99]) and other eukaryotes including sea urchins [22–24, 26–29]; they may represent a signature of demographic and/or adaptive processes (e.g., [72, 100–102]). The pattern of variation observed in the genes with dimorphic haplotype structure would seem to be compatible with a constant-size neutral process with no recombination [101, 102]. Indeed, recombination is low for the bindin locus. The method of Hudson and Kaplan  reveals a minimum of three recombination events in the bindin region analyzed in S. intermedius and S. droebachiensis, but none in S. pallidus. Statistically significant signals of recombination are detected by seven methods implemented in the program RDP3  in S. intermedius and S. pallidus, but not in S. droebachiensis (Additional file 3: Table S1). The population recombination rate (ρ = 4N e r, where N e is the effective population size and r is the recombination rate/nucleotide site/generation) obtained by the coalescent-based method of McVean et al.  is 0.0047, 0.0005, and zero for S. intermedius, S. pallidus, and S. droebachiensis, respectively (Additional file 3: Table S2). Low levels of recombination had been also reported for the bindin gene of sea urchins Echinometra , Strongylocentrotus , Heliocidaris , and Paracentrotus . Thus, recombination is present within the bindin gene of the strongylocentrotid sea urchins; however, the frequency of recombination events is less pronounced in comparison, for instance, with some Drosophila genes [46, 47, 97, 98] or sperm bindin in oyster .
The patterns of haplotype structure and low recombination are consistent with the strong linkage disequilibrium (LD) within the bindin gene in all three sea urchin species. There are 66.4, 45.1, and 47.1 % significant associations between the informative sites (Fisher’s exact test) for S. intermedius, S. pallidus, and S. droebachiensis, respectively. After the Bonferroni correction, 8.3 and 15.0 % associations remain significant for S. intermedius and S. droebachiensis; none is significant for S. pallidus. The distribution of the LD values is non-uniform along bindin (Additional file 4: Figure S4; see below the subsection “Sliding window analysis”). Significant associations are mostly due to polymorphic sites located within the bindin exon I and intron in S. intermedius; there is also a clear peak in exon II of S. pallidus and S. droebachiensis (Additional file 4: Figure S4). LD is more pronounced in the S. intermedius U form (14.2 % significant associations with Fisher exact test) than in the G form (5.4 % significant associations).
Nucleotide diversity and divergence in the bindin gene of the sea urchins Strongylocentrotus intermedius, S. pallidus, and S. droebachiensis
Exon I + Exon II
The nucleotide variability and divergence detected in the bindin gene of Strongylocentrotus species is in the range of values observed in other genes involved in reproduction (reviews in [5, 7]). The silent variability of bindin in S. pallidus (π = 0.0195) is close to that of one of the most polymorphic genes of Drosophila melanogaster, ψEst-6 (π = 0.0253) . Extraordinary high level of intraspecific diversity has also been detected in oyster sperm bindin .
Sliding window analysis
We have compared the patterns of polymorphism in the three sea urchin species by calculating correlations of this estimate from sliding windows over the bindin gene (Fig. 3). To obtain equal number of windows for all species we excluded indels and the S. intermedius sequence U-21 with a 48-bp deletion in intron. We also excluded the recombinant sequences in S. intermedius (U-7; Fig. 1) and S. pallidus (PAL_AF133806 and PAL_AF077313; Additional file 2: Figure S2). The sliding window sizes were 100 nucleotides with 25-nucleotide increments, which produced 52 windows along the bindin gene used for the correlation analysis.
We have found significant correlations of polymorphism patterns between the S. intermedius—S. pallidus (Spearman’s coefficient of rank correlation, rho = 0.374; P = 0.0062) and S. droebachiensis—S. pallidus (rho = 0.441; P = 0.0011) species pairs. No correlation was however detected for the S. intermedius—S. droebachiensis (rho = − 0.043; P = 0.7606) species pair. S. intermedius and S. droebachiensis are the most diverged species among five sea urchins belonging to the genus Strongylocentrotus (Fig. 2). An absence of correlation in polymorphism patterns between these two species might be explained by the fact that the pronounced intron peak of polymorphism in S. intermedius occurs in slightly shifted coordinates (approximately on 185 bp) in comparison with S. pallidus and S. droebachiensis (this peak is marked by a vertical arrow on Fig. 3).
We have also compared the patterns of divergence along the bindin gene in the three sea urchin species (Fig. 3) by the same approach as above. The bindin sequence of S. polyacanthus (AF077317) was used in interspecific comparisons. The correlations of divergence patterns between all three species are highly significant: S. intermedius—S. droebachiensis (rho = 0.719; P < 0.0001), S. intermedius—S. pallidus (rho = 0.657; P < 0.0001), and S. droebachiensis—S. pallidus (rho = 0.748; P < 0.0001).
Thus, significant correlations between the patterns of nucleotide diversity in different species support the suggestion that the bindin gene evolves similarly (but not identically) in S. intermedius, S. droebachiensis, and S. pallidus. The evolutionary vectors are remarkably similar for these sea urchin species that diverged around 3–7 million years ago . The revealed pattern is consistent with the phenomenon of parallel evolution, which results from similar or identical mutations maximizing adaptation in independent evolutionary lineages (review in ).
Tests of neutrality
P = 0.0036
P = 0.0001
P = 0.0253
P = 0.0023
P = 0.0001
P = 0.0172
With two sets of divergent haplotypes for the bindin gene of S. intermedius (Fig. 1), it is appropriate to use the haplotype test  to see whether directional selection has increased the frequency of some haplotypes. For the full dataset of 25 bindin sequences, there are a total of 30 parsimony informative polymorphic sites, and there is a homogeneous subset of 20 sequences with two informative sites (Fig 1). The probability of this configuration is significantly low (P = 0.03) with the population recombination rate 0.005 obtained by the method of McVean et al. . The configuration is more asymmetric (the haplotype test P = 0.006) excluding recombinant sequence U-7 and sequence G-45 with two mutations in positions 219 and 244, Fig. 1). Thus, the homogeneous subset of sequences may evolve under the influence of directional selection. The region with amino acid substitutions (Fig. 1) at the beginning of bindin exon I may be a likely candidate as a target for directional selection (see also Fig. 4). The result of the haplotype test is consistent with the results of the McDonald tests (see above). For the other two species, S. pallidus and S. droebachiensis, the haplotype test is not significant, probably due to the very limited number of sequences available for these species.
In general, the signal of positive selection could result from codon bias affecting fixed synonymous sites. The effective number of codons (ENC; ) ranges from 20, which means that the bias is at a maximum (so that only one codon is used from each synonymous codon group), to 61, which means no bias (all synonymous codons are equally used in each codon group). The values of ENC are 56.2, 52.1, and 57.1 for S. intermedius, S. pallidus, and S. droebachinensis, respectively; all numbers close to the maximum possible (61), indicating no codon bias in the bindin gene in the sea urchin studied.
The codon adaptation index (CAI; ) is a measure of the synonymous codon usage bias for a DNA or RNA sequence. The index ranges from 0 to 1, being 1 if a gene always uses the most frequently used synonymous codons in the reference set. We have calculated CAI using the original method proposed by Sharp and Li  implemented in the program E-CAI . The random sequences are generated with the Markov method (Markov Model of order 0). The normality of the CAI values of the random generated sequences is assessed with a Kolmogorov-Smirnov test. The CAI values of the random-generated sequences are used to estimate an expected value of CAI that can be compared with the observed value. For the reference set of sequences we have used 1131 nucleotide sequences (410481 codons) of S. purpuratus from Codon Usage Database . The observed CAI values are significantly lower than expected values (eCAI) (P = 0.01) for all three species studied, indicating no codon usage bias in the bindin gene. For S. intermedius CAI = 0.643, eCAI = 0.734; P = 0.01; for S. pallidus CAI = 0.668, eCAI = 0.750; P = 0.01, and for S. droebachiensis CAI = 0.663, eCAI = 0.754; P = 0.01.
The maximum likelihood analysis of positive selection (see below) cannot be biased due to the effects of selection for codon bias since unequal codon frequencies are explicitly taken into account within the codon model. Thus the positive selection is detected given the background codon bias.
The neutrality tests are typically affected by demography and, therefore, they may be difficult to interpret [115, 116]. We have applied model-based maximum likelihood (ML) methods to confirm the observations made above. All results from the ML analyses shown below held for the full sample of S. intermedius, S. pallidus, and S. droebachiensis, whether or not recombinant sequences were removed.
Maximum likelihood analysis of the coding regions
Three LRTs (M0 vs. M3, M1a vs. M2a, and M7 vs M8) were applied to test for positive selection on the protein in separate alignments of S. intermedius, S. pallidus, S. droebachiensis and the alignment of several other sea urchin species. None of these tests is significant for the S. intermedius and S. droebachiensis alignments. Model M0 offered equally good fit to data as more flexible models. It was estimated that the bindin gene evolved with ω = 0.7 and κ = 4.3 in S. intermedius and with ω = 0.5 and κ = 2.1 in S. droebachiensis. However, it seems unlikely that all sites in the gene are under the same selective pressure, due to constraints imposed by tertiary protein structure and function. We attribute such a result to the low power of LRTs for datasets of low divergence . For the S. pallidus alignment, all three LRT are significant with p-values from 0.01 to 0.02. The ML estimates suggest that 97 % of sites in bindin are very conserved, but the remaining 3 % of sites evolve under strong diversifying selective pressure with ω = 37.3.
Sites inferred under positive selection using the Bayesian prediction based on codon models M2a and M8
Positively selected sites
The site models rely on the unique inferred phylogeny. For population data, phylogenetic inference may lack resolution due to low divergence or may be inaccurate due to recombination. Although site models are rather robust to recent deviations in topological arrangement, it is advantageous to use other techniques with different assumptions to check the robustness of the conclusions, and to see whether additional conclusions can be drawn from different types of analyses.
Therefore, in addition we used the PAC likelihood method based on the approximation to the coalescent with selection and recombination parameters estimated simultaneously in the Bayesian framework . The method was applied to the three population samples above (S. intermedius, S. pallidus, and S. droebachiensis). Sites under positive selection can be inferred on a site-to-site basis or in sliding windows, and the posterior probabilities at each site can be used to measure the confidence of the prediction. However, unlike LRT the technique does not intend to test the hypothesis of positive selection on the gene overall. In this framework, this would be equivalent to answering the question, “Is there at least one site under positive selection in the gene?” Posed this way, the problem can be seen as similar to multiple testing, whereby the hypothesis of positive selection is tested at each site using posterior probabilities rather than p-values as is usually done (and it is not meant to complement the Bayesian framework). However, for explorative purposes, we apply the Benjamini and Hochberg  multiple-testing correction at each site (1- posterior probability) instead of a p-value.
In all three species alignments, several sites were inferred to be under positive selection with posterior probability > 0.95. When sequences for the U and G forms of S. intermedius were analyzed separately, more sites under positive selection were inferred for the U form (excluding U7) compared to the G form, which is consistent with the observation of higher variability in the U form (above). When all bindin sequences of S. intermedius were analyzed together, 10 sites were inferred under positive selection. After the multiple testing correction, some sites were still under positive selection in each of the datasets (according to the full species alignment: sites 65, 67, 87, 160, 163 and 185 in S. droebachiensis; sites 65 and 199 in S. intermedius; and sites 188 and 199 in S. pallidus; see Additional file 5: Figure S5 for the bindin amino acid alignment). These results suggest that diversifying selection acts both at the species and population level of the sea urchins studied. Similar results were obtained previously for the bindin coding region of the sea urchins. A number of sites evolving under positive selection were detected in the 5′ and 3′ bindin regions of the sea urchins genera Strongylocentrotus [25, 26] and Echinomenta [17, 22, 23]. Our results are completely compatible with the previous studies. We found three sites under positive selection both in the 5′ and 3′ regions, but none within the core.
Testing for positive selection in the intron region
Models of variable ζ among sites
Proportions of sites from a corresponding class
ζ0, p 0
ζ0 < 1, ζ1 = 1
p 0, p 1 = 1− p 0
ζ0, ζ1, p 0
ζ0 < 1, ζ1 ≥ 1
p 0, p 1 = 1− p 0
Parameter estimates for ζ-models
ζ0 = 0.99, p 0 = 0.14 [ζ1 = 1, p 1 = 0.86]
ζ0 = 0.69, p 0 = 0.98 ζ1 = 61.77, [p 1 = 0.02]
−2870.740681 p-value = 10−8
ζ0 = 1.00, p 0 = 0.00 [ζ1 = 1, p 1 = 1.00]
ζ0 = 0.00, p 0 = 0.00 ζ1 = 8.80, [p 1 = 1.00]
−2522.600909 p-value = 2x10−6
ζ0 = 0.00, p 0 = 0.53 [ζ1 = 1, p 1 = 0.47]
ζ0 = 0.00, p 0 = 0.91 ζ1 = 7.27, [p 1 = 0.09]
−2277.438710 p-value = 0.02
A number of evolutionary processes in the bindin coding region such as mildly deleterious selection against unpreferred synonymous codons, selective sweeps due to positive selection on nonsynonymous variation, and background selection could potentially influence the results we detected in the bindin intron. However, these possible alternative mechanisms represent the variants of selection that decrease variability (e.g., [120–122]). Contrary to these scenarios we observe highly increased level of variability in the intron, moreover accompanied by a decreased level of divergence, indicating the action of diversifying selection [65, 73–75, 77, 105, 107–110] operating within the bindin intron independently from the coding region.
We report several tests that support positive selection on the bindin intron. The statistical significance of the deviation from neutrality in the intron region is supported not only by the ML analysis based on joint codon-nucleotide models but also by several neutrality tests - Kelly , Wall , and McDonald [73, 74]. The observed consistency between different approaches is an important argument supporting the results obtained in this work.
Further, the results of the ML analysis are likely to be robust. First, the selection measure relies on the d S measured as average over the entire coding region. Since there is not even weak codon bias in the coding region (according to codon bias indices, see above), average d S cannot be affected to inflate the measure of selection as a whole. Site-specific synonymous bias events, if present, are insufficient to affect the average d S significantly. Therefore there is nothing undermining the ML analysis of selection on the bindin intron of the strongylocentrotid sea urchins.
The data obtained here are in accordance with other investigations where positive selection acting at the intron sites was revealed or suggested: in the plant Ficus carica , Drosophila melanogaster [46, 47, 124], and primates including humans [125–129]. Nevertheless, the results of the neutrality tests and the interaction between selective and neutral processes should be cautiously interpreted, given the modest sample size of sequences with the relatively short sequence lengths from a single population (e.g., [130, 131]). Moreover, there are nonselective factors that could partly account for the patterns of the bindin polymorphisms. Possible explanatory processes include bottlenecks and founding effects and/or population (or species) admixture, as well as varying recombination rates in different genomic regions.
Demographic and selective forces shaping nucleotide polymorphism patterns in strongylocentrotid sea urchin species are difficult to disentangle because of their highly complicated evolutionary history, including wide dispersal in the North Atlantic, Arctic, and Pacific continental shelves and slopes, and adaptation to drastically new environments [49, 50]. Previously we showed that the patterns of polymorphism should be influenced by both of these evolutionary forces and is apparent in our data obtained for the Sod, Est-6, ψEst-6, tin, bap, lbe, and lbl genes from four natural Drosophila melanogaster populations (Africa, Europe, North and South America) [46, 47, 97–99]. Comparative analysis showed significant peaks of variability observed both in African and non-African samples, but dimorphic structure was detected only in non-African samples. This observation supports the hypothesis that dimorphic haplotype structure could be generated by demographic process during the recent species history caused by admixture of differentiated populations (or species). Indeed, deep branches due to demographic factors are expected in coalescent theory . Moreover, there is direct evidence for introgression between S. pallidus and S. droebachiensis [44, 55]. However, the elevated levels of nucleotide variability and linkage disequilibrium, accompanied by significant peaks of neutrality test statistics could not be explained by stochastic processes alone and might reflect the effects of balancing selection [65, 73–75, 77, 105, 107–110]. One way of distinguishing between selective and demographic processes could be to perform similar investigations in other populations of strongylocentrotid sea urchins combined with a functional approach using experimental methods to correlate specific bindin haplotypes with specific functional differences related to reproduction. This integral approach promises to be fruitful, thus adding a new aspect in evolutionary studies of bindin including the intron sequences mostly overlooked in previous investigations.
We have studied nucleotide variability in the complete bindin locus including two exons and one intron in the sea urchin S. intermedius (including two morphological forms) and in two other strongylocentrotid species, S. pallidus and S. droebachiensis available in GenBank. The bindin gene introns have not been previously investigated for any species of sea urchins.
The distribution of variability and divergence is non-uniform along bindin, with striking similarity between all three species, indicating similar evolutionary trends of this gene with a reproductive function. The revealed pattern is consistent with the phenomenon of parallel evolution, which results from similar or identical mutations maximizing adaptation in independent evolutionary lineages. This suggestion is supported by significant interspecific correlations of nucleotide diversity patterns from sliding windows over the bindin gene.
The patterns of nucleotide variability and divergence are radically different in the bindin coding and intron regions. The signature of positive selection is detected in the bindin coding region. Moreover, the data suggest the action of diversifying selection in the bindin intron, which to our knowledge has not been shown previously. Different types of positive selection are suggested in different functional regions, with putative multiple targets of selection both in coding and intron regions. Significant deviation from neutrality suggests functional importance of the bindin intron variability, which might be involved in regulatory functions.
The morphological forms of S. intermedius show no evidence of genetic divergence. However they demonstrate slightly different patterns of bindin variability. These observations along with clear morphological and ecological differences, as well as the highly specific symbiotic microorganisms previously found , suggest unique evolutionary trajectories for each form and warrant treating them separately with respect to biodiversity conservation and management.
Ethics (and consent to participate)
The sea urchin Strongylocentrotus intermedius is not listed as endangered, vulnerable, rare, or protected species of the Russian Federation. The S. intermedius is considered as a “commercial” species in the Primorye Territory (the Sea of Japan) of the Russian Federation, where we collected the specimens for the present study. Fishery of S. intermedius is officially permitted and administered.
The described field study was based on the quota limit obtained from the Department of Fisheries and Marine Resources of Primorye Territory (DFMRPT) (order #152, December 12, 2014; signed by the DFMRPT Director A.A. Perednya; see details, http://primorsky.ru/upload/iblock/dee/deed8734a1207787f7658b55df06b654.pdf). The sampling point is located beyond any protected territories. The field study did not involve endangered, vulnerable, rare, or protected species. The locations of the field studies are not privately-owned or protected.
The present field study was approved by the Federal Agency for Fishery of the Russian Federation, which has the highest decision authority concerning marine organisms care and use and should be considered as an equivalent to the Institutional Animal Care and Use Committee.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors alone are responsible for the content and writing of the paper.
Consent to publish
Availability of data and materials
The nucleotide sequences obtained in the present work are deposited in GenBank (National Center for Biotechnology Information) under the accession numbers: KP774723—KP774781 (COI) and KP774782—KP774794 (bindin) at www.ncbi.nlm.nih.gov .
We are thankful to anonymous reviewers for their constructive comments and suggestions on the original draft of the manuscript. We thank Elena Balakireva for encouragement and help. We are grateful to Dr. A. G. Bazhin for the details concerning the geographical distribution of Strongylocentrotus sea urchins. The work on the bindin gene cloning and sequencing was supported by the Bren Professor Funds at the University of California Irvine to F. J. Ayala. The analysis of the data was supported by the Russian Science Foundation (RSF; Grant No. 14–50–00034) to E. S. Balakirev. Maria Anisimova was funded by the Kick-off grant of the Zurich University of Applied Sciences.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Vacquier VD, Swanson WJ, Hellberg ME. What have we learned about sea urchin sperm bindin? Dev Growth Differ. 1995;37:1–10.View ArticleGoogle Scholar
- Swanson WJ, Vacquier VD. Reproductive protein evolution. Annu Rev Ecol Syst. 2002;33:161–79.View ArticleGoogle Scholar
- Zigler KS, Lessios HA. 250 million years of bindin evolution. Biol Bull. 2003;205:8–15.PubMedView ArticleGoogle Scholar
- Zigler KS. The evolution of sea urchin sperm bindin. Int J Dev Biol. 2008;52:791–96.PubMedView ArticleGoogle Scholar
- Palumbi SR. Speciation and the evolution of gamete recognition genes: patterns and process. Heredity. 2009;102:66–76.PubMedView ArticleGoogle Scholar
- Lessios HA. Speciation genes in free-spawning marine invertebrates. Integrat Compar Biol. 2011;51:456–65.View ArticleGoogle Scholar
- Vacquier VD, Swanson WJ. Selection in the rapid evolution of gamete recognition proteins in marine invertebrates. Cold Spring Harb Perspect Biol. 2011;3(11):a002931.PubMedPubMed CentralView ArticleGoogle Scholar
- Vacquier VD. The quest for the sea urchin egg receptor for sperm. Biochem Biophys Res Com. 2012;425:583–87.PubMedView ArticleGoogle Scholar
- Cameron RA, Minor JE, Nishioka D, Britten RJ, Davidson EH. Locate and level of bindin mRNA in maturing testis of the sea urchin, Strongylocentrotus purpuratus. Dev Biol. 1990;142:44–9.PubMedView ArticleGoogle Scholar
- Nishioka D, Ward RD, Poccia D, Kostacos C, Minor JE. Localization of bindin expression during sea urchin spermatogenesis. Mol Reprod Dev. 1990;27:181–90.PubMedView ArticleGoogle Scholar
- Summers RG, Hylander BL, Colwin LH, Colwin AL. The functional anatomy of the echinoderm spermatozoon and its interaction with the egg at fertilization. Am Zool. 1975;15:523–51.View ArticleGoogle Scholar
- Metz EC, Kane RE, Yanagimachi H, Palumbi SR. Fertilization between closely related sea urchins is blocked by incompatibilities during sperm-egg attachment and early stages of fusion. Biol Bull. 1994;187:23–34.View ArticleGoogle Scholar
- Swanson WJ, Vacquier VD. The rapid evolution of reproductive proteins. Nat Rev Genet. 2002;3:137–44.PubMedView ArticleGoogle Scholar
- Palumbi SR. All males are not created equal: fertility differences depend on gamete recognition polymorphisms in sea urchins. Proc Natl Acad Sci U S A. 1999;96:12632–7.PubMedPubMed CentralView ArticleGoogle Scholar
- Levitan DR, Ferrell DL. Selection on gamete recognition proteins depends on sex, density, and genotype frequency. Science. 2006;312:267–9.PubMedView ArticleGoogle Scholar
- Levitan DR, Stapper AP. Simultaneous positive and negative frequency-dependent selection on sperm bindin, a gamete recognition protein in the sea urchin Strongylocentrotus purpuratus. Evolution. 2009;64:785–97.PubMedView ArticleGoogle Scholar
- Zigler KS, McCartney MA, Levitan DR, Lessios HA. Sea urchin bindin divergence predicts gamete compatibility. Evolution. 2005;59:2399–404.PubMedView ArticleGoogle Scholar
- Geyer LB, Palumbi SP. Reproductive character displacement and the genetics of gamete recognition in tropical sea urchins. Evolution. 2003;57:1049–60.PubMedView ArticleGoogle Scholar
- Ulrich AS, Otter M, Glabe CG, Hoekstra D. Membrane fusion is induced by a distinct peptide sequence of the sea urchin fertilization protein bindin. J Biol Chem. 1998;273:16748–55.PubMedView ArticleGoogle Scholar
- Patiño S, Aagaard JE, Maccoss MJ, Swanson WJ, Hart MW. Bindin from a sea star. Evol Dev. 2009;11:377–82.View ArticleGoogle Scholar
- Lopez A, Miraglia SJ, Glabe CG. Structure/function analysis of the sea urchin sperm adhesive protein bindin. Dev Biol. 1993;156:24–33.PubMedView ArticleGoogle Scholar
- Metz EC, Palumbi SR. Positive selection and sequence rearrangements generate extensive polymorphism in the gamete recognition protein bindin. Mol Biol Evol. 1996;13:397–406.PubMedView ArticleGoogle Scholar
- McCartney MA, Lessios HA. Adaptive evolution of sperm bindin tracks egg incompatibility in neotropical sea urchins of the genus Echinometra. Mol Biol Evol. 2004;21:732–45.PubMedView ArticleGoogle Scholar
- Geyer LB, Lessios H. Lack of character displacement in the male recognition molecule, bindin, in Atlantic sea urchins of the genus Echinometra. Mol Biol Evol. 2009;26:2135–46.PubMedView ArticleGoogle Scholar
- Biermann CH. The molecular evolution of sperm bindin in six species of sea urchins (Echinoida: Strongylocentrotidae). Mol Biol Evol. 1998;15:1761–71.PubMedView ArticleGoogle Scholar
- Pujolar JM, Pogson GH. Positive Darwinian selection in gamete recognition proteins of Strongylocentrotus sea urchins. Mol Ecol. 2011;20:4968–82.PubMedView ArticleGoogle Scholar
- Zigler KS, Raff EC, Popodi E, Raff RA, Lessios HA. Adaptive evolution of bindin in the genus Heliocidaris is correlated with the shift to direct development. Evolution. 2003;57:2293–302.PubMedView ArticleGoogle Scholar
- Calderón I, Turon X, Lessios HA. Characterization of the sperm molecule bindin in the sea urchin genus Paracentrotus. J Mol Evol. 2009;68:366–76.PubMedView ArticleGoogle Scholar
- Calderón I, Ventura CRR, Turon X, Lessios HA. Genetic divergence and assortative mating between colour morphs of the sea urchin Paracentrotus gaimardi. Mol Ecol. 2010;19:484–93.PubMedView ArticleGoogle Scholar
- Metz EC, Gomes-Gutierez G, Vacquier VD. Mitochondrial DNA and bindin gene sequence evolution among allopatric species of the sea urchin genus Arbacia. Mol Biol Evol. 1998;15:185–95.PubMedView ArticleGoogle Scholar
- Lessios HA, Lockhart S, Collin R, Sotil G, Sanchez-Jerez P, Zigler KS, et al. Phylogeography and bindin evolution in Arbacia, a sea urchin genus with an unusual distribution. Mol Ecol. 2012;21:130–44.PubMedView ArticleGoogle Scholar
- Zigler KS, Lessios HA. Evolution of bindin in the pantropical sea urchin Tripneuster: comparisons to bindin of other genera. Mol Biol Evol. 2003;20:220–31.PubMedView ArticleGoogle Scholar
- Zigler KS, Lessios HA. Speciation on the coasts of the new world: phylogeography and the evolution of bindin in the sea urchin genus Lytechinus. Evolution. 2004;58:1225–41.PubMedView ArticleGoogle Scholar
- Debenham P, Brzezinski MA, Foltz KR. Evaluation of sequence variation and selection in the bindin locus of the red sea urchin, Strongylocentrotus franciscanus. J Mol Evol. 2000;51:481–90.PubMedGoogle Scholar
- Sunday JM, Hart MW. Sea star populations diverge by positive selection at a sperm-egg compatibility locus. Evol Ecol. 2013;3:640–54.View ArticleGoogle Scholar
- Popovic I, Marko PB, Wares JP, Hart MW. Selection and demographic history shape the molecular evolution of the gamete compatibility protein bindin in Pisaster sea stars. Ecol Evol. 2014;4:1567–88.PubMedPubMed CentralView ArticleGoogle Scholar
- Dobzhansky T. Speciation as a stage in evolutionary divergence. Am Nat. 1940;74:302–21.Google Scholar
- Eberhard WG. Female Control: Sexual Selection by Cryptic Female Choice. Princeton: Princeton Univ. Press; 1996. p. 1996.Google Scholar
- Rice WR, Holland B. The enemies within: Intergenomic conflict. Interlocus contest evolution, ICE and the intraspecific red queen. Behav Ecol Sociobiol. 1997;41:1–10.View ArticleGoogle Scholar
- Gould MC, Stephano JL. Polyspermy prevention in marine invertebrates. Microscopy Research and Technology. 2003;61:379–88.View ArticleGoogle Scholar
- Vacquier VD, Swanson WJ, Lee YH. Positive Darwinian selection on two homologous fertilization proteins: what is the selective pressure driving their divergence? J Mol Evol. 1997;44:S15–22.PubMedView ArticleGoogle Scholar
- Zigler KS, Byrne M, Raff EC, Lessios HA, Raff RA. Natural hybridization in the sea urchin genus Pseudoboletia between species without apparent barriers to gamete recognition. Evolution. 2012;66:1695–708.PubMedView ArticleGoogle Scholar
- Binks RM, Prince J, Evans JP, Kennington WJ. More than bindin: reproductive isolation between sympatric subspecies of a sea urchin by asynchronous spawning. Evolution. 2012;66:3545–57.PubMedView ArticleGoogle Scholar
- Addison JA, Pogson GH. Multiple gene genealogies reveal asymmetrical hybridization and introgression among strongylocentrotid sea urchins. Mol Ecol. 2009;18:1239–51.PubMedView ArticleGoogle Scholar
- Chorev M, Carmel L. The function of introns. Front Genet. 2012;3:1–15.View ArticleGoogle Scholar
- Balakirev ES, Ayala FJ. Nucleotide variation in the tinman and bagpipe homeobox genes of Drosophila melanogaster. Genetics. 2004;166:1845–56.PubMedPubMed CentralView ArticleGoogle Scholar
- Balakirev ES, Anisimova M, Ayala FJ. Complex interplay of evolutionary forces in the ladybird homeobox genes of Drosophila melanogaster. PLoS One. 2011;6(7):e22613.PubMedPubMed CentralView ArticleGoogle Scholar
- Kober KM, Bernardi G. Phylogenomics of strongylocentrotid sea urchins. BMC Evol Biol. 2013;13:88.PubMedPubMed CentralView ArticleGoogle Scholar
- Jensen M. The Strongylocentrotidae (Echinoidea), a morphologic and systematic study. Sarsia. 1974;57:113–48.View ArticleGoogle Scholar
- Bazhin AG, Stepanov VG. Sea urchins fam. Strongylocentrotidae of seas of Russia. Petropavlovsk-Kamchatsky: KamchatNIRO; 2012.Google Scholar
- Swan EF. Evidence suggesting the existence of two species of Strongylocentrotus (Echinoidea) in the northwest Atlantic. Can J Zool. 1962;40:1211–22.View ArticleGoogle Scholar
- Gagnon J-M, Gilkinson KD. Discrimination and distribution of the sea urchins Strongylocentrotus droebachiensis (O.F. Müller) and S. pallidus (G.O. Sars) in the north-west Atlantic. Sarsia. 1994;79:1–11.View ArticleGoogle Scholar
- Buyanovsky AI, Rzhavsky AV. Spatial structure of settlements of green sea urchin Strongylocentrotus droebachiensis (Echinodermata; Strongylocentrotidae) in the Dalne-Zelenetskaya inlet in the Barents sea. Proceedings of the Russian Federal Research Institute of Fisheries and Oceanography. 2007;147:350–75.Google Scholar
- Balakirev ES, Pavlyuchkov VA, Ayala FJ. DNA variation and symbiotic associations in phenotypically-diverse sea urchin Strongylocentrotus intermedius. Proc Natl Acad Sci U S A. 2008;105:16218–23.PubMedPubMed CentralView ArticleGoogle Scholar
- Addison JA, Hart MW. Colonization, dispersal, and hybridization influence phylogeography of North Atlantic sea urchins (Strongylocentrotus droebachiensis). Evolution. 2005;59:532–43.PubMedGoogle Scholar
- Balakirev ES, Krupnova TN, Ayala FJ. Symbiotic associations in the phenotypically-diverse brown alga Saccharina japonica. PLoS One. 2012;7(6):e39587.PubMedPubMed CentralView ArticleGoogle Scholar
- Balakirev ES, Romanov NS, Mikheev PB, Ayala FJ. Mitochondrial DNA variation and introgression in Siberian taimen Hucho taimen. PLoS One. 2013;8(8):e71147.PubMedPubMed CentralView ArticleGoogle Scholar
- Minor JE, Fromson DR, Britten RJ, Davidson EH. Comparison of the bindin proteins of Strongylocentrotus franciscanus, S. purpuratus, and Lytechinus variegatus: sequences involved in the species specificity of fertilization. Mol Biol Evol. 1991;8:781–95.PubMedGoogle Scholar
- Gao B, Klein LE, Britten RJ, Davidson EH. Sequence of mRNA coding for bindin, a species-specific sea urchin sperm protein required for fertilization. Proc Natl Acad Sci U S A. 1986;83:8634–38.PubMedPubMed CentralView ArticleGoogle Scholar
- Lee YH. Molecular phylogenies and divergence times of sea urchin species of strongylocentrotidae, echinoida. Mol Biol Evol. 2003;20:1211–21.PubMedView ArticleGoogle Scholar
- Jacobs HT, Elliott DJ, Math VB, Farquharson A. Nucleotide sequence and gene organization of sea urchin mitochondrial DNA. J Mol Biol. 1988;202:185–217.PubMedView ArticleGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.PubMedPubMed CentralView ArticleGoogle Scholar
- Librado P, Rozas J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–52.PubMedView ArticleGoogle Scholar
- Filatov DA. PROSEQ: a software for preparation and evolutionary analysis of DNA sequence data sets. Mol Ecol Notes. 2002;2:621–24.View ArticleGoogle Scholar
- Hudson RR, Kaplan N. The coalescent process in models with selection and recombination. Genetics. 1988;120:831–40.PubMedPubMed CentralGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.PubMedPubMed CentralView ArticleGoogle Scholar
- Balakirev ES, Krupnova TN, Ayala FJ. DNA variation in the phenotypically-diverse brown alga Saccharina japonica. BMC Plant Biol. 2012;12(108). doi:10.1186/1471-2229-12-108.
- Hudson RR, Kreitman M, Aguadé M. A test of neutral molecular evolution based on nucleotide data. Genetics. 1987;116:153–9.PubMedPubMed CentralGoogle Scholar
- Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989;123:585–95.PubMedPubMed CentralGoogle Scholar
- McDonald JH, Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351:652–4.PubMedView ArticleGoogle Scholar
- Fu Y-X, Li W-H. Statistical tests of neutrality of mutations. Genetics. 1993;133:693–709.PubMedPubMed CentralGoogle Scholar
- Hudson RR, Bailey K, Skarecky D, Kwiatowski J, Ayala FJ. Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster. Genetics. 1994;136:1329–40.PubMedPubMed CentralGoogle Scholar
- McDonald JH. Detecting non-neutral heterogeneity across a region of DNA sequence in the ratio of polymorphism to divergence. Mol Biol Evol. 1996;13:253–60.PubMedView ArticleGoogle Scholar
- McDonald JH. Improved tests for heterogeneity across a region of DNA sequence in the ratio of polymorphism to divergence. Mol Biol Evol. 1998;15:377–84.PubMedView ArticleGoogle Scholar
- Kelly JK. A test of neutrality based on interlocus associations. Genetics. 1997;146:1197–206.PubMedPubMed CentralGoogle Scholar
- Depaulis F, Veuille M. Neutrality tests based on the distribution of haplotypes under an infinite-site model. Mol Biol Evol. 1998;5:1788–90.View ArticleGoogle Scholar
- Wall JD. Recombination and the power of statistical tests of neutrality. Genet Res. 1999;74:65–79.View ArticleGoogle Scholar
- Hudson RR, Boos D, Kaplan NL. A statistical test for detecting geographic subdivision. Mol Biol Evol. 1992;9:138–51.PubMedGoogle Scholar
- Hudson RR. Properties of a neutral allele model with intragenic recombination. Theor Popul Biol. 1983;23:183–201.PubMedView ArticleGoogle Scholar
- Hudson RR. Gene genealogies and the coalescent process. Oxf Surv Biol. 1990;7:1–44.Google Scholar
- Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics. 2002;18:337–8.PubMedView ArticleGoogle Scholar
- Sawyer SA. Statistical tests for detecting gene conversion. Mol Biol Evol. 1989;6:526–38.PubMedGoogle Scholar
- McVean G, Awadalla P, Fearnhead P. A coalescent-based method for detecting and estimating recombination from gene sequences. Genetics. 2002;160:1231–41.PubMedPubMed CentralGoogle Scholar
- Martin DP, Lemey P, Lott M, Moulton V, Posada D, Lefeuvre P. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics. 2010;26:2462–63.PubMedPubMed CentralView ArticleGoogle Scholar
- Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–21.PubMedView ArticleGoogle Scholar
- Yang Z, Swanson WJ, Vacquier VD. Maximum-likelihood analysis of molecular adaptation in abalone sperm lysin reveals variable selective pressures among lineages and sites. Mol Biol Evol. 2000;17:1446–55.PubMedView ArticleGoogle Scholar
- Yang Z, Nielsen R. Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage. Mol Biol Evol. 2008;25:568–79.PubMedView ArticleGoogle Scholar
- Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–91.PubMedView ArticleGoogle Scholar
- Kosakovsky Pond SL, Muse SV. Site-to-site variation of synonymous substitution rates. Mol Biol Evol. 2005;22:2375–85.View ArticleGoogle Scholar
- Kosakovsky Pond SL, Frost SDW, Muse SV. HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005;21:676–9.View ArticleGoogle Scholar
- Anisimova M, Kosiol C. Investigating protein-coding sequence evolution with probabilistic codon substitution models. Mol Biol Evol. 2009;26:255–71.PubMedView ArticleGoogle Scholar
- Wong WS, Nielsen R. Detecting selection in noncoding regions of nucleotide sequences. Genetics. 2004;167:949–58.PubMedPubMed CentralView ArticleGoogle Scholar
- Yang Z, Wong WS, Nielsen R. Bayes empirical bayes inference of amino acid sites under positive selection. Mol Biol Evol. 2005;22:1107–18.PubMedView ArticleGoogle Scholar
- Moy GW, Vacquier VD. Bindin genes of the Pacific oyster Crassostrea gigas. Gene. 2008;423:215–20.PubMedView ArticleGoogle Scholar
- Du J, Gu T, Tian H, Araki H, Yang Y-H, Tian D. Grouped nucleotide polymorphism: a major contributor to genetic variation in Arabidopsis. Gene. 2008;426:1–6.PubMedView ArticleGoogle Scholar
- Teeter K, Naeemuddin M, Gasperini R, Zimmerman E, White KP, Hoskins R, et al. Haplotype dimorphism in a SNP collection from Drosophila melanogaster. J Exp Zool. 2000;288:63–75.PubMedView ArticleGoogle Scholar
- Balakirev ES, Chechetkin VR, Lobzin VV, Ayala FJ. DNA polymorphism in the β-esterase gene cluster of Drosophila melanogaster. Genetics. 2003;164:533–44.PubMedPubMed CentralGoogle Scholar
- Balakirev ES, Ayala FJ. Nucleotide variation of the Est-6 gene region in natural populations of Drosophila melanogaster. Genetics. 2003;165:1901–14.PubMedPubMed CentralGoogle Scholar
- Balakirev ES, Balakirev EI, Ayala FJ. Molecular evolution of the Est-6 gene in Drosophila melanogaster: Contrasting patterns of DNA variability in adjacent functional regions. Gene. 2002;288:167–77.PubMedView ArticleGoogle Scholar
- Hudson RR, Sáez AG, Ayala FJ. DNA variation at the Sod locus of Drosophila melanogaster: an unfolding story of natural selection. Proc Natl Acad Sci U S A. 1997;94:7725–29.PubMedPubMed CentralView ArticleGoogle Scholar
- Aguadé M. Nucleotide sequence variation at two genes of the phenylpropanoid pathway, the FAH1 and F3H genes, in Arabidopsis thaliana. Mol Biol Evol. 2001;18:1–9.PubMedView ArticleGoogle Scholar
- Haubold B, Kroymann J, Ratzka A, Mitchell-Olds T, Wiehe T. Recombination and gene conversion in a 170-kb genomic region of Arabidopsis thaliana. Genetics. 2002;161:1269–78.PubMedPubMed CentralGoogle Scholar
- Hudson RR, Kaplan N. Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics. 1985;111:147–64.PubMedPubMed CentralGoogle Scholar
- Moy GW, Springer SA, Adams SL, Swanson WJ, Vacquier VD. Extraordinary intraspecific diversity in oyster sperm bindin. Proc Natl Acad Sci U S A. 2008;105:1993–8.PubMedPubMed CentralView ArticleGoogle Scholar
- Nordborg M. Structured coalescent processes on different time scales. Genetics. 1997;146:1501–14.PubMedPubMed CentralGoogle Scholar
- Stern DL. The genetic causes of convergent evolution. Nature Rev Genet. 2013;14:751–64.PubMedView ArticleGoogle Scholar
- Strobeck C. Expected linkage disequilibrium for a neutral locus linked to a chromosomal arrangement. Genetics. 1983;103:545–55.PubMedPubMed CentralGoogle Scholar
- Charlesworth B, Nordborg M, Charlesworth D. The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided populations. Genet Res. 1997;70:155–74.PubMedView ArticleGoogle Scholar
- Takahata N, Satta Y. Footprints of intragenic recombination at HLA loci. Immunogenetics. 1998;47:430–41.PubMedView ArticleGoogle Scholar
- Charlesworth D. Balancing selection and its effects on sequences in nearby genome regions. PLoS Genet. 2006;2(4):e64.PubMedPubMed CentralView ArticleGoogle Scholar
- Wright F. The “effective number of codons” used in a gene. Gene. 1990;87:23–9.PubMedView ArticleGoogle Scholar
- Sharp PM, Li WH. The codon adaptation index – a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987;15:1281–95.PubMedPubMed CentralView ArticleGoogle Scholar
- Puigbò P, Bravo IG, Santiago G-VS. CAIcal: A combined set of tools to assess codon usage adaptation. Biol Direct. 2008;3:38.PubMedPubMed CentralView ArticleGoogle Scholar
- Nakamura Y, Gojobori T, Ikemura T. Codon usage tabulated from international DNA sequence databases: status for the year 2000. Nucleic Acids Res. 2000;28:292.PubMedPubMed CentralView ArticleGoogle Scholar
- Wayne ML, Simonsen K. Statistical tests of neutrality in the age of weak selection. Trends Ecol Evol. 1998;13:1292–9.View ArticleGoogle Scholar
- Nielsen R. Statistical tests of selective neutrality in the age of genomics. Heredity. 2001;86:641–7.PubMedView ArticleGoogle Scholar
- Anisimova M, Bielawski JP, Yang Z. Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol Biol Evol. 2001;18:1585–92.PubMedView ArticleGoogle Scholar
- Wilson DJ, Mcvean G. Estimating diversifying selection and functional constraint in the presence of recombination. Genetics. 2006;172:1411–25.PubMedPubMed CentralView ArticleGoogle Scholar
- Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Statist Soc Ser B. 1995;57:289–300.Google Scholar
- Smith JM, Haigh J. The hitch-hiking effect of a favourable gene. Genet Res. 1974;23:23–35.PubMedView ArticleGoogle Scholar
- Charlesworth B, Morgan MT, Charlesworth D. The effect of deleterious mutations on neutral molecular variation. Genetics. 1993;134:1289–303.PubMedPubMed CentralGoogle Scholar
- Akashi H, Eyre-Walker A. Translational selection and molecular evolution. Curr Opin Genet Develop. 1998;8:688–93.View ArticleGoogle Scholar
- Baraket G, Abdelkrim AB, Salhi-Hannachi A. tRNALeu intron (UAA) of Ficus carica L.: genetic diversity and evolutionary patterns. Genet Mol Res. 2015;14:3817–32.PubMedView ArticleGoogle Scholar
- Balakirev ES, Balakirev EI, Rodriguez-Trelles F, Ayala FJ. Molecular evolution of two linked genes, Est-6 and Sod, in Drosophila melanogaster. Genetics. 1999;53:1357–69.Google Scholar
- Nachman NW, Crowell SL. Contrasting evolutionary histories of two introns of the Duchenne muscular dystrophy gene, Dmd, in humans. Genetics. 2000;155:1855–64.PubMedPubMed CentralGoogle Scholar
- Gazave E, Marqués-Bonet T, Fernando O, Charlesworth B, Navarro A. Patterns and rates of intron divergence between humans and chimpanzees. Genome Biol. 2007;8:R21.PubMedPubMed CentralView ArticleGoogle Scholar
- Ding Y, Larson G, Rivas G, Lundberg C, Geller L, Ouyang C, et al. Strong signature of natural selection within an FHIT intron implicated in prostate cancer risk. PLoS One. 2008;3(10):e3533.PubMedPubMed CentralView ArticleGoogle Scholar
- Szabó JA, Szilágyi Á, Doleschall Z, Patócs A, Farkas H, Prohászka Z, et al. Both positive and negative selection pressures contribute to the polymorphism pattern of the duplicated human CYP21A2 gene. PLoS One. 2013;8(11):e81977.PubMedPubMed CentralView ArticleGoogle Scholar
- Schaschl H, Huber S, Schaefer K, Windhager S, Wallner B, Fieder M. Signatures of positive selection in the cis-regulatory sequences of the human oxytocin receptor (OXTR) and arginine vasopressin receptor 1a (AVPR1A) genes. BMC Evol Biol. 2015;15:85.PubMedPubMed CentralView ArticleGoogle Scholar
- Simonsen KL, Churchill GA, Aquadro CJ. Properties of statistical tests of neutrality for DNA polymorphism data. Genetics. 1995;141:413–29.PubMedPubMed CentralGoogle Scholar
- Jensen JD, Thornton KR, Aquadro CF. Inferring selection in partially sequenced regions. Mol Biol Evol. 2008;25:438–46.PubMedView ArticleGoogle Scholar
- Slatkin M, Hudson RR. Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics. 1991;129:555–62.PubMedPubMed CentralGoogle Scholar
- Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2013;41(D1):D36–42.PubMedPubMed CentralView ArticleGoogle Scholar
- Nei M. Molecular Evolutionary Genetics. New York: Columbia University Press; 1987.Google Scholar
- Watterson GA. On the number of segregating sites in genetical models without recombination. Theor Popul Biol. 1975;10:256–76.View ArticleGoogle Scholar
- Jukes TH, Cantor CR. Evolution of protein molecules. In: Munro HM, editor. Mammalian Protein Metabolism. New York: Academic; 1969. p. 21–120.View ArticleGoogle Scholar