- Research article
Low linkage disequilibrium in wild Anopheles gambiae s.l. populations
BMC Geneticsvolume 11, Article number: 81 (2010)
In the malaria vector Anopheles gambiae, understanding diversity in natural populations and genetic components of important phenotypes such as resistance to malaria infection is crucial for developing new malaria transmission blocking strategies. The design and interpretation of many studies here depends critically on Linkage disequilibrium (LD). For example in association studies, LD determines the density of Single Nucleotide Polymorphisms (SNPs) to be genotyped to represent the majority of the genomic information. Here, we aim to determine LD in wild An. gambiae s.l. populations in 4 genes potentially involved in mosquito immune responses against pathogens (Gambicin, NOS, REL2 and FBN9) using previously published and newly generated sequences.
The level of LD between SNP pairs in cloned sequences of each gene was determined for 7 species (or incipient species) of the An. gambiae complex. In all tested genes and species, LD between SNPs was low: even at short distances (< 200 bp), most SNP pairs gave an r2 < 0.3. Mean r2 ranged from 0.073 to 0.766. In most genes and species LD decayed very rapidly with increasing inter-marker distance.
These results are of great interest for the development of large scale polymorphism studies, as LD generally falls below any useful limit. It indicates that very fine scale SNP detection will be required to give an overall view of genome-wide polymorphism. Perhaps a more feasible approach to genome wide association studies is to use targeted approaches using candidate gene selection to detect association to phenotypes of interest.
When alleles at different loci appear together in individuals more often than would be expected by chance they are said to be in Linkage Disequilibrium (LD) . LD is an indicator of the rate of recombination events between markers during meiosis. In addition to nucleotide distance, the effective recombination rate can be affected by numerous forces in natural populations such as selection that maintains certain allele associations (epistasis), genetic drift, population structure and demographic changes. Non random association between variants has recently become the focus of intense study in the hope that it might facilitate the mapping of complex trait loci through genome wide association studies (GWAS). Indeed, recent progress in the technological ability to genotype genetic variation  opens promising possibilities for identification of variants linked to phenotypes of interest. However, the ability to detect association critically depends on the extent of LD between causative alleles and surrounding markers. When LD extends over large genomic regions, there is a higher chance of finding association with the drawback that the potentially long physical distance between the gene of interest and an associated marker can make causative gene identification tedious. On the other hand, limited LD requires a much denser marker map to find associations, but, when found, identifying the causative allele is expected to be more straightforward .
The Anopheles gambiae species complex is of great interest due to its substantial role in malaria transmission. The two molecular forms or incipient species, M and S, of Anopheles gambiae s.s. and Anopheles arabiensis are major vectors throughout sub Saharan Africa. The other species of the complex can have high local importance in malaria transmission or more minor roles depending on their biology and distribution . Due to its epidemiological importance, several genomes [5, 6] and genes of interest [e.g. [7–10]] have been sequenced in An. gambiae s.s., providing a large data set of SNPs (Single Nucleotide Polymorphisms). Many of these variants are shared with the other members of the complex [7, 8]. Today, the development of large scale genotyping tools makes the implementation of GWAS in An. gambiae s.s. and possibly the other members very realistic. However, to date little is known about LD in natural populations. In An. gambiae s.s., analysis of SNPs in six genes located in and around the 2La chromosomal inversion revealed LD over more than 30 Mbp . Strong LD was also detected within and between centromeric regions of the two incipient species M and S of An. gambiae s.s.. This LD likely indicated subdivided populations adapted to different environments or incipient species, indeed, limited recombination rates in these chromosomal areas are hypothesized to be involved in environmental adaptation by maintaining combinations of alleles adapted to given conditions  and/or implicated in the speciation process through cause or consequence . In other regions of the genome, very little data is available to our knowledge in natural populations; LD was measured only to exclude redundancy of markers or Wahlund effect in population genetic studies and rarely showed linked markers [e.g. [14–16]]. For instance, Lehmann et al. tested the LD between microsatellite markers in several populations and observed only few pairwise combinations in significant LD, corresponding to the proportion of the tests expected to be significant by chance alone. This indicated random association between the tested markers spread throughout the genome of An. gambiae s.s., however this is only mildly informative for short distance LD and the density of markers to be used in association studies.
In this study, we aimed to determine LD in An. gambiae s.l. between short range markers. LD data will be particularly informative for mapping genes involved in susceptibility to infection. In this context, immune related genes are primary candidates and rates of LD decay in these genes will be crucial in determining the density of markers to be used in such association studies. Four immune related genes were selected, namely, Gambicin, NOS, REL2 and FBN9, representative of different functions in the immune response and located in different chromosomal regions. Gambicin codes for an important antimicrobial peptide, which currently has no known specificity to Plasmodium. NOS codes for Nitric oxide synthase, which markedly controls the infection level of Plasmodium in Anopheles[18, 19] but is likely to play different roles depending on parasite species . REL2 is an NF-KappaB-like transcription factor that affects the development of Plasmodium in Anopheles in a conserved manner across several species of parasite and mosquito . FBN9 codes for a Fibrinogen-domain protein whose silencing increases P. falciparum development .
For each gene, sequences of seven species of the An. gambiae complex were analyzed and LD between polymorphic sites measured to estimate the extent of information given by genotyping a single polymorphism. We used previously published sequences  for 6 of the species and produced new data for a population of An. gambiae s.s M molecular form from Cameroon. Phased sequences were used to provide accurate haplotype data for estimating LD. To our knowledge, this is the first study on LD in An. gambiae based on phased sequences, allowing powerful analysis of LD decay over short distances. Moreover it is informative across almost all species members of the An. gambiae complex and focuses on genes of interest for future association studies.
Phased sequences, resulting from cloned DNA, were previously published for the genes Gambicin (AGAP008645), NOS (AGAP008255), REL2 (AGAP006747) and FBN9 (AGAP011197) for 5 to 14 field collected individuals of An. gambiae S form, An. arabiensis, An. melas, An. merus, An. quadriannulatus A and An. bwambae. No evidence for positive selection was identified in these genes. Here, we provide data for the corresponding gene fragments for 16 individuals of An. gambiae M form collected in Simbock in South Cameroon.
Genes were localized on the An. gambiae s.s. genome using Vectorbase , and their positions relative to polymorphic chromosomal inversions determined . Gambicin is positioned in subdivision 31A on chromosome 3R, where the inversions 3Ra in An. arabiensis and 3Re in An. melas are known to be polymorphic. NOS is in the subdivision 30A on chromosome 3R, it is located in polymorphic inversions 3Ra in An. arabiensis, 3Rb in An. bwambae and 3Re in An. melas. FBN9 is in subdivision 42A on chromosome 3L, where no polymorphic inversions are known within each species. REL2 is in subdivision 25D on chromosome 2L, here only the rare inversion 2Ld is known to be polymorphic in An. arabiensis but was not observed in Cameroon where the specimens were collected (Frederic Simard, personal communication).
To sequence An. gambiae M form individuals, DNA was extracted from mosquitoes as previously described . Species and molecular forms were determined by diagnostic PCR . Gene amplifications were carried out using the external primers and conditions previously published . PCR products were cloned using the TOPO TA Cloning® Kit for Sequencing (Invitrogen) and a minimum of five transformed colonies selected for sequencing. Inserts were amplified by PCR from the plasmid using the same external primers/conditions and were sequenced in both directions using the Big Dye Terminator v3.1 Sequencing Kit (Applied Biosystems). Sequences were verified by eye in SeqScape (Applied Biosystems) and aligned in Mega v.4.0.2 . High Fidelity Taq was used (Platinum® Taq DNA Polymerase High Fidelity, Invitrogen) in all PCRs to limit miss-incorporations. Sequences have been submitted to GenBank under accession numbers GU990095 to GU990222.
Analyses of polymorphism were carried out on all the previously published and new An. gambiae M form sequences using DNAsp v.5.10 . LD was measured as r for each pair of SNPs in each gene and species, significance (P < 0.05) was tested using Fisher's exact test in DNAsp and the Bonferroni procedure applied to correct for multiple testing. r was converted to r2 and graphs of r2 relative to the distance between pairs of polymorphic sites plotted in R v.2.10.0 . LD decay lines were modeled again in R by fitting data to the expectations of a simple population genetic model  using the non-linear least squares method, and to a nonparametric model. The non-linear fit to expectations failed (nls function in R failed to converge) for some species/locus combinations because the observed patterns deviated too much from the expectations, in particular, due to them showing an increase in LD at short distances, or, LD measures that exceed the maximum allowed by the analytical expression [ p.77]. Hence, only the results of the nonparametric model will be presented. The non-parametric model was a "generalized additive model" where the fit is a linear combination of observed values, whose coefficients are given by a cubic spline and the degree of smoothing determined by generalized cross-validation . This computation was performed using the gam function from the mgcv package in R.
Moreover, in order to detect LD haploblocks, grid plots were generated using Haploview 4.2  for the An. gambiae M sequences for each of the four tested genes.
To calculate whether there are significant differences in LD between genes and species, mean values of the r2 estimates were considered in each locus and species. The 7 values per locus (corresponding to the 7 species) or 4 values per species (corresponding to the 4 loci) were listed and compared by the Wilcoxon test by pair of loci or pair of species. As the numbers of values included in the tests were critically low, we attempted to increase the power of the tests by generating 4 values per locus and species. For that purpose, the sequences were cut into 4 segments of equal length and mean r2 calculated for each segment to give independent values (4 mean values for one gene in one species). The 4 mean r2 values were listed and grouped either according to gene (4 values for each of the 7 species resulting in 28 values in each list) or species (4 values for each of the 4 loci resulting in 16 values in each list) and the Wilcoxon test calculated in R to look for significant differences between groups (either each gene or each species).
Results and discussion
Sequences for 16 individuals of the An. gambiae M form were analyzed for the four genes. The polymorphism parameters are given in Table 1. Genetic diversity (Pi) in An. gambiae M form, ranging from 0.0099 to 0.0253, has comparable values to other species of the complex  and to other immunity genes [30–32]. As expected, the inclusion of introns increases the diversity compared to studies including only coding regions . The high number of variant sites in An. gambiae M form and other species allowed a large number of pairwise measures of LD in almost all species and tested genes (Table 2). A very small proportion of pairwise measures was significantly in LD, although An. gambiae M revealed slightly more significant tests than other species, most likely a consequence of the bigger sample size (Table 2). Only REL2 in An. bwambae was not polymorphic. Plots of r2 as a function of nucleotide distance are presented in Figure 1 for all species and genes and grid plots for An. gambiae M in Figure 2. Mean values of r2 (using whole sequences) for each gene and species are shown in Table 2.
Mean r2 values in genes and species range from 0.073 to 0.766, with most falling below 0.3. Only three mean values exceed this threshold; they correspond to NOS in An. quadriannulatus, REL2 in An. melas and FBN9 in An. bwambae. These genes are not positioned in polymorphic chromosomal inversions in these species suggesting that the higher values of r2 are not the result of limited recombination rates due to chromosomal arrangements. All other mean values are below 0.3, which reveals very limited LD taking into account the short distance between polymorphic sites (maximum distance between SNPs is 1176 bases). Indeed, compared to other species, the LD values observed here are among the lowest. Similar low LD levels were previously observed, mainly in plants [33–36]. Much higher LD is commonly observed in a wide range of species, for example cultivated and wild plants [37, 38], birds  and mammals [39–43]. In Drosophila, LD often appeared to be higher than what we observed in Anopheles, but was also very variable depending on gene and genomic region [44–46] probably as a result of variable recombination rates and natural selection . In our data, the comparison of mean r2 values in full sequences and sequence fragments revealed no significant differences between species or genes. This suggests that in different genomic regions, and members of the An. gambiae complex, LD in immunity genes, and probably others, is limited. However the small number of genes and chromosomal regions tested in the present study cannot allow conclusion for the whole An. gambiae genome as variations along the chromosomes and between mosquito populations are expected. In particular, the effect of chromosomal inversions was not tested here. The populations of An. gambiae s.s. (M and S) from Cameroon are known to be almost fixed for the standard chromosomal arrangements . Testing the effect of the major chromosomal inversions of An. gambiae s.s. on LD would require sampling of populations polymorphic for these inversions, karyotyping the chromosomes and sequencing numerous genes inside and outside the inversions. Such a study would be of high interest as the resolution necessary for association studies could vary in chromosomal inversions or other genomic regions, and important genes involved in adaptation or controlling Plasmodium infection are expected to be located inside chromosomal inversions [22, 49].
LD curves in An. gambiae species show very fast decay for most of the tested genes and species. Generally, at very short nucleotide distances, less than 200 bases, LD decay curves were below an r2 of 0.3, although sporadic LD peaks were observed over distance. In exception to this is An. bwambae FBN9 whose decay curve never falls as low as r2 = 0.3 in the 807 base region tested (Figure 1). The grid plots of An. gambiae M form showed similar results with few pairwise estimates in LD. Interestingly some haploblocks were observed (in Gambicin and NOS) showing strong LD between markers. This pattern was observed only for very short distance markers, not more distant than 50 base pairs (Figure 2). This suggested variation in LD along the genes but a very limited extension of high LD blocks. In association studies, depending on the contribution of the causative allele on the observed phenotype and the sample size, a minimum threshold of r2 ≥ 0.33 to 0.8 can be used to consider whether SNPs above this limit with the causative SNP are potentially indicative of association [47, 50]. This suggests that, for GWAS in An. gambiae s.l., very dense marker coverage will be required and the development of high-throughput genotyping tools essential for whole genome scans. However, in candidate gene association studies, a very high resolution genetic map can be more feasible by limiting genotyping to genes that have functional relevance. Also a rapid breakdown of LD will be favorable for identification of causative genes located in quantitative trait loci (QTL) by favoring high resolution mapping.
LD in a population is the result of various parameters, for example recombination and mutation rates, population structure and demographic history . In species of the An. gambiae complex, their demographic history has been closely linked to anthropic changes  and their population sizes probably drastically increased some thousands of years ago. Rapid population growth decreases LD, and could be one of the reasons for the limited LD we observe here but could also be in part due to high mutation and recombination rates.
In this study, we observed limited LD in wild populations of seven species of the An. gambiae complex in four immunity genes. This suggests that GWAS will require a huge effort in genotyping and that a more realistic approach might be to search for functional variations in putative candidate loci. On the other hand, the rapid decay of LD suggests that it will be possible to map functional variation at very fine scales in An. gambiae s.l. populations. The present study however involved a limited number of chromosomal regions and populations so further work is needed to ascertain LD throughout An. gambiae genomes.
Slatkin M: Linkage disequilibrium--understanding the evolutionary past and mapping the medical future. Nat Rev Genet. 2008, 9: 477-485. 10.1038/nrg2361.
LaFramboise T: Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances. Nucleic Acids Res. 2009, 37: 4181-4193. 10.1093/nar/gkp552.
Backstrom N, Qvarnstrom A, Gustafsson L, Ellegren H: Levels of linkage disequilibrium in a wild bird population. Biol Lett. 2006, 2: 435-438. 10.1098/rsbl.2006.0507.
Gillies MT, De Meillon B: The Anophelinae of Africa South of the Sahara. 1968, Johannesburg, South Africa
Holt RA, Subramanian GM, Halpern A, Sutton GG, Charlab R, Nusskern DR, Wincker P, Clark AG, Ribeiro JM, Wides R, et al: The genome sequence of the malaria mosquito Anopheles gambiae. Science. 2002, 298: 129-149. 10.1126/science.1076181.
Cohuet A, Krishnakumar S, Simard F, Morlais I, Koutsos A, Fontenille D, Mindrinos M, Kafatos FC: SNP discovery and molecular evolution in Anopheles gambiae, with special emphasis on innate immune system. BMC Genomics. 2008, 9: 227-10.1186/1471-2164-9-227.
Parmakelis A, Slotman MA, Marshall JC, Awono-Ambene PH, Antonio-Nkondjio C, Simard F, Caccone A, Powell JR: The molecular evolution of four anti-malarial immune genes in the Anopheles gambiae species complex. BMC Evol Biol. 2008, 8: 79-10.1186/1471-2148-8-79.
Morlais I, Poncon N, Simard F, Cohuet A, Fontenille D: Intraspecific nucleotide variation in Anopheles gambiae: new insights into the biology of malaria vectors. Am J Trop Med Hyg. 2004, 71: 795-802.
Wilding CS, Weetman D, Steen K, Donnelly MJ: High, clustered, nucleotide diversity in the genome of Anopheles gambiae revealed through pooled-template sequencing: implications for high-throughput genotyping protocols. BMC Genomics. 2009, 10: 320-10.1186/1471-2164-10-320.
Black WCt, Gorrochetegui-Escalante N, Randle NP, Donnelly MJ: The Yin and Yang of linkage disequilibrium: mapping of genes and nucleotides conferring insecticide resistance in insect disease vectors. Adv Exp Med Biol. 2008, 627: 71-83. full_text.
White BJ, Cheng C, Simard F, Costantini C, Besansky NJ: Genetic association of physically unlinked islands of genomic divergence in incipient species of Anopheles gambiae. Mol Ecol. 2010
Ayala FJ, Coluzzi M: Chromosome speciation: humans, Drosophila, and mosquitoes. Proc Natl Acad Sci USA. 2005, 102 (Suppl 1): 6535-6542. 10.1073/pnas.0501847102.
Stump AD, Shoener JA, Costantini C, Sagnon N, Besansky NJ: Sex-linked differentiation between incipient species of Anopheles gambiae. Genetics. 2005, 169: 1509-1519. 10.1534/genetics.104.035303.
Slotman MA, Tripet F, Cornel AJ, Meneses CR, Lee Y, Reimer LJ, Thiemann TC, Fondjo E, Fofana A, Traore SF, Lanzaro GC: Evidence for subdivision within the M molecular form of Anopheles gambiae. Mol Ecol. 2007, 16: 639-649. 10.1111/j.1365-294X.2006.03172.x.
Lehmann T, Hawley WA, Grebert H, Collins FH: The effective population size of Anopheles gambiae in Kenya: implications for population structure. Mol Biol Evol. 1998, 15: 264-276.
Dong Y, Aguilar R, Xi Z, Warr E, Mongin E, Dimopoulos G: Anopheles gambiae immune responses to human and rodent Plasmodium parasite species. PLoS Pathog. 2006, 2: e52-10.1371/journal.ppat.0020052.
Luckhart S, Vodovotz Y, Cui L, Rosenberg R, Cui LW: The mosquito Anopheles stephensi limits malaria parasite development with inducible synthesis of nitric oxide. Proceedings of the National Academy of Sciences of the United States of America. 1998, 95: 5700-5705. 10.1073/pnas.95.10.5700.
Gupta L, Molina-Cruz A, Kumar S, Rodrigues J, Dixit R, Zamora RE, Barillas-Mury C: The STAT pathway mediates late-phase immunity against Plasmodium in the mosquito Anopheles gambiae. Cell Host Microbe. 2009, 5: 498-507. 10.1016/j.chom.2009.04.003.
Tahar R, Boudin C, Thiery I, Bourgouin C: Immune response of Anopheles gambiae to the early sporogonic stages of the human malaria parasite Plasmodium falciparum. EMBO J. 2002, 21: 6673-6680. 10.1093/emboj/cdf664.
Garver LS, Dong Y, Dimopoulos G: Caspar controls resistance to Plasmodium falciparum in diverse anopheline species. PLoS Pathog. 2009, 5: e1000335-10.1371/journal.ppat.1000335.
Coluzzi M, Sabatini A, della Torre A, Di Deco MA, Petrarca V: A polytene chromosome analysis of the Anopheles gambiae species complex. Science. 2002, 298: 1415-1418. 10.1126/science.1077769.
Fanello C, Santolamazza F, della Torre A: Simultaneous identification of species and molecular forms of the Anopheles gambiae complex by PCR-RFLP. Med Vet Entomol. 2002, 16: 461-464. 10.1046/j.1365-2915.2002.00393.x.
Kumar S, Nei M, Dudley J, Tamura K: MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform. 2008, 9: 299-306. 10.1093/bib/bbn017.
Librado P, Rozas J: DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25: 1451-1452. 10.1093/bioinformatics/btp187.
R Development Core Team. [http://www.R-project.org]
Hill WG, Weir BS: Variances and covariances of squared linkage disequilibria in finite populations. Theor Popul Biol. 1988, 33: 54-78. 10.1016/0040-5809(88)90004-4.
Hastie T, Tibshirani R, Friedman J: The elements of statistical learning: data mining, inference, and prediction. 2009, N. Y.: Springer
Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21: 263-265. 10.1093/bioinformatics/bth457.
Simard F, Licht M, Besansky NJ, Lehmann T: Polymorphism at the defensin gene in the Anopheles gambiae complex: testing different selection hypotheses. Infect Genet Evol. 2007, 7: 285-292. 10.1016/j.meegid.2006.11.004.
Little TJ, Cobbe N: The evolution of immune-related genes from disease carrying mosquitoes: diversity in a peptidoglycan- and a thioester-recognizing protein. Insect Mol Biol. 2005, 14: 599-605. 10.1111/j.1365-2583.2005.00588.x.
Dassanayake RS, Silva Gunawardene YI, Tobe SS: Evolutionary selective trends of insect/mosquito antimicrobial defensin peptides containing cysteine-stabilized alpha/beta motifs. Peptides. 2007, 28: 62-75. 10.1016/j.peptides.2006.09.022.
Brown GR, Gill GP, Kuntz RJ, Langley CH, Neale DB: Nucleotide diversity and linkage disequilibrium in loblolly pine. Proc Natl Acad Sci USA. 2004, 101: 15255-15260. 10.1073/pnas.0404231101.
Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J, Kresovich S, Goodman MM, Buckler ESt: Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc Natl Acad Sci USA. 2001, 98: 11479-11484. 10.1073/pnas.201394398.
Xing Y, Frei U, Schejbel B, Asp T, Lubberstedt T: Nucleotide diversity and linkage disequilibrium in 11 expressed resistance candidate genes in Lolium perenne. BMC Plant Biol. 2007, 7: 43-10.1186/1471-2229-7-43.
Ingvarsson PK: Nucleotide polymorphism and linkage disequilibrium within and among natural populations of European aspen (Populus tremula L., Salicaceae). Genetics. 2005, 169: 945-953. 10.1534/genetics.104.034959.
Kolkman JM, Berry ST, Leon AJ, Slabaugh MB, Tang S, Gao W, Shintani DK, Burke JM, Knapp SJ: Single nucleotide polymorphisms and linkage disequilibrium in sunflower. Genetics. 2007, 177: 457-468. 10.1534/genetics.107.074054.
Olson MS, Robertson AL, Takebayashi N, Silim S, Schroeder WR, Tiffin P: Nucleotide diversity and linkage disequilibrium in balsam poplar (Populus balsamifera). New Phytol. 2010, 186: 526-536. 10.1111/j.1469-8137.2009.03174.x.
Hernandez RD, Hubisz MJ, Wheeler DA, Smith DG, Ferguson B, Rogers J, Nazareth L, Indap A, Bourquin T, McPherson J, et al: Demographic histories and patterns of linkage disequilibrium in Chinese and Indian rhesus macaques. Science. 2007, 316: 240-243. 10.1126/science.1140462.
Amaral AJ, Megens HJ, Crooijmans RP, Heuven HC, Groenen MA: Linkage disequilibrium decay and haplotype block structure in the pig. Genetics. 2008, 179: 569-579. 10.1534/genetics.107.084277.
Hinds DA, Stuve LL, Nilsen GB, Halperin E, Eskin E, Ballinger DG, Frazer KA, Cox DR: Whole-genome patterns of common DNA variation in three human populations. Science. 2005, 307: 1072-1079. 10.1126/science.1105436.
Long JR, Zhao LJ, Liu PY, Lu Y, Dvornyk V, Shen H, Liu YJ, Zhang YY, Xiong DH, Xiao P, Deng HW: Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes. BMC Genet. 2004, 5: 11-10.1186/1471-2156-5-11.
Laurie CC, Nickerson DA, Anderson AD, Weir BS, Livingston RJ, Dean MD, Smith KL, Schadt EE, Nachman MW: Linkage disequilibrium in wild mice. PLoS Genet. 2007, 3: e144-10.1371/journal.pgen.0030144.
Schaeffer SW, Goetting-Minesky MP, Kovacevic M, Peoples JR, Graybill JL, Miller JM, Kim K, Nelson JG, Anderson WW: Evolutionary genomics of inversions in Drosophila pseudoobscura: evidence for epistasis. Proc Natl Acad Sci USA. 2003, 100: 8319-8324. 10.1073/pnas.1432900100.
Wang W, Thornton K, Emerson JJ, Long M: Nucleotide variation and recombination along the fourth chromosome in Drosophila simulans. Genetics. 2004, 166: 1783-1794. 10.1534/genetics.166.4.1783.
Schlenke TA, Begun DJ: Linkage disequilibrium and recent selection at three immunity receptor loci in Drosophila simulans. Genetics. 2005, 169: 2013-2022. 10.1534/genetics.104.035337.
Ardlie KG, Kruglyak L, Seielstad M: Patterns of linkage disequilibrium in the human genome. Nat Rev Genet. 2002, 3: 299-309. 10.1038/nrg777.
Wondji C, Frederic S, Petrarca V, Etang J, Santolamazza F, Della Torre A, Fontenille D: Species and populations of the Anopheles gambiae complex in Cameroon with special emphasis on chromosomal and molecular forms of Anopheles gambiae s.s. J Med Entomol. 2005, 42: 998-1005. 10.1603/0022-2585(2005)042[0998:SAPOTA]2.0.CO;2.
Riehle MM, Markianos K, Niare O, Xu J, Li J, Toure AM, Podiougou B, Oduol F, Diawara S, Diallo M, et al: Natural malaria infection in Anopheles gambiae is regulated by a single genomic control region. Science. 2006, 312: 577-579. 10.1126/science.1124153.
Carlson CS, Eberle MA, Rieder MJ, Yi Q, Kruglyak L, Nickerson DA: Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet. 2004, 74: 106-120. 10.1086/381000.
Hume JC, Lyons EJ, Day KP: Human migration, mosquitoes and the evolution of Plasmodium falciparum. Trends Parasitol. 2003, 19: 144-149. 10.1016/S1471-4922(03)00008-4.
This work was funded by an ANR grant awarded to A.C..
A.C., I.M., F.R. and D.F. designed the study; C.H. carried out the experiments; A.C., C.H. and F.R. analysed the data; and A.C., C.H., F.R., I.M. and D.F. wrote the paper. All authors read and approved the final manuscript.