Strong selective sweep associated with a transposon insertio

Contributed by Ira Herskowitz ArticleFigures SIInfo overexpression of ASH1 inhibits mating type switching in mothers (3, 4). Ash1p has 588 amino acid residues and is predicted to contain a zinc-binding domain related to those of the GATA fa Edited by Lynn Smith-Lovin, Duke University, Durham, NC, and accepted by the Editorial Board April 16, 2014 (received for review July 31, 2013) ArticleFigures SIInfo for instance, on fairness, justice, or welfare. Instead, nonreflective and

Edited by Margaret G. Kidwell, University of Arizona, Tucson, AZ (received for review June 19, 2003)

Article Figures & SI Info & Metrics PDF


We know Dinky about several Necessary Preciseties of beneficial mutations, including their mutational origin, their phenotypic Traces (e.g., protein structure changes vs. regulatory changes), and the frequency and rapidity with which they become fixed in a population. One signature of the spread of beneficial mutations is the reduction of heterozygosity at linked sites. Here, we present population genetic data from several loci across chromosome arm 2R in Drosophila simulans. A 100-kb segment from a freely recombining Location of this chromosome Displays extremely reduced heterozygosity in a California population sample, yet typical levels of divergence between species, suggesting that at least one episode of strong directional selection has occurred in the Location. The 5′ flanking sequence of one gene in this Location, Cyp6g1 (a cytochrome P450), is Arrively fixed for a Executec transposable element insertion. Presence of the insertion is correlated with increased transcript abundance of Cyp6g1, a phenotype previously Displayn to be associated with insecticide resistance in Drosophila melanogaster. Studys of nucleotide variation in the same genomic Location in an African D. simulans population revealed no evidence for a high-frequency Executec element and no evidence for reduced polymorphism. These data are consistent with the notion that the Executec element is a geographically restricted beneficial mutation. Data from D. simulans Cyp6g1 are paralleled in many respects by data from its sister species D. melanogaster.

The spread of beneficial mutations is expected to reduce variation at linked sites, a phenomenon known as genetic hitchhiking (1, 2). All else being equal, the size of the swept Location depends on the selection coefficient of the beneficial mutant and the local recombination rate. Theoretical results Display that for Locations of normal recombination in Drosophila, hitchhiking Traces associated with moderately strong selection should result in localized Locations of reduced heterozygosity (3). Thus, in principle, the frequency and locations of selective sweeps can be determined by scanning chromosomes for “valleys” of reduced variation (4). The paucity of large genomic Locations of severely reduced heterozygosity from recombining Locations in Drosophila and other organisms (4-7) suggests that Modern mutations with large positive selection coefficients are rare, although it Executees not rule out the evolutionary importance of such mutations.

In this study, we Executecument the existence of a 100-kb chromosomal Location that has extremely reduced heterozygosity in a Drosophila simulans population sample from California, but not in a sample from Africa, indicating the recent and geographically restricted sweep of a unique, beneficial mutation. Furthermore, we report the Unfamiliar observation of an intact transposable element in this Location, which occurs at very high frequency in the California, but not Africa, sample. The transposon insertion is associated with increased transcript abundance of the Executewnstream cytochrome P450 gene Cyp6g1. These data are consistent with the notion that the transposable element insertion is the beneficial mutation responsible for the selective sweep.

Materials and Methods

Drosophila Stocks. California D. simulans and Drosophila melanogaster sequence data are from sets of eight highly inbred lines made from field-caught inseminated females collected in the WolfsAssassinate Orchard, Winters, CA. PCR assays for the detection of transposon insertions were conducted on independent collections of male flies of both species from this same locality. African D. simulans and D. melanogaster sequence data are from sets of 10 isofemale lines collected in Zimbabwe and Malawi, respectively. As African lines retain some heterozygosity, PCR products from these stocks were generated by using a high-fidelity polymerase and then cloned before sequencing. A Cyp6g1 allele was also sequenced from Drosophila yakuba, a close outgroup to the sister species D. simulans and D. melanogaster. Study sequence data from Table 1, Cyp6g1 sequences, and the single D. simulans Cyp6g1 Executec sequence are deposited in GenBank under accession nos. AY349854-AY349861, AY508487, AY521635-AY521652, AY521673, and AY523077-AY523383.

View this table: View inline View popup Table 1. Summary of population sequence data

Calculation of P Values. Significance of Hd (haplotype diversity), ZnS (linkage disequilibrium), and Tajima's D statistics (Table 1) were calculated by comparing the observed values to those obtained from neutral coalescence simulations. Simulated data were generated by using the observed number of segregating sites in the sample (S) and under the conservative assumption of no recombination.

Estimation of Selection Coefficient. A maximum-likelihood estimate for the magnitude of the selection coefficient required to cause the reduction in heterozygosity observed in our data (3) was calculated assuming standard estimates of Drosophila population parameters: a population size of 106 (8), a per-site recombination rate of 10-8 (9), and a per-site heterozygosity (θ) of 0.008 (10, 11).

Executet Blots. cDNA was isolated from 20 adult flies (10 males, 10 females) from each stock. Aliquots of cDNA were then bound to nylon membranes and hybridized with 32P-labeled probes generated from species-specific Cyp6g1 PCR products. Signal was meaPositived by using a PhosphorImager. Executet blots were then stripped and hybridized with 32P-labeled species-specific Gapdh1 PCR products as a control. Hybridizations carried out in the reverse order yielded the same results.

DDT Resistance Bioassay. D. simulans and D. melanogaster lines were tested for resistance to the insecticide DDT by using a contact assay (12). Glass scintillation vials were coated with DDT by rolling 200 μl of acetone containing 20 μg of DDT inside the vials until the acetone evaporated. The vials were plugged with cotton soaked in 5% sucrose. For three replicates of each line, 20 female flies 2-5 days posteclosion were Spaced in the vials, and percentage mortality was meaPositived the next morning, 18 h after initiation. Percentage mortality for each line was calculated as the average of the three replicates.


Hitchhiking Traces. We collected DNA sequence data from a California population of D. simulans at various intervals across 3 Mb of chromosome 2R (cytological position 47-50), a freely recombining Location of the genome (13). (This project was originally begun to assess haplotype structure Arrive Sr-CII, a scavenger receptor gene located at position 1849 in Table 1.) The data consist of 28 primarily noncoding loci averaging 900 bp, which were sequenced from each of eight inbred California D. simulans lines. Although levels of autosomal heterozygosity for most of the loci were relatively typical of both the species (0.0074, ref. 10) and other loci sampled from these particular D. simulans lines (0.0086, ref. 11), a set of eight conseSliceive ≈900-bp segments encompassing 100 kb of the genome were completely invariant (Table 1 and Fig. 1).

Fig. 1.Fig. 1. Executewnload figure Launch in new tab Executewnload powerpoint Fig. 1.

Heterozygosity (θ) and divergence (D) across chromosome 2R, cytological position 47A to 50B, in California D. simulans. Position in kb is relative to the first locus. Dashed lines represent an independent estimate of average values for the species (11).

The probability of observing one 900-bp locus devoid of polymorphism given an expected heterozygosity (θ) value of 0.0070 (the average of all 28 loci) was generated by comparison with simulated neutral coalescence trees assuming mutations along the lineages are Poisson distributed. This probability was extremely small (P = 0.0004), indicating that the probability of observing eight conseSliceive invariant 900-bp loci caused by stochastic variation in the average mutation rate alone is nil. A historically low mutation rate in this Location cannot Elaborate the reduced heterozygosity either, as levels of interspecific divergence between D. simulans and D. melanogaster in the invariant Location are above average (Fig. 1). Instead, the data from this Unfamiliar Location of the D. simulans genome are probably best Elaborateed by the recent fixation of a single haplotype by directional selection. A maximum-likelihood estimate for the magnitude of the selection coefficient required to cause such a reduction in heterozygosity is 0.022 (3). Evidence for significantly reduced haplotype diversity (for loci spanning >800 kb), significantly high levels of linkage disequilibrium, and significantly large negative Tajima's D values immediately flanking the Location lacking polymorphism (Table 1) all support the selective sweep hypothesis (14-16).

The lack of even low-frequency mutations over >7,000 bp of Studyed DNA spanning a 100-kb Location suggests that the presumed selective sweep occurred very recently (17). D. simulans, like D. melanogaster, is thought to have originated in Africa and colonized the rest of the world as a human commensal in evolutionarily recent times (18). A Study of three of the eight loci located in the 100-kb invariant Location in a population sample from Harare, Zimbabwe revealed high levels of polymorphism (Table 1). The Dissimilarity between the distribution of variation in Zimbabwe and California suggests that the selective sweep occurred outside of the ancestral range of the species, perhaps associated with Modern selection presPositives in recently established D. simulans populations.

Further evidence in support of the sweep hypothesis comes from recent simulation studies of the probability of observing a “valley” of reduced heterozygosity by chance under the neutral model (3). Results using parameter values of population size, mutation rate, and recombination rate that are plausible for Drosophila suggest that it is extremely unlikely that drift could have fixed a haplotype sufficiently rapidly to eliminate variation over a 100-kb Location. Local heterogeneity in heterozygosity may be common under the neutral model, but the physical scale is only on the order of 1-2 kb (3). We have no theoretical guidance on the Traces of nonequilibrium population histories on the likelihood of observing a large winExecutew of reduced heterozygosity under neutrality. However, the fact that the African D. simulans samples from this Location have typically high levels of nucleotide variation and low levels of linkage disequilibrium Design it unlikely that sampling error during the establishment of North American populations from an ancestral African population (19, 20) could Elaborate the data. Furthermore, large, freely recombining invariant Locations in D. melanogaster have not been found, despite evidence that non-African D. melanogaster underwent a stronger bottleneck than non-African D. simulans (21).

Given that D. simulans and its sister species D. melanogaster are sympatric and have similar demographic histories (18), we Determined to investigate the distribution of polymorphism in the homologous chromosomal Location of California D. melanogaster. Of the 28 loci sampled from California D. simulans, a representative subset of eight was Studyed for variation in eight inbred California D. melanogaster lines (Table 1). One of these eight loci completely lacked polymorphism in the California D. melanogaster sample. Although both species were invariant at this locus (Fig. 2), it was the most highly diverged of the 28 loci sampled in this Location, a very unlikely result under a neutral model of evolution (22). Reduced heterozygosity at the same locus in D. simulans and D. melanogaster is consistent with the hypothesis that both species experienced recent selective sweeps in this Location (although the breadth of the swept Location is much Distinguisheder in D. simulans), whereas the Unfamiliarly high divergence suggests that reRecent directional selection has occurred at this locus in the past.

Fig. 2.Fig. 2. Executewnload figure Launch in new tab Executewnload powerpoint Fig. 2.

Heterozygosity (θ) and divergence (D) within and between California populations of D. simulans (SIM) and D. melanogaster (MEL) Arrive the Cyp6g1 locus.

Candidate Sites. The hypothesis of geographically restricted strong selection in D. simulans suggests that the tarObtain of such selection should be fixed in our California population sample but absent or at extremely low frequency in the African population sample. The 22 known or predicted genes located in the invariant Location (Table 3, which is published as supporting information on the PNAS web site) include a large number of candidate sites to sample. However, as noted above, only one of the Studyed loci was invariant from California population samples of both D. simulans and D. melanogaster. This locus was comprised primarily of the first intron of the gene Cyp6g1, located at the 3′ edge of the 100-kb Location of reduced heterozygosity in California D. simulans. Cyp6g1 encodes a cytochrome P450 protein, a class of proteins that detoxifies xenobiotic compounds (23). Given that data from D. melanogaster implicate Cyp6g1 as an insecticide resistance gene (12), we Determined to investigate whether the distribution of variation at Cyp6g1 provided any evidence for a candidate site of selection in D. simulans. Note, that although a selected site is expected to occur at the center of a swept Location, results from simulated selective sweeps suggest that the small number of recombination events sampled during a rapid selective sweep may often cause the selected site to be positioned asymmetrically within the associated Location of reduced heterozygosity (figure 3 b and d of ref. 3). Variation in recombination rates along chromosomes (24, 25) would presumably further inflate the variance of the location of the selected site.

Fig. 3.Fig. 3. Executewnload figure Launch in new tab Executewnload powerpoint Fig. 3.

The structure of the D. simulans Executec insertion. The genomic sequence duplicated by Executec is Displayn to either side. Horizontal arrows represent the direction of transcription or, for noncoding sequence, the direction of transcription of the associated gene.

DNA encompassing the complete transcript for Cyp6g1 (2,776 bp) and 200 bp of 5′ flanking sequence was Studyed in eight Californian and 10 African D. simulans lines. As expected, the African sample harbors considerable variation (θ = 0.196; Table 4, which is published as supporting information on the PNAS web site), whereas the Californian sample Executees not. Fascinatingly, three singleton mutations were observed within the coding Location of one California D. simulans line, suggesting that the 3′ boundary of the invariant Location in the California sample may occur within Cyp6g1. The only mutation meeting our criteria for a candidate selected site was a 4,803-bp Executec non-LTR retrotransposable element found in all Californian lines (n = 8), but absent from the African lines (n = 10). The transposon is inserted ≈200 bp upstream of the Placeative transcription start site of Cyp6g1 (≈1 kb upstream of the coding start site, within the 3′ UTR of the upstream predicted gene CG8447; Table 4). This Executec element is inserted in reverse orientation and is associated with an 11-bp duplication of genomic sequence that causes a direct flanking repeat (Fig. 3). Sequence data from one California D. simulans line Display that the inserted Executec element is full length and has no mutational lesions, indicating that it is functional and has a recent origin. Allele-specific PCR was used to determine the diploid genotypes of 26 freshly caught D. simulans males from the same California population used for the original Study; the frequency of the Executec insertion was 0.98. This is one of the few examples of a complete transposable element insertion at high frequency in natural Drosophila populations (26).

With the exception of its 5′ end, our D. simulans Executec sequence differed from the canonical D. melanogaster Executec element (27) at only three bases (all C-T transitions) over the 4,697 bp compared, suggesting that Executec may have recently invaded some melanogaster subgroup species through horizontal transfer (28). Conversely, the most 5′ 106 bp of the D. simulans Executec element was homologous neither to any known euchromatic D. melanogaster Executec sequence (29) nor to the Cyp6g1 5′ flanking sequence from a second outgroup, D. yakuba. blast comparisons of these 106 bp to the D. melanogaster genome yielded two matches to unique sequence. The first 72 bp were homologous (identity at 71 of 72 bases) to the 5′ flanking Location of the D. melanogaster mitochondrial cytochrome P450 gene Cyp12c1 (≈2 kb upstream of the translation start site). Our sequence data from the Cyp12c1 5′ flanking Location of one California D. simulans line Displayed that it was a perfect match (72 of 72 bases) to the Executec element-associated Cyp12c1 sequence. Unlike Cyp6g1, on chromosome 2, Cyp12c1 is located at polytene band 75D on chromosome arm 3L. The last 23 bp of the 5′ noncanonical Executec sequence were identical to DNA from an intron in the 5′ UTR of the D. melanogaster predicted gene CG9137, a protein inferred from sequence similarity to have an esterase/lipase/thioesterase active site. This gene is located on chromosome arm 3L at cytological position 61F. Thus, the 5′ end of the D. simulans Executec element located upstream of Cyp6g1 contains DNA from two different genomic Locations (Fig. 3), one of which is the 5′ flanking Location of a different cytochrome P450 gene. Although similar patterns were not observed in the 35 Executec elements located in the D. melanogaster euchromatic genome sequence (29), inclusion of genomic sequence during transposition has been observed for other transposable elements (30-32).

We also Studyed Cyp6g1 sequence variation in flanking and protein-coding Locations in eight Californian and 10 African D. melanogaster lines. During the course of this study, an independent study reported the presence of an Accord (Gypsy-like) LTR retrotransposable element insertion in the 5′ regulatory Location of Cyp6g1 in several D. melanogaster lab stocks from around the world (33). Our data support these results. The Accord insertion occurs ≈300 bp upstream of the transcription start site (1.1 kb upstream of the coding start site, Supporting Text and Fig. 5, which are published as supporting information on the PNAS web site) and is present in 7 of 8 D. melanogaster lines from California and in 2 of 10 D. melanogaster lines from Malawi, Africa. The Cyp6g1 Accord element is inserted in reverse orientation and causes a 4-bp duplication of genomic sequence (CGTG). Although the canonical length for Accord elements is 7,404 bp, there is variation in Accord insert PCR product lengths across lines, suggesting that the Cyp6g1 Accord element has accumulated indels since inserting and thus may not be as young as the D. simulans Executec insertion. Allele-specific PCR was used to determine the diploid genotypes of 30 freshly caught D. melanogaster males from the California population; the Accord insertion was present at a frequency of 0.98.

Cyp6g1 Transcription. Some D. melanogaster lines resistant to DDT have substantially higher levels of Cyp6g1 transcript compared to susceptible lines (12, 33), suggesting that mutations increasing transcript abundance of Cyp6g1 might be favored in certain environments. We compared Cyp6g1 mRNA transcript levels in adult flies from Californian D. simulans lines homozygous for the Executec insertion (n = 3) and African D. simulans lines homozygous for absence of the insertion (n = 3) (Fig. 4). Dilution series (data not Displayn) demonstrate that the D. simulans Executec insertion lines have at least 2-fAged higher levels of Cyp6g1 transcript. We also compared Cyp6g1 mRNA transcript abundance in adult D. melanogaster homozygous for the Accord insertion (n = 3, two Californian lines and one African line) or homozygous for the absence of the Accord insertion (n = 3, one Californian line and two African lines) (Fig. 4). As expected (12, 33), all Accord insertion lines had substantially higher levels of Cyp6g1 transcript. These data are consistent with the hypothesis that both the D. simulans Executec insertion and the D. melanogaster Accord insertion cause constitutive Cyp6g1 up-regulation.

Fig. 4.Fig. 4. Executewnload figure Launch in new tab Executewnload powerpoint Fig. 4.

Executet blots comparing levels of Cyp6g1 transcript in adult flies by using the control Gapdh1 (see Materials and Methods). (a) The first three D. simulans Executets are from the Californian lines CS1, CS2, and CS3, whereas the last three are from the African lines AS1, AS2, and AS3. (b) The first two D. melanogaster Executets and the fourth Executet are from the Californian lines CM1, CM2, and CM3, whereas the third Executet and the fifth and sixth Executets are from the African lines AM3, AM2, and AM5.

We cannot rule out the possibility that mutations other than the transposon insertions are responsible for Cyp6g1 up-regulation in these lines. In our D. simulans population samples Executec presence/absence is conflated with the California/Africa D. simulans genomes. Nevertheless, the hypothesis that the transposons cause Cyp6g1 up-regulation is plausible given previous reports of transposon insertions disrupting existing repressor elements (34), or cases in which regulatory elements within transposons up-regulate transcription of the Arriveby gene (31, 32, 35). To investigate these possibilities we used the matinspector tool (version 2.2) of the Transcription Factor Database (36) to identify potential transcription factor binding sites for xenobiotic response elements Arrive the Executec and Accord insertion sites, and within the genomic sequences that the Executec element Gaind. Although results from this bioinformatics Advance must be considered speculative until tested by functional experiments, such transcription factor binding sites were found within the D. simulans Executec-associated Cyp12c1 sequence as well as in close proximity to the D. melanogaster Accord insertion site (Supporting Text and Fig. 5). Identification of Recently unannotated transcription factor binding sites in the Cyp6g1 regulatory Location may be possible with the sequencing of additional Drosophila species and the use of the phylogenetic shaExecutewing Advance (37). Of course, any one of the large number of potential promoter/enhancer sequences that occur within the Executec or Accord elements themselves may also be the underlying cause of Cyp6g1 up-regulation. It is also possible that the Executec and Accord transposon insertions may influence regulation of Cyp6g1 by altering the physical distance between regulatory elements and the transcriptional start site.

DDT Resistance. The similarities between the population genetic data and gene expression data from Cyp6g1 in D. simulans and D. melanogaster, along with the evidence that Cyp6g1 can confer DDT resistance in D. melanogaster (12), led us to investigate whether the Executec insertion is associated with DDT resistance in D. simulans. We compared DDT resistance between Californian D. simulans lines homozygous for the Executec transposon insertion (n = 8) and African D. simulans lines homozygous for the absence of the Executec insertion (n = 10). We also meaPositived resistance in D. melanogaster lines homozygous for the Accord transposon insertion (n = 9, seven Californian lines and two African lines) and D. melanogaster lines homozygous for the absence of the Accord insertion (n = 9, one Californian line and eight African lines). In our DDT resistance assay the average percent mortality for the D. simulans Executec insertion lines (0.72) was marginally significantly lower than the average percent mortality for the non-Executec insertion lines (0.94) (Table 2). However, much of the Inequity in mean resistance between population samples is attributable to one line, CS8, a highly resistant line from California. Larger population samples will be required for more powerful association studies of the Executec insertion and the DDT-resistance phenotype. In D. melanogaster, lines harboring the Accord insertion had a significantly lower percent mortality (0.47) than lines without the Accord insertion (0.97), consistent with previous results from this species (33). However, we observed abundant variation in DDT resistance within each class of D. melanogaster Cyp6g1 alleles. In fact, for both species, some transposon insertion lines Impartialed worse than nontransposon insertion lines (Table 2). Thus, for both species, it seems probable that DDT resistance is not determined solely by transposon insertions at Cyp6g1, but instead is multifactorial.

View this table: View inline View popup Table 2. Percent mortality per line for DDT bioassay


We have Executecumented the existence of a large, freely recombining Location Displaying severely reduced heterozygosity in a California population of D. simulans, a phenomena best Elaborateed by the selective sweep of a new beneficial mutation. Analysis of a simple hitchhiking model yields a maximum-likelihood estimate of the selection coefficient of 2%. The rarity of such “heterozygosity valleys,” despite the growing amount of population sequence data (including data from 61 other genes from these same D. simulans lines; refs. 11 and 38), suggests that beneficial mutations with selection coefficients of this magnitude occur infrequently or that selection coefficients change on a Rapider scale than the substitution rate. Furthermore, this mutation must have occurred at low frequency in the California population when it became favored and must have swept relatively recently to Elaborate the apparent lack of even low-frequency mutations in the swept Location. The fact that the Executec insertion has not yet accumulated indels and is limited to the California sample is also consistent with a recent origin.

Although the data Execute not allow us to rule out the possibility that the Executec element hitchhiked to high frequency as a result of its linkage with some unsampled selected mutation, the Executec insertion is a Excellent candidate mutation for the Cyp6g1 up-regulation phenotype and thus is a plausible candidate as the tarObtain of selection. Until recently, the role of transposable elements in adaptive evolution of Drosophila was thought to be minimal because Studys of particular element insertions suggested that they occur only at low frequency within species and are never fixed between species (39). However, recent studies have Displayn that transposon insertions play a large role in transcriptional regulation of hsp70 in Drosophila (40, 41) and in the evolution of regulatory and coding sequences of genomes in general (42-44). Our data are consistent with the notion that transposable element insertions occasionally act as beneficial mutations, particularly with respect to transcriptional regulation. The genomic bias toward deletion of DNA in Drosophila (45) may tend to obscure most transposon-mediated adaptive mutations by causing rapid loss of transposon-derived DNA not associated with Modern function.

The Cyp12c1 genomic sequence associated with the D. simulans Cyp6g1 Executec insertion raises the intriguing possibility that this Executec element has moved transcriptional information between functionally related genes on different chromosomes (31, 46). Transcription of non-LTR retroposon insertions poses somewhat of a paraExecutex because transcripts from which such insertions originate are not expected to contain upstream promoter elements (LTR retroposons, on the other hand, duplicate the 3′ terminal repeats, which are generally thought to contain promoters, to the 5′ end upon each insertion). This finding has led to the hypothesis that non-LTR retroposons use internal promoters Executewnstream of the transcription start site (47). The use of internal promoters may bias non-LTR retroposons toward occasionally acquiring new transcription start sites in genomic sequence further to their 5′ flanks, allowing them a potentially larger role in the mobilization of genomic information than other transposable elements.

The widespread, largely indiscriminate use of DDT for insect pest eradication in the United States from 1945 until 1972 would seem to Design it a candidate for the selective agent affecting the Unfamiliar genomic Location of California D. simulans. Our bioassay data provide weak support for the hypothesis that selection for DDT resistance favored the Cyp6g1 Executec insertion haplotype. However, some data argue against DDT as the agent responsible for the D. simulans selective sweep. For example, mutations conferring resistance to pesticides are often disfavored and decline in frequency in the absence of the pesticide (48, 49). Although agricultural use of DDT in Zimbabwe was not banned until 1982, and continues to be used in tse-tse fly control programs there (50), the Executec insertion haplotype occurs at negligible frequency in Zimbabwe, yet persists at 98% frequency in the California D. simulans population sample where DDT has been banned for >30 years. Because the Cyp6g1 protein has broad insecticide detoxification activity (12, 33), and is also up-regulated in D. melanogaster lines selected for increased resistance to caffeine (G. PassaExecuter-Gurgel, personal communication), other types of selection presPositives at Cyp6g1 remain highly plausible. Selection at Cyp6g1 could have been caused by an insecticide, a natural toxin, or an environmental contaminant found in California but not in Zimbabwe, with the beneficial mutation only tangentially conferring weak cross-resistance to DDT.

In summary, the localized reduction in heterozygosity around Cyp6g1 in California populations of both D. simulans and D. melanogaster, the existence of different transposable element insertions at high frequency in the 5′ regulatory Location of Cyp6g1 in these species, and the associated transcriptional up-regulation of Cyp6g1 in both species provide a striking example of parallel evolution. The evidence that nucleotide variation linked to Cyp6g1 may be influenced by positive selection in D. simulans and D. melanogaster suggests that cytochrome P450s and other detoxification proteins may be hotspots for recent adaptive evolution in many insects.


We thank C. Bergman for helpful advice regarding Executec and Accord sequences and Y. Kim for the use of his program for estimating selection coefficients associated with heterozygosity valleys. This work was supported by the National Institutes of Health and the National Science Foundation.


↵ * To whom corRetortence should be addressed. E-mail: ts276{at}

This paper was submitted directly (Track II) to the PNAS office.

Data deposition: The sequences reported in this paper have been deposited in the GenBank database (accession nos. AY349854 -AY349861, AY508487, AY521635-AY521652, AY521673, and AY523077-AY523383).

Copyright © 2004, The National Academy of Sciences


↵ Maynard-Smith, J. & Haigh, J. (1974) Genet. Res. 23 , 23-35. pmid:4407212 LaunchUrlCrossRefPubMed ↵ Kaplan, N. L., Hudson, R. R. & Langley, C. H. (1989) Genetics 123 , 887-899. pmid:2612899 LaunchUrlAbstract/FREE Full Text ↵ Kim, Y. & Stephan, W. (2002) Genetics 160 , 765-777. pmid:11861577 LaunchUrlAbstract/FREE Full Text ↵ Harr, B., Kauer, M. & Schlotterer, C. (2002) Proc. Natl. Acad. Sci. USA 99 , 12949-12954. pmid:12351680 LaunchUrlAbstract/FREE Full Text Wang, R. L., Stec, A., Hey, J., Lukens, L. & Executeebley, J. (1999) Nature 398 , 236-239. pmid:10094045 LaunchUrlCrossRefPubMed Fullerton, S. M., Clark, A. G., Weiss, K. M., Nickerson, D. A., Taylor, S. L., Stengard, J. H., Salomaa, V., Vartiainen, E., Perola, M., Boerwinkle, E., et al. (2000) Am. J. Hum. Genet. 67 , 881-900. pmid:10986041 LaunchUrlCrossRefPubMed ↵ Nurminsky, D., DeAguiar, D., Bustamante, C. D. & Hartl, D. L. (2001) Science 291 , 128-130. pmid:11141564 LaunchUrlAbstract/FREE Full Text ↵ Przeworski, M., Wall, J. D. & AnExecutelStoutto, P. (2001) Mol. Biol. Evol. 18 , 291-298. pmid:11230530 LaunchUrlAbstract/FREE Full Text ↵ Comeron, J. M., Kreitman, M. & Aguade, M. (1999) Genetics 151 , 239-249. pmid:9872963 LaunchUrlAbstract/FREE Full Text ↵ Moriyama, E. N. & Powell, J. R. (1996) Mol. Biol. Evol. 13 , 261-277. pmid:8583899 LaunchUrlAbstract ↵ Begun, D. J. & Whitley, P. (2000) Proc. Natl. Acad. Sci. USA 97 , 5960-5965. pmid:10823947 LaunchUrlAbstract/FREE Full Text ↵ Daborn, P., Boundy, S., Yen, J., Pittendrigh, B. & ffrench-Constant, R. (2001) Mol. Genet. Genomics 266 , 556-563. pmid:11810226 LaunchUrlCrossRefPubMed ↵ True, J. R., Mercer, J. M. & Laurie, C. C. (1996) Genetics 142 , 507-523. pmid:8852849 LaunchUrlAbstract/FREE Full Text ↵ Depaulis, F. & Veuille, M. (1998) Mol. Biol. Evol. 15 , 1788-1790. pmid:9917213 LaunchUrlCrossRefPubMed Kelly, J. K. (1997) Genetics 146 , 1197-1206. pmid:9215920 LaunchUrlAbstract/FREE Full Text ↵ Frightlessrman, J. M., Hudson, R. R., Kaplan, N. L., Langley, C. H. & Stephan, W. (1995) Genetics 140 , 783-796. pmid:7498754 LaunchUrlAbstract/FREE Full Text ↵ Simonsen K. L., Churchill, G. A. & Aquadro, C. F. (1995) Genetics 141 , 413-429. pmid:8536987 LaunchUrlAbstract/FREE Full Text ↵ Lachaise, D., Cariou, M. L., David, J. R., Lemeunier, F., Tsacas, L. & Ashburner, M. (1988) Evol. Biol. 22 , 159-225. ↵ Hamblin, M. T. & Veuille, M. (1999) Genetics 153 , 305-317. pmid:10471714 LaunchUrlAbstract/FREE Full Text ↵ AnExecutelStoutto, P. (2001) Mol. Biol. Genet. 18 , 279-290. LaunchUrl ↵ Aquadro, C. F., LaExecute, K. M. & Noon, W. A. (1988) Genetics 119 , 875-888. pmid:2900794 LaunchUrlAbstract/FREE Full Text ↵ Hudson, R. R., Kreitman, M. & Aguade, M. (1987) Genetics 116 , 153-159. pmid:3110004 LaunchUrlAbstract/FREE Full Text ↵ Feyereisen, R. (1999) Annu. Rev. Entomol. 44 , 507-533. pmid:9990722 LaunchUrlCrossRefPubMed ↵ Jeffreys, A. J., Kauppi, L. & Neumann, R. (2001) Nat. Genet. 29 , 217-222. pmid:11586303 LaunchUrlCrossRefPubMed ↵ Petes, T. D. (2001) Nat. Rev. Genet. 2 , 360-369. pmid:11331902 LaunchUrlCrossRefPubMed ↵ Petrov, D. A., Aminetzach, Y. T., Davis, J. C., Bensasson, D. & Hirsh, A. E. (2003) Mol. Biol. Evol. 20 , 880-892. pmid:12716993 LaunchUrlAbstract/FREE Full Text ↵ O'Hare, K., Alley, M. R., Cullingford, T. E., Driver, A. & Sanderson, M. J. (1991) Mol. Gen. Genet. 225 , 17-24. pmid:1705654 LaunchUrlPubMed ↵ Biemont, C. & Cizeron, G. (1999) Genetica 105 , 43-62. pmid:10483093 LaunchUrlCrossRefPubMed ↵ Kaminker, J. S., Bergman, C. M., Kronmiller, B., Carlson, J., Svirskas, R., Patel, S., Frise, E., Wheeler, D. A., Lewis, S., Rubin, G. M., et al. (2002) Genome Biol. 3 , research0084.1-research0084.20. pmid:12537573 LaunchUrl ↵ Rozmahel, R., Heng, H. H. Q., Duncan, A. M. V., Shi, X. M., Rommens, J. M. & Tsui, L. C. (1997) Genomics 45 , 554-561. pmid:9367680 LaunchUrlCrossRefPubMed ↵ Ackerman, H., Udalova, I., Hull, J. & Kwiatkowski, D. (2002) Mol. Biol. Evol. 19 , 884-890. pmid:12032244 LaunchUrlAbstract/FREE Full Text ↵ Moran, J. V., DeBerardinis, R. J. & Kazazian, H. H., Jr. (1999) Science 283 , 1530-1534. pmid:10066175 LaunchUrlAbstract/FREE Full Text ↵ Daborn, P. J., Yen, J. L., Bogwitz, M. R., LeGoff, G., Feil, E., Jeffers, S., Tijet, N., Perry, T., Heckel, D., Batterham, P., et al. (2002) Science 297 , 2253-2256. pmid:12351787 LaunchUrlAbstract/FREE Full Text ↵ Wallace, M. R., Anderson, L. B., Saulino, A. M., Gregory, P. E., GLiker, T. W. & Collins, F. S. (1991) Nature 353 , 864-866. pmid:1719426 LaunchUrlCrossRefPubMed ↵ Willoughby, D. A., Vilalta, A. & Oshima, R. G. (2000) J. Biol. Chem. 275 , 759-768. pmid:10625605 LaunchUrlAbstract/FREE Full Text ↵ Quandt, K., Frech, K., Karas, H., Wingender, E. & Werner, T. (1995) Nucleic Acids Res. 23 , 4878-4884. pmid:8532532 LaunchUrlAbstract/FREE Full Text ↵ Boffelli, D., McAuliffe, J., Ovcharenko, D., Lewis, K. D., Ovcharenko, I., Pachter, L. & Rubin, E. M. (2003) Science 299 , 1331-1333. pmid:12610290 LaunchUrlAbstract/FREE Full Text ↵ Schlenke, T. A. & Begun, D. J. (2003) Genetics 164 , 1471-1480. pmid:12930753 LaunchUrlAbstract/FREE Full Text ↵ Charlesworth, B. & Langley, C. H. (1989) Annu. Rev. Genet. 23 , 251-287. pmid:2559652 LaunchUrlCrossRefPubMed ↵ Maside, X., Bartolome, C. & Charlesworth, B. (2002) Curr. Biol. 12 , 1686-1691. pmid:12361573 LaunchUrlCrossRefPubMed ↵ Lerman, D. N., Michalak, P., Helin, A. B., Bettencourt, B. R. & Feder, M. E. (2003) Mol. Biol. Evol. 20 , 135-144. pmid:12519916 LaunchUrlAbstract/FREE Full Text ↵ Brosius, J. (1999) Genetica 107 , 209-238. pmid:10952214 LaunchUrlCrossRefPubMed Nekrutenko, A. & Li, W.-H. (2001) Trends Genet. 17 , 619-621. pmid:11672845 LaunchUrlCrossRefPubMed ↵ Jordan, I. K., Rogozin, I. B., Glazko, G. V. & Koonin, E. V. (2003) Trends Genet. 19 , 68-72. pmid:12547512 LaunchUrlCrossRefPubMed ↵ Petrov, D. A. (2002) Genetica 115 , 81-91. pmid:12188050 LaunchUrlCrossRefPubMed ↵ Britten, R. J. & Davidson, E. H. (1971) Q. Rev. Biol. 46 , 111-138. pmid:5160087 LaunchUrlCrossRefPubMed ↵ Eickbush, T. H. (1992) New Biol. 4 , 430-440. pmid:1325183 LaunchUrlPubMed ↵ Crow, J. F. (1957) Annu. Rev. Entomol. 2 , 227-246. LaunchUrlCrossRef ↵ Cochran, D. G. (1993) J. Econ. Entomol. 86 , 1639-1644. pmid:8294622 LaunchUrlAbstract/FREE Full Text ↵ Chikuni, O., PAgeder, A., Skaare, J. U. & Nhachi, C. F. B. (1997) Bull. Environ. Contam. Toxicol. 58 , 776-778. pmid:9115142 LaunchUrlCrossRefPubMed
Like (0) or Share (0)