Abstract
The delineation of species among strains assigned to Debaryomyces hansenii was examined using a gene genealogies-based approach in order to compare spliceosomal intron sequences found in four housekeeping genes (ACT1, TUB2, RPL31 and RPL33). This revealed four distinct groups of strains containing, respectively, D. hansenii var. hansenii CBS 767T, D. hansenii var. fabryi CBS 789T, Candida famata var. flareri CBS 1796T (the anamorph of D. hansenii var. fabryi CBS 789T) and Debaryomyces tyrocola CBS 766T, whose species status was no longer accepted. The sequence divergence between these groups, reaching in some cases over 20 %, unambiguously isolated the groups as separate taxa, leading to a proposal for the reinstatement of the originally described species D. hansenii CBS 767T and D. tyrocola CBS 766T. The variety D. hansenii var. fabryi was further subdivided into two taxa, Debaryomyces fabryi CBS 789T and Candida flareri CBS 1796T (previously C. famata var. flareri and Blastodendrion flareri). The comparison of intron sequences therefore exposed cryptic species whose phenotypic traits are not distinguishable from known species, but which have significantly diverged from the genetic point of view. Hence, we describe the new taxon Debaryomyces macquariensis sp. nov. CBS 5571T is related to, but clearly distinct from, the Debaryomyces species mentioned above. The approach used in this work has also revealed the existence of populations within the newly delineated species D. hansenii and genetic exchanges between these populations, indicating an unexpected genetic diversity within this part of the genus Debaryomyces.
- BSR, biological species recognition
- GCPSR, genealogical concordance phylogenetic species recognition
- LGT, lateral gene transfer
- PS, phylogenetic species
- PSR, phylogenetic species recognition
-
The GenBank/EMBL/DDBJ accession numbers for sequences described in this article are: for the ACT1 introns, AM990068–AM990117; for the RPL31 introns, AM990118–AM990167; for the RPL33 introns, AM990543–AM990594; for the TUB2 introns, AM990595–AM990641, FM165416 for the CBS 7761 ACT1 coding gene and FM165417 for the CBS 5572 ACT1 coding gene. The MycoBank () accession number for Debaryomyces macquariensis sp. nov. is MB 512270.
-
Supplementary tables showing the primers used in this study and the sequence divergence within the ACT1 gene intron between clades and species are available with the online version of this paper.
INTRODUCTION
The definition of species among micro-organisms has become increasingly difficult despite the accumulating amount of sequence data. In bacteria, high levels of lateral gene transfer (LGT) and highly variable gene content have blurred the delimitation of species. Hemiascomycetous yeasts constitute a good model for the study of speciation and the definition of species: many species have a sexual cycle, LGT is rare, and gene content is somewhat similar in the species studied so far (Dujon et al., 2004; Hall et al., 2005). The phenotypic approach based on morphological and physiological identification has now been abandoned as in other micro-organisms. It is generally accepted that the biological species recognition concept (BSR) can be used for yeasts species displaying sexuality, although gene genealogies have shown that the BSR could be strongly questioned, at least in the genus Saccharomyces (Liti et al., 2006).
Unlike in euascomycetes, for which a phylogenetic definition of species, phylogenetic species recognition or PSR (Taylor et al., 2000), prevails, the delineation of hemiascomycetous yeasts species has recently been based, in many cases, on the use of rDNA sequence comparison of type strains. A first attempt at using nuclear sequences for this purpose revealed that diversity was much more pronounced than previously thought (Daniel et al., 2001; Daniel & Meyer, 2003). Multigene approaches combining rDNA, housekeeping genes and mitochondrial cytochrome-c oxidase subunit II (COX2) gene sequences have been developed (Kurtzman, 2003; Kurtzman & Robnett, 2003). Although successful, these approaches still rely on the evolution of strongly conserved sequences and may not reflect the real genome sequence evolution (Tsui et al., 2008; Wu et al., 2008). Debaryomyces hansenii is a hemiascomycetous yeast which can be found in a variety of natural habitats as well as fruits and various manufactured foodstuff, especially cheese, in which it participates in the maturation of the products, or as a contaminant. In addition, D. hansenii has also been implicated as a potential emerging pathogen under this name or under that of its anamorphic form Candida famata (see Jacques & Casaregola, 2008).
The taxonomic classification of the species related to D. hansenii has always been subject to debate. Originally two species were described: D. hansenii (Zopf) Lodder et Kreger van Rij and Debaryomyces fabryi Ota. It was proposed to subdivide the species D. hansenii (Zopf) Lodder et Kreger van Rij into two varieties D. hansenii var. hansenii and D. hansenii var. fabryi on the basis of the electrophoretic patterns of glucose-6-phosphate dehydrogenase and on the maximum growth temperature (Nakase & Suzuki, 1985). It was further proposed that the two species D. hansenii (Zopf) Lodder et Kreger van Rij and D. fabryi Ota should be reinstated on the basis of random amplification of polymorphic DNA (RAPD) profiles (Prillinger et al., 1999). The classification of D. hansenii varieties was also questioned using various specific hybridization probes (Corredor et al., 2000). A low degree (0.7 %) of sequence divergence within the actin 1 (ACT1) gene was observed between the sequences of the two varieties, increasing the uncertainty concerning the differentiation of the two taxa (Daniel & Meyer, 2003). PCR-RFLP (restriction fragment length polymorphism) analysis of the rDNA intergenic spacer (IGS) of Debaryomyces species could differentiate both varieties and led to their separation in two taxa, questioning their relationship (Quiros et al., 2006; Romero et al., 2005). On the other hand, rDNA internal transcribed spacer (ITS) variations could not differentiate the two varieties (Martorell et al., 2005) and a recent multi-locus analysis concluded that the observed divergence reflected intra-specific variability (Tsui et al., 2008).
Most of the D. hansenii isolates are thought to be haploid and to diploidize transiently by somatogamous autogamy to form asci containing generally a single spore (Kreger van Rij & Veenhuis, 1975; van der Walt et al., 1977). Inter-strain mating was rare as less than 0.6 % of crosses between individual strains yielded progeny (van der Walt et al., 1977). As little is known concerning their sexuality and their taxonomic classification is confusing, D. hansenii and its related species constitute a good model to test new approaches for species delineation and to explore species complexes. Here, we compared intron sequences in a gene genealogies-based approach on 50 strains, to propose a new delineation of the species within the D. hansenii species complex.
METHODS
Strains and growth conditions.
Most of the yeast isolates used in this study are from the CIRM-Levures (previously CLIB; ) and from the Centraalbureau voor Schimmelcultures () and are listed in Table 1⇓. The strains L0702 and L0455 were isolated from wine fermentation from the Bordeaux area (France) and were kindly provided by Professor A. Lonvaud. Cells were routinely grown in YPD medium (1 % yeast extract, 1 % peptone, 1 % glucose) at 28 °C with shaking.
Debaryomyces strains used in this study and their characteristics
nd, Not determined.
DNA extraction.
Cells grown in 3 ml YPD medium overnight at 28 °C were centrifuged at 2500 g for 3 min, and the resulting pellet was washed in 750 μl EDTA 50 mM. The resulting pellet was resuspended in 200 μl lysis buffer (1 % SDS, 2 % triton, 100 mM NaCl, 50 mM EDTA, 50 mM Tris, pH 8), 200 μl chloroform phenol (pH=8) and 300 mg glass beads. The suspension was supplemented with 200 μ1 TE buffer and centrifuged for 5 min at 12 000 g. The aqueous portion was transferred to a new tube, and two chloroform extractions were carried out. DNA was precipitated with 1 vol. 2-propanol and centrifuged for 4 min at 12 000 g. The pellet was rinsed with 400 μl 70 % ethanol, dried at room temperature for 15 min, resuspended in 50 μl TE buffer, then incubated with 2 μl RNase (10 mg ml−1) for 30 min at 37 °C. The DNA concentration was quantified on agarose gel.
PCR amplification.
The oligonucleotide primers used in this study (see Supplementary Table S1, available in IJSEM Online) were designed with Primer3 () on the Release 2 of the sequence of the strain CBS 767T (; unpublished). A 2 μl quantity of DNA (containing between 25 and 50 ng) was added to 48 μl PCR reaction mix containing 0.8 mM dNTP mixture, 0.1 μM forward and reverse primers in the recommended buffer and 1 U TaKaRa Ex Taq. Reactions were run on a 2720 thermal cycler (Applied Biosystems) as follows: 5 min at 94 °C followed by 30 cycles of 30 s at 94 °C, 40 s at temperatures varying between 48 and 55 °C and between 30 and 60 s at 72 °C, with a final extension step of 7 min at 72 °C.
Cloning.
PCR products were purified using NucleoSpin extract II purification kit (Macherey-Nagel). They were cloned in pBluescript SK− or in pGEM T Easy (Promega) using T4 DNA ligase (Promega). DH10B Escherichia coli cells were transformed by electroporation with pBluescript SK− recombinant plasmids and JM109 E. coli cells were transformed with pGEM T recombinant plasmids. Cells were selected for their ability to grow on ampicillin. Positive recombinant clones were chosen after PCR screening for the presence of insert and restriction analysis. Inserts were then sequenced using universal primers or the primers that served for the PCR amplification.
DNA sequence determination and phylogenetic analysis.
PCR fragments were sequenced on both strands by Genomex using primers that served for the PCR amplification. Sequences were assembled with the phred/phrap/consed package. Sequences were analysed with the GCG Wisconsin package and various programs in the GCG environment, including blast and fasta. Sequence alignments were generated by using clustal_x and were manually adjusted with GeneDoc (). Phylogenetic trees were reconstructed with the neighbour-joining program implemented in clustal_x. Phylogenetic trees were visualized with TreeView, version 1.6.5 (Page, 1996) and NJPlot (Perriere & Gouy, 1996).
RESULTS
Rationale
Liti et al. (2006) found that, in Saccharomyces cerevisiae, sequence divergence of above 5 % for a number of genes (yKu70, yKu80, NEJ1, EST2, TLC1 and SPT4) dramatically reduces the viability of spores produced after mating between divergent strains. As natural D. hansenii isolates rarely mate together and no selection is available, we analysed their sequence divergence and we used the threshold of ∼10 % sequence divergence to place isolates into separate groups that we called phylogenetic species (PS).
Spliceosomal introns are much less subject to selective pressure than are coding sequences and we chose to study the sequence of introns carried by four well-conserved housekeeping genes: ACT1 (Génolevures genetic element DEHA2D05412g) encoding actin, TUB2 (DEHA2D08800g) encoding beta-tubulin, RPL31 (DEHA2D11110g) and RPL33 (DEHA2C16038g) encoding ribosomal proteins. The latter two genes are present in a single copy in D. hansenii var. hansenii CBS 767T, in contrast to the situation in S. cerevisiae, where they are duplicated. These genes were chosen because the strong conservation of their coding sequence results in very little variation in the sequence used for PCR priming on either side of the intron. To PCR-amplify the introns of the ACT1 gene, we used two primers designed by Meyer's group (Daniel & Meyer, 2003), CA11 and CA2R. In addition, we designed other specific ACT1 primers. To PCR-amplify the RPL31, RPL33 and TUB2 introns, we also designed specific primers, according to the complete genome sequence of D. hansenii var. hansenii CBS 767T (Dujon et al., 2004; ). The RPL31 and RPL33 genes were chosen because ribosomal protein coding genes carry generally longer introns than non-ribosomal protein coding genes (Bon et al., 2003).
Identification of phylogenetic species
We amplified the intron-carrying regions of the four genes ACT1, TUB2, RPL31 and RPL33 in 50 isolates and determined the sequences of the PCR products that were generated. The isolates analysed came mainly from food, in particular cheese from various countries. Some were newly isolated yeasts from contaminating biofilms in refrigerators, oenological strains from the Bordeaux and the Colmar regions, and isolates from various other sources: soil, human wound and sea water (Table 1⇑). We compared the sequences and constructed gene trees using the neighbour-joining algorithm. Figs 1⇓ and 2⇓ show the trees obtained for the ACT1 and RPL33 genes, respectively. The ACT1 intron sequence alignment contained 121 informative characters out of 300, TUB2 72 out of 271, RPL31 140 out of 355 and RPL33 155 out of 384. In order to be used for species delineation, PSR must be based on concordance of gene genealogies (coined genealogical concordance phylogenetic species recognition or GCPSR, Taylor et al., 2000). Phylogenetic trees showed a good concordance between the four genes. With the exception of a few cases (discussed below), the distribution of the isolates in the various trees was identical.
Neighbour-joining phylogram of the intron of the ACT1 gene of Debaryomyces isolates. Bootstrap values (%) based on 1000 replicates are indicated at the nodes for main groups and clades. All positions containing gaps and missing data were eliminated from the dataset. Type strains are in bold characters. Groups and clades are indicated. The branch length of D. prosopidis CBS 8450T is shortened. Bar, 0.02 substitutions per site.
Neighbour-joining phylogram of the intron of the RPL33 gene of Debaryomyces isolates. Bootstrap values (%) based on 1000 replicates are indicated at the nodes for main groups and clades. All positions containing gaps and missing data were eliminated from the dataset. Type strains are in bold characters. Groups and clades are indicated. The branch length of D. prosopidis CBS 8450T is shortened. Bar, 0.02 substitutions per site.
The gene trees suggest the existence of four phylogenetic species; the first contained the type-strain of D. hansenii var. fabryi CBS 789T (PS 1) and the second contained its anamorph C. famata var. flareri CBS 1796T (PS 2). The sequence divergence between these two phylogenetic species ranged from 12 to 19 %. According to our criterion, this indicates that these two taxa form distinct species. The divergence between the rDNA sequences of the teleomorph D. hansenii var. fabryi CBS 789T and its anamorph C. famata var. flareri CBS 1796T amounts to two bases in the D1/D2 region and three bases in the ITS region. Divergence of the ACT1-coding gene sequence between these type strains amounted to only 2.4 % (Daniel & Meyer, 2003), indicating that sequences such as introns are more discriminating than rDNA or coding sequence of housekeeping genes. The sequence divergence between the ACT1-coding sequence and the cognate intronic sequence for the type-strains or a representative strain of all the defined groups is summarized in Supplementary Table S2, available in IJSEM Online.
The third phylogenetic species (PS 3) is more complex and contains D. hansenii var. hansenii CBS 767T and its anamorph C. famata var. famata CBS 1795T. The sequence divergence observed between the strains in this group was less than 1.5, 3, 5 and 5.6 % for the TUB2, ACT1, RPL31 and RPL33 introns, respectively. This indicates that, according to our criterion, although the strains in PS 3 display a certain level of genetic diversity, they are conspecific. Hence, in contrast to the situation of the PS 1 and PS 2 where the teleomorph/anamorph pair D. hansenii var. fabryi/C. famata var. flareri are in fact in distinct taxa, PS 3 contains the teleomorph/anamorph pair D. hansenii var. hansenii/C. famata var. famata.
The strains CBS 776, CLIB 660 and CLIB 949 display marked sequence divergence of 4, 5, 8 and 14 % for the TUB2, ACT1, RPL31 and RPL33 introns, respectively, when compared with the D. hansenii var. hansenii CBS 767T reference sequence. This suggests that these strains have undergone a speciation process and that they constitute a distinct species from PS 3 and PS 4 (Figs 1⇑ and 2⇑).
We included in our analysis the Debaryomyces prosopidis type strain CBS 8450T. D. prosopidis cells have been isolated from exudates of mesquite trees. This species was shown to be related to the two varieties of D. hansenii on the basis of phenotypic characteristics, but was considered to be a separate species on the basis of DNA/DNA reassociation and because of its lack of growth on cellobiose (Phaff et al., 1998). Considering the large sequence divergence with respect to D. hansenii var. fabryi CBS 789T and its anamorph C. famata var. flareri CBS 1796T, its two closest relatives, we confirm here that D. prosopidis forms a distinct species (Supplementary Table S2, available in IJSEM Online, Figs 1⇑ and 2⇑). Three gene intron genealogies out of four (TUB2, RPL31 and ACT1) indicate that it has evolved from PS 1 after the divergence of PS 1/PS 2, the other genealogy (RPL33) placing D. prosopidis CBS 8450T closer to the PS 2 branch.
PS 3 constitutes a complex group of strains, some being diploid heterozygotes for the tested markers
The sensitivity of our method, due to the high intrinsic variability of intron sequences, led to the separation of strains of group 3 into three clades (Figs 1⇑ and 2⇑). Clade 1 contained the majority of the isolates, including the type-strain D. hansenii var. hansenii CBS 767T. This clade is made up of food strains, oenological strains and refrigerator contaminants. All the strains in clade 2 (CBS 1121, CBS 1099, CLIB 594, CLIB 617 and CLIB 626) were isolated from food, mainly cheese. Clade 3 contains C. famata var. famata CBS 1795T, CBS 5139 (previously identified as D. hansenii var. fabryi on the basis of rDNA comparison), CBS 7848, several cheese strains, CLIB 613 and CLIB 240, the latter having been differentiated from CLIB 236 (clade 1) using mitochondrial DNA RFLP and RAPD in a previous study (Romano et al., 1996).
Overall, the assignment of strains to phylogenetic species and clades proved robust. The first exception concerns CLIB 662, whose ACT1 and TUB2 genealogies associate the strain to clade 1 and whose RPL31 genealogy places it in the clade 2, indicating that CLIB 662 contains genetic material from two different ancestors. This was observed for CLIB 617 also, whose TUB2 and RPL31 genealogies placed it in clade 1, whereas that with ACT1 and RPL33 placed it in clade 2. The second exception concerns strains yielding PCR products whose sequences could not be obtained, suggesting that mixed populations of PCR products were generated. The PCR products that could not be sequenced were cloned and the inserts of a number of individual clones were sequenced. Except for CBS 1099 and CBS 7848, we found in these strains two different sequences, each corresponding to one of the three described clades, indicating that the strains were diploid heterozygote for the tested markers. One of the RPL31 alleles of CBS 1099 could not be assigned to one of the PS 3 clades and defined therefore another specific clade (Table 2⇓). Moreover, CBS 7848 contained more than two RPL33 alleles, one was assigned to clade 3 and the two others were not characterized further. Table 2⇓ summarizes all the allele combinations. The Japanese strain CBS 7848 (also known as TK) shared TUB2 alleles from clade 2 and from an undescribed new clade (data not shown). The latter allele differed markedly from all the strains described, having 10 % divergence with the clade 2 allele of CBS 7848, 9 % divergence with CBS 767T and 6 % divergence with CBS 766. Considering that the TUB2 intron is the less discriminating marker used in this study, CBS 7848 is very likely a hybrid between a D. hansenii var. hansenii clade 2 strain and an as-yet undescribed species.
Assignment of the sequenced alleles of various strains to D. hansenii clades
nd, Not determined.
Taken together, these results indicate that a number of strains of PS 3 are at least partially diploid and show that genetic exchanges are more common than expected. No such exchange was seen in isolates from PS 1 and PS 2.
Origin of strains and phenotypic differentiation
With the exception of PS 2, no correlation was seen between our species delimitation and the origin of the isolates. For the isolates of PS 2, all the strains are isolated from human and not from manufactured food (one strain was isolated from an apple). All the other phylogenetic species and clades contain strains that have been isolated from a variety of sources (human, food, natural habitats…). This observation is in agreement with the ubiquitous nature of D. hansenii.
The maximal growth temperature character has previously been used to differentiate the two D. hansenii varieties hansenii and fabryi (Nakase & Suzuki, 1985). We found that this character is unsuitable for this purpose, as a number of the isolates of the PS 1 cannot tolerate higher temperatures than 30 °C, whereas strains of PS 3 can grow at 35 °C or even at 37 °C (Table 1⇑). Interestingly, all the isolates from the PS 2 can grow at 37 °C or above, consistent with the fact that all but one were isolated from human sources.
Re-identification of known isolates and detection of a cryptic species
The gene sequence comparison of isolates from collections revealed mis-identifications (Table 1⇑). Strain CBS 7761, identified as D. hansenii var. fabryi (), was found to be closely related to Debaryomyces nepalensis CBS 5921T with only three base changes out of 310 bp of the ACT1 intron. This was confirmed with the ACT1 coding gene sequence comparison which showed only one mismatch out of 790 bp between CBS 7761 and D. nepalensis CBS 5921T. This identifies CBS 7761 as D. nepalensis.
The ACT1 gene intron of CBS 5572 (previously identified as D. hansenii var. fabryi on the basis of rDNA sequence comparison) diverged from all known Debaryomyces species (over 16 % divergence with the closest species), even those which are closely related to D. hansenii proper. The ACT1 coding gene of CBS 5572 showed a 7 % sequence divergence when compared with that of CBS 767T (Fig. 3⇓). We propose that CBS 5572 therefore represents a new cryptic species. This strain, isolated by di Menna (Skerman & Singleton, 1964), was originally identified as Debaryomyces kloeckeri, a species which is heterogeneous and is no longer valid (Nakase et al., 1998).
Neighbour-joining phylogram of the ACT1 coding gene sequence of Debaryomyces type strains. Bootstrap values (%) based on 1000 replicates are indicated at the nodes. Most of the DNA sequences were retrieved from GenBank. Bar, 0.01 substitutions per site.
Latin diagnosis of Debaryomyces macquariensis N. Jacques, S. Mallet et S. Casaregola sp. nov.
In medio liquido YPD post dies 3 ad 25 °C cellulae rondae, 8–9 μm, singulae. Per gemmationem unipolarem reproducentes, hyphae absunt. Ascosporae globosae, 1–2 in asco, ex ascis non liberantur.
Glucosum fermentatur (tarde).
d-Glucosum, d-galactosum, d-ribosum (lente, varium); d-xylosum, l-rhamnosum (lente), sucrosum, maltosum, trehalosum, α-methyl-d-glucosidum, salicinum, arbutinum, lactosum, raffinosum, melezitosum, inulinum, amylum, glycerolum, erythritolum, ribitolum, xylitolum, l-arabinitolum, d-glucitolum, d-mannitolum, d-glucono-1,5-lactonum (lente), 2-keto-d-gluconatum, d-gluconatum, dl-lactatum (lente), acidum succinicum, acidum citricum, propane 1,2 diolum (lente), ethanolum assimilantur, at non l-sorbosum, d-glucosaminum, l-arabinosum, d-arabinosum, melibiosum, galactitolum, myo-inositolum, d-glucuronatum, d-galacturonatum, butane 2,3 diolum, acidum quinicum, d-glucaratum, d-galactonatum nec methanolum.
Ethylaminum hydrochloricum, cadaverinum dihydrochloricum assimilantur at non natrium nitrosum, kalium nitricum, l-lysinum (lente, variabile), creatinum, creatininum, glucosaminum, imidazolum nec d-tryptophanum. Vitamina externa crescentia non sunt necessaria. In agaro extracto fermenti confecto 50 partes glucose per centum non crescit. Parte una cycloheximidi per mille et per centum non crescit.
Maxima temperature crescentiae: 37 °C. Materia amyloidea iodophila, acidum aceticum non formantur. Diazonium caeruleum B non respondens. Ureum non hydrolysatur. Typus stirps CBS 5572T.
Description of Debaryomyces macquariensis N. Jacques, S. Mallet & S. Casaregola sp. nov.
Debaryomyces macquariensis (mac.qua.ri.en′sis. N.L. fem. adj. macquariensis refers to Macquarie Island in Australia, from where the type strain was isolated).
After 3 days at 25 °C in YPD, the cells are round, 8–9 μm and occur singly. Vegetative reproduction is by unilateral budding. No hyphal elements are produced on cornmeal agar. Spherical asci containing one to two ascospores are formed after incubation for 5 days on Mac Clary agar medium.
Glucose fermentation is weak and delayed.
d-Glucose, d-galactose, d-ribose (weak and delayed); d-xylose, l-rhamnose (delayed), sucrose, maltose, trehalose, methyl-α-d-glucoside, salicin, arbutin, lactose, raffinose, melezitose, inulin, starch, glycerol, erythritol, ribitol, xylitol, l-arabinitol, d-glucitol, d-mannitol, d-glucono-1,5-lactone (delayed), 2-keto-d-gluconate, d-gluconate, dl-lactate (delayed), succinate, citrate, propane 1,2 diol (delayed) and ethanol are assimilated. l-Sorbose, d-glucosamine, l- arabinose, d-arabinose, melibiose, galactitol, myo-inositol, d-glucuronate, d-galacturonate, butane 2,3 diol, quinic acid, d-glucarate, d-galactonate and methanol are not assimilated.
Ethylamine and cadaverine hydrochloride are assimilated; potassium nitrate, sodium nitrite, creatine, creatinine, glucosamine, imidazole and d-tryptophan are not assimilated; l-lysine is weak or delayed or negative. Growth in vitamin-free medium, 50 % glucose, 0.01 % and 0.1 % cycloheximide is negative.
Maximum growth temperature is 37 °C. Starch-like compounds and acid acetic are not produced. Diazonium blue B reaction is negative. Urease activity is negative.
The type strain, CBS 5572T, was isolated from soil in Macquarie Island in Australia. This strain has been deposited in the Centraalbureau voor Schimmelcultures (CBS) during December 1964.
Further taxonomic considerations
On the basis of RAPD-PCR, Prillinger et al. (1999) considered that the two varieties D. hansenii var. hansenii and D. hansenii var. fabryi should be reinstated as two distinct species. Daniel & Meyer (2003) observed divergence within the ACT1 gene coding sequence, indicating that the D. hansenii var. hansenii/D. hansenii var. fabryi and D. hansenii var. fabryi/C. famata var. flareri pairs display an important intra-specific variability. These authors proposed that the latter pair should be split into two varieties. Our results confirm and even extend these authors' proposals. We show here, using multi-gene genealogies of intron sequences on a large number of isolates, greater than 19 % divergence between D. hansenii var. hansenii and the teleomorph/anamorph D. hansenii var. fabryi/C. famata var. flareri pair. We also show that the group of strains containing the teleomorph D. hansenii var. fabryi and the anamorph C. famata var. flareri, respectively, differ from each other by at least 14 %. A parallel can be drawn with the Saccharomyces sensu stricto species in which the Saccharomyces cerevisiae/Saccharomyces bayanus pair sequences diverge by at least 20 % and the S. cerevisiae/S. paradoxus pair sequences diverge by at least 10 %. We therefore propose in agreement with Prillinger et al. (1999) to abandon the variety classification for D. hansenii and to reinstate these taxa as two distinct species.
Considering the divergence between the teleomorph D. hansenii var. fabryi and anamorph C. famata var. flareri, we further propose to reinstate C. famata var. flareri as a distinct species under the name Candida flareri.
Candida flareri (Ciferri & Redaelli) Jacques, Mallet & Casaregola.
Basionym: Blastodendrion flareri Ciferri & Redaelli. Arch Mikrobiol 6 (1935) 9.
Typus: CBS 1796.
As mentioned above, the strains CBS 766, CLIB 660 and CLIB 949 differ markedly from strains of PS 3, with sequence divergence up to 14 %. According to our criterion, these strains should constitute a new taxon. Since the first strain to be described in this taxon was CBS 766T, as the type strain of the Debaryomyces tyrocola species, we propose that this species be reinstated.
DISCUSSION
The primary aim of our study was to test whether we could apply a phylogenetic approach to recognized species within a hemiascomycetous yeast species complex that cannot be resolved by existing traditional and molecular methods. We have studied the D. hansenii species complex using GCPSR by sequencing and comparing intron sequences of four genes of 50 strains of the D. hansenii complex. This led to the delineation of phylogenetic species and concordant results for nearly all the markers tested were obtained. Indeed, we observed a lack of concordance in only a few cases for which the isolates very likely resulted from crosses between the isolates from two distinct clades within a phylogenetic species. The divergence between intron sequences that we observed when PS 1, PS 2 and PS 3 were compared is well above the sequence divergence (∼10 %) that abolishes totally viability of spores in crosses of S. cerevisiae (Liti et al., 2006) and that we have chosen as a threshold for species delimitation. We therefore propose, in agreement with a number of authors (Prillinger et al., 1999; Corredor et al., 2000; Romero et al., 2005; Quiros et al., 2006), that the two D. hansenii varieties constitute now the species Debaryomyces hansenii and Debaryomyces fabryi. Considering the heterogeneity within the previously defined D. hansenii var. fabryi variety, we also propose that a number of strains therein should form the new species Candida flareri. The variability of the sequences analysed here allowed us to distinguish clearly three clades within the newly defined species D. hansenii, one which contains C. famata var. famata. The maximal sequence divergence between these clades reached 5.6 %, indicating that the strains belonging to these clades are conspecific. The existence of strains carrying heterozygote alleles defining these clades supports our proposition (see below).
Overall, the RPL33 gene intron proved to be the most divergent, whereas the TUB2 sequence showed the least divergence; this could reflect the different nature of the two sequences, the former being the longest intron used in our study (418 bp in CBS 767T), the latter containing two short introns separated by 36 bp coding sequence (299 bp with 118 bp and 111 bp for intron 1 and 2, respectively). The TUB2 gene was chosen as it constitutes a valuable marker in the taxonomy of filamentous fungi (Geiser et al., 1998), but our results suggest it may not be as useful for hemiascomycetous yeasts.
We demonstrated for the first time natural diploidy in D. hansenii for a number of markers, crosses having only been performed with auxotrophic mutants (van der Walt et al., 1977). Although yeasts are prone to gene duplication and diploidization, the presence of heterozygous markers in some strains by gene duplication can be ruled out, as allele sequences are 100 % identical to the sequences of strains of the various clades. The most parsimonious explanation is that the diploid heterozygote strains result from recent crosses, which have been followed by the massive loss of one specific haplotype (Sipiczki, 2008) or/and gene conversions. The six detected diploid heterozygotes may be partial diploids, as often only one allele of the analysed genes is present (Table 2⇑), consistent with a massive gene loss associated to gene conversion. Interestingly, no such heterozygosity was seen in the other newly proposed species D. fabryi, C. flareri and D. tyrocola (PS 1, PS 2 and PS 4). Heterozygotes were limited to D. hansenii (PS 3). This may be due to the small number of isolates available in the other three taxa. Many strains from PS 3 (Table 1⇑) are involved in fermentation and could be prone to diploidy. It is indeed well known that industrial isolates have a complex, very often diploid, polyploid or even aneupolyploid genome (Bidenne et al., 1992; Kielland-Brandt et al., 1995; Masneuf et al., 1998).
The six isolates presenting some level of diploidy carry sequences from clade 3, suggesting that this clade is more prone to mating. This is paradoxical as this clade contains C. famata var. famata, which does not have a sexual state. Our observation is also surprising, as C. famata var. famata has been associated in the past with a number of clinical cases (Krcmery & Barnes, 2002; Pfaller & Diekema, 2002), whereas the six isolates have been isolated from food. Clinical isolate identification should be reassessed with gene genealogies in order to better understand their relation to the isolates of the phylogenetic group studied here as it has been suspected that species authentication of clinical isolates may be questionable (see Hazen, 1995; Jacques & Casaregola, 2008).
Our approach was successful in identifying cryptic species, in detecting populations within a single species, and also in bringing to light speciation events. It has been proposed that speciation may be linked to major changes in the genome structure, such as unequal translocation that would lead to genetic isolation. This was ruled out by genomic analysis in Saccharomyces sensu stricto (Fischer et al., 2000). The cryptic species studied here, in particular the closely related PS 3 and PS 4, may prove a good material to study speciation in a different model from that of S. cerevisiae, as it was shown that D. hansenii displays one of the highest rates of chromosomal rearrangements among hemiascomycetous yeasts (Fischer et al., 2006). In this respect, the case of the species D. prosopidis is noteworthy. Our study suggests that D. prosopidis has evolved from D. fabryi after the divergence of D. fabryi/C. flareri. The long branch of D. prosopidis may indicate an interesting case of rapid evolution. The substrate on which D. prosopidis (Sonoran mesquite) was isolated and its geographical location of isolation may suggest that we are observing a case of allopatric speciation.
In yeasts, the use of a gene genealogies-based approach on populations for species delineation is expanding (Kuehne et al., 2007; Jacobsen et al., 2008). Crosses between wild Saccharomyces paradoxus isolates from geographically distinct populations tended to produce lower spore viability (Kuehne et al., 2007), in agreement with Liti et al. (2006), strengthening our species delimitation of yeasts with uncharacterized sexuality using similar sequences. Here, we were able to detect cryptic species and very likely interbreeding within the D. hansenii species complex. We believe that the combination of the use of discriminating sequences such as introns and horizontal analysis of a large number of isolates will help in the future to better define the boundaries of yeast species and to reveal cryptic species.
Acknowledgments
We are grateful to Professor C. R. Tinsley (AgroParisTech, Thiverval-Grignon, France) and Professor J. Dupont (Muséum National d'Histoire Naturelle, Paris, France) for reading the manuscript and for helpful discussion. We are grateful to Professor A. Lonvaud (Université Victor Segalen Bordeaux 2, Bordeaux, France) and Dr J.-L. Legras (INRA Colmar, Colmar, France) for providing us with isolates. This work was supported by INRA and CNRS.