Molecular taxonomy of Dunaliella (Chlorophyceae), with a special focus on D. salina: ITS2 sequences revisited with an extensive geographical sampling

We used an ITS2 primary and secondary structure and Compensatory Base Changes (CBCs) analyses on new French and Spanish Dunallela salina strains to investigate their phylogenetic position and taxonomic status within the genus Dunaliella. Our analyses show a great diversity within D. salina (with only some clades not statistically supported) and reveal considerable genetic diversity and structure within Dunaliella, although the CBC analysis did not bolster the existence of different biological groups within this taxon. The ITS2 sequences of the new Spanish and French D. salina strains were very similar except for two of them: ITC5105 "Janubio" from Spain and ITC5119 from France. Although the Spanish one had a unique ITS2 sequence profile and the phylogenetic tree indicates that this strain can represent a new species, this hypothesis was not confirmed by CBCs, and clarification of its taxonomic status requires further investigation with new data. Overall, the use of CBCs to define species boundaries within Dunaliella was not conclusive in some cases, and the ITS2 region does not contain a geographical signal overall.


Background
The Internal Transcribed Spacer 2 (ITS2) of the nuclear rDNA cistron is one of the most frequently used regions for phylogenetic analysis in algae [1][2][3]. Although its application in deep taxonomic levels was initially limited to comparisons of genera within the same family owing to uncertainties in alignment at higher taxonomic levels, the analysis of its secondary structure has provided key solutions to this problem [4]. Thus, the use of an ITS2 secondary structure improves sequence alignments, resulting in a higher robustness and accuracy of phylogenetic reconstructions [5] and providing help to distinguish species [6]. Furthermore, an automatic approach to analysis is possible [7], as a pipeline consisting of the ITS2 Database (annotation/structure prediction), 4SALE (alignment), ProfDistS (inferring phylogenies) and the CBCAnalyzer (distinguishing species) have recently become available (http://its2.bioapps.biozentrum.uniwuerzburg.de/?about).
In Dunaliella (Chlorophyceae), the use of ITS2 secondary structure for phylogenetic analysis has a long tradition [8][9][10][11][12][13]. The genus Dunaliella comprises twenty-eight recognized species separated in two subgenera, Pascheria (which contains the freshwater species), and Dunaliella (grouping the marine species); the latter is further subdivided into four sections: Tertiolecta, Dunaliella, Viridis and Peirceinae [11]. The species ascribed to these four sections occur in a wide range of marine habitats such as oceans, brine lakes, salt marshes, salt lagoons and salt water dishes near the sea [14], being Dunaliella salina Teodoresco (section Dunaliella) the most studied one. Dunaliella salina, is the most halotolerant eukaryotic photosynthetic organism known to date [14,15] since it shows a remarkable degree of adaptation to a variety of salt concentrations and it accumulates large amounts of carotenes under extremely stressful conditions such as high salinity, low nitrogen levels, and high solar radiation [14]. Nowadays, it is the best commercial source of natural β-carotene [14,16], and it also stands out as a source of glycerol [17].
The high genetic diversity detected in D. salina, its morphological plasticity, and the restricted geographical sampling used in all scientific publications to date have hindered taxonomic elucidation in this taxon. Our objectives are (1) to use the ITS2 sequences and secondary structure analysis in a thorough geographic and taxonomic representation of the genus Dunaliella and particularly D. salina (including new Spanish and French strains) to improve understanding of the complex phylogenetic structure in this taxon, (2) to study the relationship of the new strains with the Dunaliella sequences available at the ITS2 Database and/or at Gen-Bank [26]; (3) to investigate if the Compensatory Base Changes (CBCs) analyses could elucidate the species concept in Dunaliella, and reveal potentially new species; and (4) to test if D. salina ITC5105 "Janubio" could be considered a new species.

Results
The ITS2 primary and secondary structure phylogenetic analysis of all the Dunaliella sequences available in the ITS2 Database plus the new sequences revealed great heterogeneity, although some of the clades were not statistically supported (Figure 1). No phylogenetic relationship is supported between the two Dunaliella freshwater species, since D. lateralis was positioned outside the Dunaliella subgenus, while D. acidophila was positioned within this subgenus.
The majority of the D. salina strains were distributed in two different clades (salina-clade-I and salina-clade II), positioned between the tertiolecta sub-clades, whereas two D. salina strains (CCAP19/30, CCAP19/18) were positioned together in a different clade. Only two of the new ITS2 D. salina sequences, ITC5119 ("salinaclade-I") and D. salina ITC5105 "Janubio" (salina-clade II), had a unique ITS2 sequence profile ( Figure 1). The CBC analysis of the ITS2 secondary structure showed that there is at least one CBC between "Janubio" and the other D. salina strains analyzed (see Additional file 1), except D. salina CCAP19/30 and CCAP19/18; however these two strains were phylogenetically distant to Janubio.
The taxonomic identification of the Dunaliella sp. strains (below the D. acidophila strain in Figure 1) was not possible because they did not match with any known Dunaliella species. Despite the phylogenetic tree indicates that this group may indeed represent a new species; the analysis of the CBC did not confirm these results (Additional file 1).
We observed a lack of CBCs in some strains when we compared species in different clades: viridis vs salina, viridis vs tertiolecta, salinas vs tertiolecta, salina vs salina, etc (Additional File 1). Also, the CBC analysis of the different species within the subgenus Dunaliella for the confirmation of the species boundaries was not conclusive in some cases (Additional file 1).
The phylogenetic results of the ITS2 sequences in this study, and previous confirmed identification of some Dunaliella strains (see Table 1), allow us to suggest the re-identification of some strains (Table 1).
Finally, the ITS2 data was not informative regarding the geographic origin of the D. salina strains.

Discusion
The ITS2 Database allows automatic large scale simultaneous analyses of both ITS2 sequences and their secondary structures. Potential pitfalls are in structures obtained by different algorithms; however the main difficulty of performing a phylogenetic analysis of the genus Dunaliella is the misinformation available at Culture Collections and GenBank regarding the identification of strains and sequences. The tracking of the true identification of each strain was only possible after consulting several publications where the authors concluded that they were misidentified and suggested their re-identification [9,10,27]. In this study, we have also suggested the re-identification of some strains based on our ITS2 data. To overcome all these unnecessary problems (and given that it is unlikely that all the Dunaliella strains could be openly available), we suggest to establish a "type strain" for each Dunaliella taxon (including subspecies, forms or varieties). These basic data should be easily obtained from any official culture collection, thereby greatly facilitating comparison with new field isolates and avoiding misleading information and/or false conclusions.
Our ITS2 phylogenetic analysis of Dunaliella reveals several major groups, and positions the freshwater D. lateralis clearly outside Dunaliella, confirming that it no longer should be considered a member of this genus [10]. Nevertheless, the other freshwater species analyzed in this study (D. acidophila, CCAP19/35), maintained its position within the subgenus Dunaliella, and was not phylogenetically related to D. lateralis, as recently proposed [28]. Furthermore, the observation that different species belonging to several Sections (Tertiolecta, Viridis, Dunaliella and Peircei) share the exact same ITS2 sequence, make us believe that they correspond to a single species. These data agree with other authors [10,11,27,29], who suggested that the number of Dunaliella species may be much lower than it has been claimed till now. The possibility that the ITS2 gene is not able to discriminate between these species is highly unlikely; therefore, our observations support the suggestion that the morphological and physiological criteria available to discriminate Dunaliella species are either not very reliable [11], or are difficult to interpret..
In an attempt to clarify the species concept within Dunaliella, we searched for compensatory base changes (CBCs). Several case studies have revealed that the detection of a CBC in the ITS2 secondary structure between two organisms is correlated with sexual incompatibility [6,30,31], and these changes have been proposed as markers for distinguishing species [6,7,30,31]. In summary, these investigations conclude that while a CBC in a pair of sequences is positively correlated with species distinctness at a confidence level of 93%, the lack of a CBC in the ITS2 secondary structure does not necessarily indicate that two organisms belong to the same species [6]. The overall analysis of the CBC was     not able to elucidate completely the species boundaries within the different groups of Dunaliella, since in some cases it was observed that there was a lack of CBCs between known distinct species.
In the special case of D. salina, high variation levels have been reported for decades [19][20][21][22][23][24]. However, only Massjuk [18]   currently known as D. salina [10,11,32], indicating the possibility of cryptic speciation [10]. Our ITS2 phylogenetic analysis does confirm the existence of three different groups within D. salina; however, the CBC results did not resolve if these groups may correspond to distinct species, although several strains of each D. salina group shared one CBC with D. salina strains in other groups. On the other hand, the high morphological and physiological variability found within the Spanish D. salina strains under the same lab conditions [24] was not correlated with the phylogenetic observations of this study. This finding indicates that the phylogenetic closeness found with ITS sequences does not reflect common physiological or morphological attributes. Moreover, our data unequivocally suggest that D. salina is not monophyletic, at odds with previous hypotheses [10,11,32]. The other objective of this work was to elucidate if the physiological uniqueness found in "Janubio" [such as its unique fatty acid profile and accumulation of high levels of carotenes under low light flux density conditions; Mendoza et al.: A new strategy for carotenogenesis under conditions of cellular stress in Dunaliella (a Potential New Species), submitted] could be confirmed by the ITS2 data and the CBC analysis. However, although we observed that this strain has a unique ITS2 sequence profile, and had more than one CBC with the other phylogenetically related D. salina, our data do not allow us to conclude that this is a new species, and further studies must be performed to find out if the differences observed are just reflecting a high intra-specific variability. Finally, in agreement with previous studies [8,11,25], our ITS2 data failed to furnish evidence for isolation by distance among D. salina strains.

Conclusion
This work demonstrates that the taxonomy of Dunaliella should be revised. The great diversity observed within the ITS2 sequences of D. salina suggests that different biological groups may exist within this taxon; however, this was not confirmed with the CBC analysis. Likewise, although the Spanish D. salina strain ITC5105 "Janubio" was characterized by a unique ITS2 sequence, the hypothesis that it may be a new species could not be confirmed by the CBCs analysis, requiring further morpho-physiological and genetic investigation. Overall, the use of CBCs to define species boundaries within Dunaliella was not conclusive in some of the cases assessed.

Strains, DNA extraction and ITS amplification
We sequenced the ITS2 region of 13 D. salina strains from Spanish and French saltworks, one strain obtained from the Culture Collection of Algae and Protozoa UK (CCAP), and one D. salina strain that has been maintained in the Instituto Tecnológico de Canarias (ITC henceforth) for several years (purchased from CCAP as Dunaliella salina 19/30). We also sequenced other Dunaliella species (D. minuta CCAP19/5, D. tertiolecta CCAP19/23 and CCAP19/6B, D. bioculata CCAP19/4) [ Table 1]. The sequences of the other strains analyzed in this study were obtained from the ITS2 Database (http://its2.bioapps.biozentrum.uni-wuerzburg.de/? about). Detailed information about the strains used in this study can be found in Table 1.
DNA extraction was performed with a chelex-100 (Biorad, CA, USA) resin-based protocol [33]. For the DNA amplification of the ITS region, primers AB28 and TW81 in Goff et al. (1994) [34] were used. DNA amplification was carried out in a total volume of 25 μl with 1X iQ SYBR Green Supermix (Biorad, CA, USA) and 10 pM of each primer in a Smart Cycler thermocyler (Cepheid,CA, USA) as follows: 5 min at 94°C; 5 cycles of 1 min at 94°C, 2 min at 50°C and 1 min at 72°C; 30 cycles of 1 min at 94°C, 1 min at 62°C and 1 min at 72°C, with a final extension of 5 min at 72°C.
PCR products were first electrophoresed in a 1.5% agarose gel to assure that a single band of 500-600 bp was present, then purified using the Real Clean Spin kit (REAL, Durviz S.L.U., Valencia, Spain), and finally bidirectionally sequenced on an ABI PRISM 3730xl automatic sequencer (Applied Biosystems, CA, USA) at the DNA sequencing services of Macrogen (Korea).

Phylogenetic analyses
Sequences and their individual secondary structures were obtained from the ITS2 Database [35][36][37]. Newly obtained ITS2 sequences were annotated according to Keller et al. [38], and their secondary structures predicted by homology modeling [39]. The phylogenetic analysis followed the procedure outlined in Schultz and Wolf [7] in accordance with Keller et al. [5]. The software used for the ITS2 sequence-structure analysis can be obtained from http://its2.bioapps.biozentrum.uniwuerzburg.de/?about. A global, multiple sequence-structure alignment was generated in 4SALE v1.5 [40,41]. The sequences and their individual secondary structures were synchronously aligned making use of an ITS2 sequence-structure specific scoring matrix [40], and the start and end of the alignment was manually adjusted. Based on primary and secondary structure information, phylogenetic relationships were reconstructed by Prof-DistS, through the use of an ITS2 specific, general time reversible substitution model [42,43]. Bootstrap support [44] was estimated on 100 pseudo-replicates. The resulting tree was visualized with TreeView [45].
To study the species boundaries within Dunaliella we followed the "distinguishing species" instructions [6] based on compensatory base changes (CBCs) in the ITS2 secondary structure, and we used the CBCAnalyzer option implemented in 4SALE.

Additional material
Additional file 1: Compensatory Base Changes (CBC) analysis. Compensatory Base Changes (CBCs) between different groups and species within the Dunaliella taxon (Excel file).