Research Article |
Corresponding author: Yuuki Kobayashi ( yk208115@nodai.ac.jp ) Corresponding author: Masako Takashima ( mt207623@nodai.ac.jp ) Academic editor: Andrey Yurkov
© 2025 Yuuki Kobayashi, Naoto Tanaka, Minenosuke Matsutani, Yuuna Kurokawa, Keita Aoki, Moriya Ohkuma, Ri-ichiroh Manabe, Masako Takashima.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Kobayashi Y, Tanaka N, Matsutani M, Kurokawa Y, Aoki K, Ohkuma M, Manabe R-ichiroh, Takashima M (2025) Whole-genome based phylogeny and comparative genomics of Sporidiobolales and related taxa of Basidiomycetes. IMA Fungus 16: e141626. https://doi.org/10.3897/imafungus.16.141626
|
Sporidiobolales is a fungal order of Basidiomycota within the subphylum Pucciniomycotina. This order encompasses significant yeasts, such as the oleaginous species Rhodotorula toruloides and the opportunistic pathogen R. mucilaginosa. We present the sequencing and comparative analysis of 35 Sporidiobolales strains from 27 species, alongside a Leucosporidium strain (Leucosporidiales), and incorporating publicly available genomic data for related fungi. Based on the phylogenomics data, we found that the topologies obtained were relatively similar and in line with previous reports. A comparison between genomic makeup and previously described phenotypes revealed that the ability to utilize nitrate, raffinose, rhamnose, or sucrose clearly correlated with the existence of key enzymes involved in the corresponding metabolic pathways. However, similar associations could not be established for other carbon sources, such as maltose, galactose, or xylose. We further identified orthologs that are specifically present or absent in each taxon. These results and the genomic sequencing data will help in gaining a better understanding of these non-model yeast species.
Basidiomycete yeast, comparative genomics, phylogenetic analysis, red yeasts
Yeasts are unicellular fungi widely distributed on earth. Since the first discovery of basidiospore formation in yeast (
Because yeasts have fewer morphological characteristics compared to multicellular organisms such as plants and animals, the current classification of yeasts is mainly constructed by molecular methods. The internal transcribed spacer (ITS) sequences of the ribosomal RNA (rRNA) precursor gene in combination with D1-D2 sequences of the 26S rRNA large subunit (LSU) have been primarily used for taxon identification and phylogenetic studies of yeasts (
In addition to phylogenetic assessment, whole-genome sequencing has revealed many biological characteristics of yeasts. In contrast to the model yeast Saccharomyces cerevisiae, whose genome information is widely used in basic biology (
Genomic research on Sporidiobolales initially centered on the oil productivity of R. toruloides. The NP11 strain was initially sequenced to unravel the lipid biosynthesis pathways (
In this study, we sequenced genomes of fungi belonging to Sporidiobolales species from the Japan Collection of Microorganisms (JCM), RIKEN BioResource Research Center (BRC), and the National Biological Resource Center, Japan (NBRC) with the objective of compiling their genome information. Together with publicly available genome sequences, we inferred phylogenetic relationships based on whole-genome information by means of different methods. We compared the repertoire of metabolic genes to described phenotypes such as carbohydrate assimilation and searched for orthologous genes that characterize each taxon to find markers for identification.
All strains sequenced in this study were obtained from the Japan Collection of Microorganisms (JCM), the RIKEN BioResource Center (BRC), Tsukuba, Japan, and the National Biological Resource Center (NBRC), National Institute of Technology and Evolution (NITE), Kisarazu, Japan (Table
Species | Strain | Estimated genome size (bp) | Total assembly size (bp) | No. of contigs | Average length (bp) | Max length (bp) | N50 | BUSCO completeness (fungi_odb10, n = 758) | ||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Complete | Single | Duplicated | Fragmented | Missing | ||||||||
R. glutinis | JCM 8208T | 21,383,036 | 21,031,453 | 544 | 38,660.8 | 412,935 | 124,170 | 79.9 | 79.6 | 0.3 | 5.4 | 14.7 |
R. graminis | JCM 3775T | 20,228,226 | 20,429,315 | 362 | 56,434.6 | 557,532 | 160,126 | 77.8 | 77.7 | 0.1 | 6.1 | 16.1 |
R. babjevae | JCM 9279T | 20,980,473 | 21,042,041 | 369 | 57,024.5 | 595,387 | 178,254 | 84.2 | 83.9 | 0.3 | 4.5 | 11.3 |
R. araucariae | JCM 3770T | 19,161,872 | 19,688,143 | 815 | 24,157.2 | 190,232 | 51,923 | 83.7 | 83.6 | 0.1 | 4.1 | 12.2 |
R. paludigena | JCM 10292T | 20,643,218 | 20,813,848 | 177 | 117,592.4 | 853,605 | 293,322 | 89.7 | 89.4 | 0.3 | 2.6 | 7.7 |
R. sphaerocarpa | JCM 8202T | 17,478,281 | 17,460,028 | 221 | 79,004.7 | 1,019,573 | 173,083 | 86.5 | 86.4 | 0.1 | 3.6 | 9.9 |
R. mucilaginosa | JCM 8115T | 20,611,328 | 20,115,778 | 343 | 58,646.6 | 606,815 | 125,031 | 87.6 | 87.6 | 0.0 | 2.8 | 9.6 |
R. dairenensis | JCM 3774T | 19,296,451 | 19,455,747 | 534 | 36,434.0 | 256,967 | 79,538 | 85.7 | 85.4 | 0.3 | 4.4 | 9.9 |
R. pacifica | JCM 10908T | 20,492,110 | 20,374,152 | 102 | 199,746.6 | 1,534,540 | 562,053 | 88.8 | 88.5 | 0.3 | 2.6 | 8.6 |
R. toruloides | JCM 10020 | 20,645,899 | 20,492,593 | 88 | 232,870.4 | 1,672,916 | 976,034 | 86.7 | 85.5 | 1.2 | 3.6 | 9.7 |
R. toruloides | JCM 10021 | 20,781,719 | 20,389,818 | 78 | 261,407.9 | 1,973,210 | 938,736 | 88.2 | 87.9 | 0.3 | 3.0 | 8.8 |
R. toruloides | JCM 10049 | 20,313,569 | 20,270,281 | 53 | 382,458.1 | 1,516,062 | 930,903 | 85.7 | 85.2 | 0.5 | 4.7 | 9.6 |
R. toruloides | JCM 24501 | 20,174,084 | 20,321,374 | 38 | 534,773.0 | 1,645,103 | 1,204,811 | 88.3 | 87.9 | 0.4 | 1.7 | 10.0 |
R. toruloides | NBRC 10512 | 20,153,584 | 20,381,470 | 324 | 62,905.8 | 464,688 | 168,846 | 86.8 | 86.4 | 0.4 | 3.3 | 9.9 |
R. toruloides | NBRC 10513 | 21,662,728 | 20,583,880 | 155 | 132,799.2 | 1,266,296 | 475,791 | 88.5 | 88.4 | 0.1 | 2.8 | 8.7 |
Rhodotorula azoricus | JCM 11251T | 22,839,766 | 22,682,430 | 300 | 75,608.1 | 603,599 | 223,031 | 89.1 | 89.1 | 0.0 | 2.1 | 8.8 |
Rhodotorula microsporus | JCM 6882T | 25,074,185 | 25,687,078 | 608 | 42,248.5 | 437,949 | 101,261 | 85.7 | 85.4 | 0.3 | 3.7 | 10.6 |
Rhodotorula lusitaniae | JCM 8547T | 24,604,526 | 24,563,961 | 510 | 48,164.6 | 546,650 | 135,555 | 87.7 | 87.7 | 0.0 | 2.8 | 9.5 |
Rhodotorula ruineniae | JCM 8097 | 25,069,789 | 25,142,625 | 409 | 61,473.4 | 765,765 | 188,449 | 87.8 | 87.5 | 0.3 | 2.8 | 9.4 |
Rhodotorula poonsookiae | JCM 10207T | 23,993,186 | 23,978,565 | 374 | 64,113.8 | 575,888 | 167,670 | 86.2 | 85.9 | 0.3 | 4.1 | 9.7 |
Rhodotorula odoratus | JCM 11641T | 26,022,727 | 25,070,242 | 911 | 27,519.5 | 431,847 | 98,890 | 88.8 | 88.4 | 0.4 | 1.7 | 9.5 |
Rhodotorula nylandii | JCM 10213T | 24,785,751 | 24,737,636 | 726 | 34,073.9 | 473,705 | 128,444 | 85.9 | 85.5 | 0.4 | 4.2 | 9.9 |
S. johnsonii | JCM 1840T | 22,015,142 | 21,811,286 | 422 | 51,685.5 | 698,303 | 136,638 | 79.9 | 79.9 | 0.0 | 5.5 | 14.6 |
S. johnsonii | JCM 5296 | 23,744,835 | 21,533,936 | 815 | 26,422.0 | 190,683 | 54,066 | 80.6 | 80.3 | 0.3 | 5.1 | 14.3 |
S. salmonicolor | JCM 1841T | 19,818,358 | 20,466,144 | 785 | 26,071.5 | 259,920 | 50,696 | 80.6 | 80.5 | 0.1 | 5.9 | 13.5 |
S. salmonicolor | JCM 21900 | 19,507,291 | 20,162,018 | 1,018 | 19,805.5 | 131,956 | 37,030 | 78.6 | 78.5 | 0.1 | 7.0 | 14.4 |
S. pararoseus | JCM 3765 | 20,731,455 | 21,161,000 | 463 | 45,704.1 | 658,648 | 124,825 | 91.9 | 91.6 | 0.3 | 1.5 | 6.6 |
S. pararoseus | JCM 5350T | 22,872,948 | 22,408,324 | 938 | 23,889.5 | 333,600 | 57,498 | 90.4 | 90.0 | 0.4 | 2.5 | 7.1 |
S. salmoneus | JCM 6883 | 20,539,372 | 20,345,533 | 264 | 77,066.4 | 962,155 | 285,613 | 90.8 | 90.8 | 0.0 | 1.5 | 7.7 |
S. carnicolor | JCM 3766T | 19,484,300 | 19,514,829 | 526 | 37,100.4 | 450,810 | 91,840 | 88.4 | 88.4 | 0.0 | 3.8 | 7.8 |
S. ruberrimus | JCM 16303 | 21,017,201 | 20,920,594 | 275 | 76,074.9 | 804,554 | 212,308 | 89.2 | 89.1 | 0.1 | 1.8 | 9.0 |
S. phaffii | JCM 11491T | 19,158,692 | 19,322,758 | 463 | 41,733.8 | 449,409 | 105,658 | 88.1 | 88.1 | 0.0 | 3.7 | 8.2 |
S. koalae | JCM 15063T | 18,341,066 | 18,348,248 | 157 | 116,867.8 | 1,700,008 | 371,635 | 90.5 | 90.5 | 0.0 | 1.6 | 7.9 |
S. roseus | JCM 5353T | 23,222,652 | 23,327,241 | 1,591 | 14,662.0 | 165,583 | 29,640 | 89.8 | 89.7 | 0.1 | 2.8 | 7.4 |
S. blumeae | JCM 10212T | 20,156,654 | 19,940,609 | 814 | 24,497.1 | 300,686 | 53,586 | 84.0 | 83.9 | 0.1 | 4.5 | 11.5 |
L. creatinivora | JCM 10699 | 27,665,643 | 27,969,159 | 122 | 229,255.4 | 1,866,479 | 836,129 | 89.0 | 87.9 | 1.1 | 2.6 | 8.4 |
From R. toruloides strains JCM 10020, JCM 10021, JCM 10049, JCM 24501 and L. creatinivora (JCM 10699), cells were freeze-dried and crushed with a mortar and pestle, genomic DNA was extracted according to
The genome sizes were estimated with GenomeScope1.0 (
Genes were predicted with MAKER 2.31.8 (
For the construction of a concatenated BUSCO single-copy tree, conserved single-copy orthologs were searched from protein sequences with BUSCO 5.4.2 (
The common orthogroup (OG) score was calculated following the description in
The amino acid sequences of predicted genes were assigned to KEGG orthologs (KOs) using the GhostKOALA web server (
Assembly genomes and gene catalogs used as reference, i.e. R. toruloides NP11 (
AAI: average amino acid identity
BRC: BioResource Research Center
ITS: internal transcribed spacer
JCM: Japan Collection of Microorganisms
KEGG: Kyoto Encyclopedia of Genes and Genomes
KO: KEGG ortholog
LSU: large subunit
NBRC: National Biological Resource Center
OG: orthogroup
rRNA: ribosomal RNA
We sequenced 35 yeast strains from 27 species of Sporidiobolales and a Leucosporidium strain, maintained in RIKEN BRC-JCM and NBRC, including type strains of nine Rhodotorula, six Rhodosporidiobolus, and eight Sporobolomyces species (Table
In line with the predicted genome sizes, the genome assemblies for Sporidiobolales were between 17 and 26 Mbp, and that of L. creatinivorum was approximately 28 Mbp (Table
Homology-based gene prediction estimated that Sporidiobolales strains have approximately 6,400 to 9,800 genes, while L. creatinivorum has 10,454 genes (Table
Species | Strain | No. of genes | BUSCO completeness (fungi_odb10, n = 758) | ||||
---|---|---|---|---|---|---|---|
Complete | Single | Duplicated | Fragmented | Missing | |||
R. glutinis | JCM 8208T | 7,882 | 95.3 | 94.9 | 0.4 | 1.7 | 3.0 |
R. graminis | JCM 3775T | 7,626 | 96.3 | 96.2 | 0.1 | 1.3 | 2.4 |
R. babjevae | JCM 9279T | 7,741 | 96.3 | 96.0 | 0.3 | 1.1 | 2.6 |
R. araucariae | JCM 3770T | 7,482 | 94.6 | 94.2 | 0.4 | 1.7 | 3.7 |
R. paludigena | JCM 10292T | 7,753 | 97.2 | 97.1 | 0.1 | 1.1 | 1.7 |
R. sphaerocarpa | JCM 8202T | 6,408 | 95.8 | 95.8 | 0.0 | 1.5 | 2.7 |
R. mucilaginosa | JCM 8115T | 7,155 | 96.3 | 96.3 | 0.0 | 1.2 | 2.5 |
R. dairenensis | JCM 3774T | 6,885 | 93.1 | 93.0 | 0.1 | 3.0 | 3.9 |
R. pacifica | JCM 10908T | 7,406 | 97.1 | 96.8 | 0.3 | 0.5 | 2.4 |
R. toruloides | JCM 10020 | 8,047 | 96.8 | 95.5 | 1.3 | 0.8 | 2.4 |
R. toruloides | JCM 10021 | 8,007 | 96.7 | 96.0 | 0.7 | 1.2 | 2.1 |
R. toruloides | JCM 10049 | 7,938 | 96.8 | 96.3 | 0.5 | 0.7 | 2.5 |
R. toruloides | JCM 24501 | 8,001 | 96.8 | 96.4 | 0.4 | 0.7 | 2.5 |
R. toruloides | NBRC 10512 | 8,254 | 97.5 | 97.0 | 0.5 | 0.7 | 1.8 |
R. toruloides | NBRC 10513 | 8,308 | 97.5 | 97.0 | 0.5 | 0.4 | 2.1 |
Rhodotorula azoricus | JCM 11251T | 8,028 | 96.8 | 96.4 | 0.4 | 1.1 | 2.1 |
Rhodotorula microsporus | JCM 6882T | 9,769 | 95.3 | 94.9 | 0.4 | 1.7 | 3.0 |
Rhodotorula lusitaniae | JCM 8547T | 9,421 | 97.4 | 97.0 | 0.4 | 0.7 | 1.9 |
Rhodotorula ruineniae | JCM 8097 | 9,586 | 96.2 | 95.9 | 0.3 | 1.5 | 2.3 |
Rhodotorula poonsookiae | JCM 10207T | 9,313 | 96.4 | 96.0 | 0.4 | 1.2 | 2.4 |
Rhodotorula odoratus | JCM 11641T | 8,476 | 94.8 | 94.3 | 0.5 | 1.6 | 3.6 |
Rhodotorula nylandii | JCM 10213T | 9,337 | 96.9 | 96.2 | 0.7 | 1.1 | 2.0 |
S. johnsonii | JCM 1840T | 7,701 | 96.8 | 96.7 | 0.1 | 1.3 | 1.9 |
S. johnsonii | JCM 5296 | 7,614 | 94.4 | 94.3 | 0.1 | 1.2 | 4.4 |
S. salmonicolor | JCM 1841T | 7,111 | 96.1 | 96.0 | 0.1 | 1.7 | 2.2 |
S. salmonicolor | JCM 21900 | 6,992 | 94.3 | 94.3 | 0.0 | 2.2 | 3.5 |
S. pararoseus | JCM 3765 | 7,917 | 96.3 | 96.0 | 0.3 | 1.2 | 2.5 |
S. pararoseus | JCM 5350T | 8,374 | 95.9 | 95.6 | 0.3 | 1.2 | 2.9 |
S. salmoneus | JCM 6883 | 7,662 | 97.1 | 97.0 | 0.1 | 0.7 | 2.2 |
S. carnicolor | JCM 3766T | 7,215 | 93.6 | 93.3 | 0.3 | 2.8 | 3.6 |
S. ruberrimus | JCM 16303 | 7,444 | 96.5 | 96.4 | 0.1 | 0.9 | 2.6 |
S. phaffii | JCM 11491T | 7,249 | 95.6 | 95.5 | 0.1 | 1.2 | 3.2 |
S. koalae | JCM 15063T | 6,593 | 96.6 | 96.2 | 0.4 | 0.7 | 2.7 |
S. roseus | JCM 5353T | 9,067 | 95.3 | 94.6 | 0.7 | 2.0 | 2.7 |
S. blumeae | JCM 10212T | 7,167 | 93.9 | 93.8 | 0.1 | 2.5 | 3.6 |
L. creatinivora | JCM 10699 | 10,454 | 97.2 | 96.0 | 1.2 | 0.4 | 2.4 |
We predicted the phylogenetic relationships between these strains from whole-genome data, including Sporidiobolales and other Microbotryomycetes (Leucosporidium and Microbotryum) the data for which were retrieved from the JGI Mycocosm (
A phylogenetic tree based on concatenated protein sequences of 331 BUSCO genes found as single copy in all strains. The numbers beside the nodes represent the reliability based on SH-aLRT, aBayes, ultrafast bootstrap values, gene concordance factors (gCF), and site concordance factors (sCF) from left to right.
The BUSCO assessment of the gene catalog for each species found that 331 BUSCO orthologs (fungi_odb10) were present as single copies in all strains (Suppl. material
Orthologs were retrieved with OrthoFinder, which assigned 361,309 genes to 13,277 orthogroups, of which 3,181 orthogroups were found in all strains. A phylogenetic species tree inferred by orthologous gene trees also supported the monophyly of each of the genera Rhodotorula, Rhodosporidiobolus, and Sporobolomyces, as well as the subclades mentioned above (Fig.
The alignment-free phylogenetic tree based on the distance of nucleotide similarity, although keeping the monophyly of Rhodotorula and Rhodosporidiobolus, resulted in a somewhat different topology (Fig.
Based on the predicted gene sets, we investigated the gene diversity of genera in Sporidiobolales. Initially, we utilized the presence-absence patterns of orthologs (PAPO) method (
A Ratio of conserved orthogroups (OGs) in each genus. The common OG score was calculated as the ratio of the number of OGs maintained in all members of a genus to the total number of OGs, as described by
Subsequently, we conducted average amino acid identity (AAI) analysis, following the methodology described by
Since yeasts have species-specific physiological traits (Kurzman et al. 2010), we examined metabolic genes that contribute to key steps of energy source utilization. We compared gene catalogs derived from sequenced and reference genomes to orthologs annotated in the Kyoto Encyclopedia of Genes and Genomes (KEGG) and counted the number of genes found for each KEGG ortholog (Suppl. material
Comparison between assimilation phenotypes and the numbers of metabolic enzymes in the genome for which a correlation is observed. The phenotypes are from Kurtzman et al. eds. (2011),
For several other carbon sources, however, no correlation was found between the number of genes linked to the associated metabolic pathway and the described phenotypes. This was the case for maltose, starch, galactose, sorbose, xylose, arabinose, ribose, mannitol, and ethanol (Fig.
Given the known ability of yeasts in the order Sporidiobolales to produce carotenoids such as gamma-carotene, beta-carotene, and torulene (
Several species, such as R. toruloides and R. glutinis, have been reported to contain high levels of lipids (
To investigate potential marker genes that characterize taxa, we searched for KEGG Orthologs (KOs) that are unique to specific groups. To assess the relationship between gene presence and taxa, we calculated the correlation coefficient between binarized gene presence and taxa. Only KOs identified with an absolute correlation coefficient of 0.7 or higher were considered as relevant. Initially, we analyzed genes that exhibited a varying presence across three orders of Microbotryomycetes: Sporidiobolales, Leucosporidiales, and Microbotryales (Suppl. material
Next, we searched for genes that are specifically present or absent in one of the three genera of Sporidiobolales. In the comparison between Rhodotorula and the other two genera, 14 KOs were identified (Suppl. material
In addition to comparisons among existing taxa, we checked KOs whose presence differs between the two clades of Sporobolomyces. We found 83 relevant KOs, including 4 KOs that are absent in S. salmonicolor and S. johnsonii but present in all other clades of Sporobolomyces, and 21 KOs that are present in S. salmonicolor and S. johnsonii but not in other Sporobolomyces clades (Suppl. material
We also conducted a search for genes that are indicative of the previous classification, which was based on the presence of ballistospores. Specifically, we examined the presence of certain KOs that exhibited significant differences between the ballistospore-forming group (previously classified as Sporobolomyces and Sporidiobolus) and the ballistospore non-forming group (previously classified as Rhodotorula and Rhodosporidium). Despite identifying multiple KOs with absolute correlation coefficients exceeding 0.7, we were unable to find any KO that distinctly distinguished between the two groups (Suppl. material
Lastly, we conducted a comparative analysis between taxa by focusing on the number of gene components present in specific metabolic pathways or modules. As a result, it was reconfirmed that Leucosporidium completely lacks the components of the carotenoid biosynthesis pathway, map00906 (Suppl. material
Sporidiobolales is a yeast order that consists of three genera, Sporobolomyces, Rhodotorula, and Rhodosporidiobolus (
Our phylogenetic trees suggest that certain clades of R. toruloides show genetic distances comparable to those observed in other interspecific relationships, such as those involving R. glutinis, R. graminis, and R. babjevae (Figs
Our phylogenetic trees were based on different methods and resulted in topologies that differed to some extent. In the field of phylogenetics, the concatenation of gene sequences has been widely used in multigene analyses (
Our comparison between phenotypes and enzymes involved in the corresponding metabolic pathways (Fig.
In investigating whether orthologs specific to a taxon could be identified, we found several genes whose presence strongly correlates with the taxonomic group they belong to (Suppl. material
In this study, we sequenced 35 genomes of Sporidiobolales and a Leucosporidium genome. Our phylogenetic study confirmed the monophyly of the genera Rhodotorula and Rhodosporidiobolus with all methods tested, while the monophyly of Sporobolomyces was supported by two of the three methods. Also, our OG and AAI analyses revealed that strains within each genus are closely related to each other at the orthologous gene level. Comparative studies showed that metabolic traits linked to the usage of several compounds, such as rhamnose and nitrate, can be predicted from gene content, while this is not possible in the case of many other compounds. We identified KEGG orthologs whose presence significantly correlates with each taxon, making them potential taxon markers. These results, along with genome information, may help facilitate a better understanding of Sporidiobolales yeasts.
We thank Dr. Rikiya Endoh at the Japan Collection of Microorganisms, RIKEN BioResource Research Center, for DNA preparation that enabled genome sequencing.
The authors have declared that no competing interests exist.
No ethical statement was reported.
All the fungal strains used in this study have been legally obtained, respecting the Convention on Biological Diversity (Rio Convention).
This work was supported by the foundation of the Institute for Fermentation, Osaka (IFO, no. K-2020-006 to M.T.) and a Grant-in-Aid for Scientific Research from the MEXT (no. 20K06801 to M.T.). R.M. was supported by a Research Grant from the Ministry of Education, Culture, Sports, Science and Technology of Japan to the RIKEN Center for Integrative Medical Sciences. Genome sequencing was partially funded by the Genome Information Upgrading Program of the National BioResource Project of MEXT, with financial support provided to M.O. and R.M.
Y. Ko., N. T., and M. T. designed and coordinated the study. M. O. and M. T. isolated and cultured fungal strains. Y. Ku. isolated and M. M. sequenced genomic DNA. Y. Ko. and N. T. assembled and analyzed genomic data. Y. Ko., N. T. and M. T. drafted the manuscript. All authors contributed to the assessment of results, manuscript revisions and final approval for publication.
Yuuki Kobayashi https://orcid.org/0000-0003-1061-3639
Minenosuke Matsutani https://orcid.org/0000-0002-6262-3188
Keita Aoki https://orcid.org/0000-0003-2079-4031
Masako Takashima https://orcid.org/0000-0002-7686-8661
All sequence reads and assemblies were submitted to DDBJ/NCBI under BioProject accession numbers PRJDB3686 (JCM 10020), PRJDB3687 (JCM 10021), PRJDB3688 (JCM 10049), PRJDB3691 (JCM 24501), PRJDB3719 (JCM 10699), and PRJDB17412 (all other strains). The detailed accession numbers for sequence reads and assemblies were described in Suppl. material
Images for supplemenatl explanation and tables of full BUSCO tables, KEGG-OGs, taxon-specific genes and accession numbers
Data type: zip