Research Article |
Corresponding author: Brenda D. Wingfield ( brenda.wingfield@up.ac.za ) Academic editor: Marc Stadler
© 2025 Davide D’Angelo, Roberto Sorrentino, Tiphany Nkomo, Xianzhi Zhou, Niloofar Vaghefi, Byron Sonnekus, Tanay Bose, Domenico Cerrato, Loredana Cozzolino, Nicky Creux, Nunzio D’Agostino, Gerda Fourie, Giovanna Fusco, Almuth Hammerbacher, Alexander Idnurm, Levente Kiss, Yanping Hu, Hongli Hu, Ernesto Lahoz, Jason Risteski, Emma T. Steenkamp, Maurizio Viscardi, Magriet A. van der Nest, Yuan Wu, Hao Yu, Jianjin Zhou, Chinthani S. Karandeni Dewage, Loly I. Kotta-Loizou, Henrik U. Stotz, Bruce D. L. Fitt, Yong-Ju Huang, Brenda D. Wingfield.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
D’Angelo D, Sorrentino R, Nkomo T, Zhou X, Vaghefi N, Sonnekus B, Bose T, Cerrato D, Cozzolino L, Creux N, D’Agostino N, Fourie G, Fusco G, Hammerbacher A, Idnurm A, Kiss L, Hu Y, Hu H, Lahoz E, Risteski J, Steenkamp ET, Viscardi M, van der Nest MA, Wu Y, Yu H, Zhou J, Karandeni Dewage CS, Kotta-Loizou LI, Stotz HU, Fitt BDL, Huang YJ, Wingfield BD (2025) IMA GENOME - F20 A draft genome assembly of Agroathelia rolfsii, Ceratobasidium papillatum, Pyrenopeziza brassicae, Neopestalotiopsis macadamiae, Sphaerellopsis filum and genomic resources for Colletotrichum spaethianum and Colletotrichum fructicola. IMA Fungus 16: e141732. https://doi.org/10.3897/imafungus.16.141732
|
Hemp is an ancient plant that, over the centuries, has united generations and continents, shaping agricultural landscapes. Over time, the economies, uses and traditions of many people have been influenced by this species and its numerous applications (
Italy, a major producer and exporter of hemp, has progressively reduced its cultivated area to the point of almost eliminating it due to the laboriousness of cultivation and processing, the diffusion of synthetic fibres, the availability of alternative and cheaper vegetable fibres such as cotton and jute, and regulations on the use of drugs (
With the approval of Law N. 242 of 2 December 2016, entitled “Provisions for the promotion of hemp cultivation and the agro-industrial supply chain,” the interest of the agricultural, industrial, and entrepreneurial world towards hemp and its multifunctionality is renewed.
Hemp is susceptible to several diseases (
Fungi of the genus Sclerotium are characterized by the production of sclerotia and sterile mycelia, devoid of spores. The genus Sclerotium includes more than 40 phytopathogenic species (Farr 2008).
Many Sclerotium species reproduce asexually, with sexual reproduction rare, and are recognized exclusively in their anamorphic phase (Punja and Rahe 2001). Sclerotium rolfsii, the best-known species of the genus, presents the teleomorph phase Agroathelia rolfsii (Curzi), characterized by the development of resupinate basidiocarps and hyphal filaments emerging from the germinating sclerotia (
A total of three A. rolfsii isolates, which produce symptoms of stem rot disease on groundnut, were isolated. Their genomes were sequenced and assembled, providing insights into aspects of the pathogen’s pathogenicity (Iquebal et al. 2021;
Noteworthy, strains MR10 and ZY, which have genome sizes exceeding 70 Mb, are noted in the NCBI database as “atypical assemblies” due to their unusually large genome sizes. Assembling and annotating the genome of A. rolfsii isolated from hemp can help researchers to develop more effective control strategies to protect this crop. The identification of putative virulence candidate genes can be exploited to develop agrochemicals or targeted therapeutic approaches for disease control, mainly in the post-penetration phase. Here, we report the draft genome of A. rolfsii isolated from industrial hemp in Campania in June 2022 (Fig.
Italy: Campania: Caserta. Isolated from symptomatic plant collar of Cannabis sativa L. var. Codimodo, 07/2021.
The draft genome sequence has been deposited in NCBI GenBank under the accession number JAXRUS000000000. The version described in this paper is version JAXRUS010000000. Accession for SRA data: PRJNA1052013.
The collars of 10 symptomatic plants were washed under running water for 10 minutes, then cut into 3 mm sections. For each sample, five sections of the collar underwent surface sterilization by immersion in 5% sodium hypochlorite for 1 minute, followed by immersion in 70% ethanol for an additional minute. The excised small pieces of tissue were then rinsed five times in sterile distilled water in a 50 ml tube, dried by blotting on sterile 3 mm Whatman paper, subdivided into smaller portions of tissue and, finally plated on potato dextrose agar (PDA) amended with 100 mg liter−1 of streptomycin sulphate and stored at 24 °C ±2 °C in the dark. Plates were inspected daily, and growing colonies were sub-cultured on PDA for 10 days. In total, 50 monohyphal purified fungal isolates were obtained. Colonies of these isolates were white and produced sclerotia after 10 days. We identified the target fungal isolate from the pool by sequencing the Internal Transcribed Spacer (ITS), as amplified using PCR. DNA was extracted using the DNeasy Plant Mini Kit (Qiagen, USA) following the manufacturer’s instructions and PCR reactions were carried out using 10 ng of DNA with the ITS1/ITS4 primers (
DNA was purified from mycelia obtained by culturing A. rolfsii on PDA plates covered with cellophane films and kept in the dark at 24 °C for 7 days. Mycelia were collected, ground in liquid nitrogen and 100 mg were used to isolate the DNA using the QIAGEN® Genomic-tip kit following the manufacturer’s instructions. Genomic DNA was resuspended in EB buffer (Qiagen) to avoid EDTA in the final preparation. DNA quality (260/280 ratio) was assessed using a NanoDrop ND-1000 UV-Vis Spectrophotometer (NanoDrop Technologies Inc., Wilmington, DE). The amount of DNA was assessed using the Qubit platform (Invitrogen, Paisley, UK). Library preparation was performed using ILMN DNA LP (M) Tagmentation (96 Samples, IPB), and IDT® for Illumina® DNA/RNA UD Indexes Set A, Tagmentation (Illumina Italy S.r.l. Milano-Italy) according to the Illumina DNA preparation guide. The quality and amplicon size of the libraries were assessed using a Tapestation 4150 (Agilent) at multiple steps during the protocol, typically after size selection and PCR amplification. Three different libraries were normalized to a working concentration pool of 12 pM using the molarity calculated from Qubit measurements adjusted for fragment size. The sequencing run was performed on Illumina MiSeq (Illumina), equipped with MiSeq Reagent Kit v3 (600 cycles) which includes: Paired-End Reagent Plate, MiSeq Flow Cell.
Quality control on FASTQ files (paired-end 150 bp) was performed using FastQC v0.11.9 (
De novo genome assembly was performed using ABYSS v2.3.5 (
Unique alignments less than 500 bp and scaffolds less than 1,000 bp were filtered out. Gap closure was achieved using Sealer v2.3.5 (
Finally, the draft genome sequence was subjected to repeat region masking using RepeatMasker v4.1.2 (Tarailo-Graovac and Chen 2009) in slow search mode with rmblastn as the search engine and the Agaricomycetidae dataset from the Dfam repository (
Quality assessment of the assembly was performed with QUAST v5.2.0 (
The assembled genome was compared with the A. rolfsii reference genome downloaded from GenBank (accession number GCA_018343915.1) using MUMmer v3.23 (
The ITS sequence from an A. rolfsii strain isolated from C. sativa (GenBank accession: MZ242252.1) was utilized as a BLASTn query to identify the corresponding ITS sequence within the assembled genome. A phylogenetic analysis was then conducted using the identified ITS sequence from the draft genome, along with sequences from 16 A. rolfsii strains, 5 A. delphinii strains, 3 A. coffeicola strains, and a Rhizoctonia solani strain used as an outgroup. Multiple sequence alignment was performed using Mafft v.7.490 with the G-INS-i method and 1,000 iterations. Finally, a phylogenetic tree was generated using IQ-TREE v2.0.7 (
The protein complement of A. rolfsii was compared with the proteomes of four other fungal species belonging to the Agaricomycotina. To this end, the protein sequences of Rhizoctonia solani (GCA_016906535.1), Armillaria mellea (GCA_030407055.1), Agaricus bisporus (GCA_000300575.1), and Pleurotus ostreatus (GCA_014466165.1) were downloaded from GenBank.
OrthoFinder v2.5.5 (
BinGO v3.0.3 software (
The enrichment analysis involved a hypergeometric test and a Benjamini and Hochberg false discovery rate (FDR) correction of the p-value. Only GO terms in the biological process and molecular function domains with an FDR-corrected p-value > 0.05 were considered statistically significant.
Gene prediction was performed by running Helixer v0.3.2 (
Blast2GO v3.1.9 (
The sequencing run yielded 10.5 Gbp with a Q index >30. A total of 25,898,770 high-quality paired-end reads were used for genome assembly, resulting in 3,138 contigs, whose total sequence length is 44,845,534 bp with an N50 of 3,998,658 bp (Table
The size of the assembled genome were larger than that estimated based on k-mer distribution (~45 Mb vs ~37 Mb) assuming a genome coverage of 86.6 X. A total of 427,196 bp (0.95%) were masked, comprising 9,616 simple repeats and 1,663 low complexity regions. The reads used in the assembly process were mapped onto the assembled genome to assess assembly accuracy. Over 99.9% of the reads were successfully mapped.
A total of 14,804 genes was predicted, including 11,230 protein-coding genes and 3,574 non-coding RNAs. The BUSCO analysis indicated a completeness level of 92.5% for the Agaricomycetes, identifying 2,471 complete and single-copy BUSCOs, along with 208 complete but duplicated BUSCOs, and 72 fragmented BUSCOs. A total of 36,139 GO terms were assigned to 4,428 protein coding genes. A total of 4,086 proteins has been assigned to KEGG, covering 311 pathways (Suppl. material
The annotation of genes involved in the biosynthesis of secondary metabolites led to the identification of 137 genes grouped into 36 clusters (Suppl. material
Nine hundred sixty-two (N. = 962) proteins possibly secreted into the extracellular space were identified, as they include a signal peptide and lack transmembrane domains. Further filtering based on the number of amino acid residues and cysteine content reduced the dataset to 30 putative pathogenesis-related (PR) proteins (Suppl. material
Finally, exploration of plant-pathogen interactions led to the identification of 73 protein-coding genes involved in 126 interactions, of these, 37 were classified as “association”, 26 as “direct interaction”, 31 as “physical association”, 29 as “suppressive genetic interaction defined by inequality” and one as “colocalization” (Suppl. material
Alignment between the 3,138 contigs and the 8 scaffolds of the A. rolfsii reference genome (GCA_018343915.1) resulted in 76.34% aligned bases and a total of 584,388 single nucleotide polymorphisms, indicating a high degree of similarity. Furthermore, aligning only the five largest contigs still provided good coverage of the GCA_018343915.1 reference genome (Fig.
The substitution model “K3Pu+F” was identified as the most suitable model for tree inference using the maximum likelihood method. The ITS sequence-based tree positioned the A. rolfsii strain CCF1 within a major clade alongside other A. rolfsii strains, showing a distinct clustering pattern. In contrast, A. delphinii and A. coffeicola strains formed separate clusters (Fig.
Comparison between the proteomes of A. rolfsii, R. solani, A. mellea, A. bisporus and P. ostreatus led to the identification of 9,854 orthogroups, 3,967 (40.3%) of which were classified as core-orthogroups. A total of 10,441 (93%) A. rolfsii protein-coding genes were assigned to 6,176 orthogroups, of which 330 (3.35%) were species-specific (Fig.
Plant pathogenic fungi pose a major threat to agriculture, causing significant yield losses. The development of new organic or traditional agrochemicals and targeted therapeutic approaches for the control of plant pathogenic fungi requires a comprehensive knowledge of their gene repertoire.
In the present study, the genome of the pathogenic fungus A. rolfsii strain CCF1 isolated from hemp (Cannabis sativa L.) was assembled and annotated. The data produced will allow the scientific community to have a further resource to study genetic architecture and discover new effectors and their pathogenic mechanism. Indeed, few A. rolfsii genomes have been assembled so far. Although fragmented into over 3,000 contigs, the genome we released is quite complete. Indeed, the five largest contigs cover much of the genome of another A. rolfsii strain sequenced using third-generation sequencing technologies (
In this study, a combination of self-learning and deep learning algorithms, so far rarely applied in fungal genome annotation, allowed the prediction of a considerable number of genes: 11,230 protein-coding genes and 3,574 non-coding RNAs. The quality of the annotation was satisfactory and near-complete as 92.5% of the Agaricomycetes BUSCOs were identified.
The phylogenetic analysis revealed the evolutionary context of A. rolfsii CCF1, placing it within a major clade that exclusively includes A. rolfsii isolates. In contrast, A. coffeicola and A. delphinii isolates were positioned in separate clades. The search for orthologs between different fungal species led to the identification of core-orthogroups, thus highlighting common and species-specific protein-coding genes.
The identification of overrepresented GO terms associated with species-specific genes provides valuable insights into the molecular mechanisms that have shaped the biology of A. rolfsii CCF1. The presence of these genes suggests evolutionary innovations that have contributed to its divergence and adaptation from other fungal relatives within the Agaricomycotina. Studying these genetic features can enhance our understanding of the evolutionary processes driving diversification and specialization among fungi. From a biological perspective, analysing species-specific proteins in A. rolfsii through GO enrichment provides insights into unique biological processes and molecular functions critical for its survival and adaptation. The presence of various types of interactions suggests a complex relationship between the fungal pathogen and the host plant; these could be studied to gain a deeper understanding of the molecular mechanisms of host-pathogen coevolution and adaptation. In addition, the effector identification procedure narrows the field to a small number of target genes for further study. Further studies on these genes could deepen our knowledge of A. rolfsii’s biology, illuminate its distinct characteristics, and aid in developing precise strategies for managing hemp diseases.
Authors: Davide D’Angelo, Roberto Sorrentino, Ernesto Lahoz, Domenico Cerrato, Maurizio Viscardi, Loredana Cozzolino, Giovanna Fusco, Nunzio D’Agostino*
*Contact: nunzio.dagostino@unina.it
Assembly features | A. rolfsii scaffolds |
---|---|
Total sequence length (bp) | 44,845,534 |
Number of contigs | 3,138 |
Number of contigs (≥50,000 bp) | 16 |
Largest contig size (bp) | 5,959,244 |
GC content (%) | 46.49 |
N50 | 3,998,658 |
N90 | 4,804 |
L50 | 5 |
L90 | 1,500 |
Missingness (%) | 0.406 |
A Dot plot illustrating the alignment results between the five largest contigs assembled in this study and the 8 scaffolds from the A. rolfsii reference genome (GenBank accession: GCA_018343915.1) B Maximum-Likelihood tree constructed using Internal Transcribed Spacer (ITS) sequences, depicting the phylogenetic relationships among taxa within the Agroathelia genus. Rhizoctonia solani was used as the outgroup, with bootstrap values > 70 indicated on the branches C Venn diagram displaying orthogroups identified in the proteomes of A. rolfsii and four other fungal species within the Agaricomycotina.
Ceratobasidium
is a genus of Basidiomycetes fungus that plays diverse ecological roles (
Presently, there are 44 species within the Ceratobasidium genus (https://www.mycobank.org/). However, DNA sequences are unavailable for the majority of these species. Genomic data are only available for C. theobromae, the causative agent of vascular-streak dieback (VSD) in cacao (
Ceratobasidium papillatum
was described as a symbiont of orchids by
Australia: Toowoomba, Queensland: isolated from Sarcochilus dilatatus (CBS 570.83 ex type strain).
This Whole Genome Shotgun project for Ceratobasidium papillatum CBS 570.83 has been deposited at DDBJ/ENA/GenBank under the accession JAYRCO000000000.1; BioProject PRJNA1046290 and BioSample SAMN38480858. The version described in this paper is version JAYRCO010000000.
The isolate Ceratobasidium papillatum (CBS 570.83) was received from the Westerdijk Fungal Biodiversity Institute in the Netherlands. The isolate was revitalised on half-strength potato dextrose agar (PDA) medium (19 g PDA powder (Merck, South Africa); 7 g agar; distilled water 1 L) at 25 °C. Total genomic DNA was extracted from 14-day-old cultures following the protocol suggested by
The sequencing library was prepared using the MGIEasy Universal DNA library prep kit to generate 150 bp paired-end libraries. Whole genome sequencing was performed using the MGI DNBSeq G400 sequencer at the Agricultural Research Council Biotechnology Platform, South Africa (ARC-BTP) using the PE150 sequencing strategy.
The quality of the reads was assessed using FastQC v0.11.7 (https://github.com/s-andrews/FastQC) and was assembled using SPAdes v3.15.0 (
To confirm the identity of the isolate used for genome sequencing, phylogenetic analysis was conducted using maximum likelihood (ML) and Bayesian (BI) approaches. For this, a sequence dataset was prepared that included the complete internal transcribed spacer (ITS) gene region extracted from the assembled genome using CLC Genomics Workbench v23.0.5 (CLC bio, Aarhus, Denmark) and 16 Ceratobasidium species retrieved from NCBI GenBank, including the ITS sequence of the same C. papillatum isolate deposited by CBS (NR_154600). Cantharellus eucalyptorum (JN944001) was used as an outgroup. This dataset was aligned using MAFFT v7.490 (
The C. papillatum genome was 41.00 Mb in size and assembled into 3,091 contigs, of which 2,897 were longer than 1,000 bp. The N50 and N75 values were 23,435 bp and 12,862 bp, respectively, while the L50 and L75 values were 520 and 1,119, respectively. The GC content was 49.01%. Apart from C. papillatum, only one genome from a well-described Ceratobasidium species, C. theobromae, is available in GenBank and Mycocosm. However, genomes from several undescribed Ceratobasidium species are also present in these databases. The genome size of C. papillatum is 10 Mb larger than that of C. theobromae, yet 17–90 Mb smaller than those of the undescribed species. This shows a notable variation in the genome sizes of Ceratobasidium species. BUSCO analysis showed completeness of 93.5%, 92.5% and 79.3% using the Fungi, Basidiomycota and Agaricomycetes datasets, respectively. Notably, the number of predicted genes for C. papillatum (12,614) was found to be smaller than that of other Ceratobasidium species, indicative of the smaller genome size. The phylogenetic position of this isolate, using the genomic copy of the ITS region, confirms that the sequenced genome is C. papillatum (Fig.
Authors: Tiphany Nkomo, Almuth Hammerbacher*, Tanay Bose, Brenda D. Wingfield*
*Contact: Brenda.Wingfield@fabi.up.ac.za; Almuth.Hammerbacher@fabi.up.ac.za
The maximum likelihood tree constructed using ITS sequences of 16 Ceratobasidium species. Type isolates and the sequence extracted from the genome of Ceratobasidium papillatum (CBS 570.83) are suffixed with T and G, respectively. Cantharellus eucalyptorum served as an outgroup. Branch support values are indicated as bootstrap values/posterior probabilities. Support values below 70 and 0.80 were deemed unreliable and, hence, were excluded from the tree.
Polygonatum cyrtonema
Hua. (Duohua Huangjing in Chinese; family Asparagaceae), is an edible medicinal plant widely used in China to treat diabetes and asthma (Pharmacopeia Committee of P. R. China, 2020). The rhizomes are particularly valued as the primary components in various medicines and food. However, wild P. cyrtonema resources cannot meet the increasing demand for health products, such as Huangjing liquor, Huangjing tea, Huangjing noodles, etc., making artificial cultivation a necessity. Unfortunately, many diseases also appeared as amplified with the development of artificial cultivation of P. cyrtonema. Anthracnose is one of the common diseases found on P. cyrtonema and has become a serious threat to P. cyrtonema production. The leaf and stem symptoms include large elliptical, irregular or long strip spots, with pale to dark brown lesions, in some case intermediate white necrosis (
Colletotrichum
is the pathogen responsible for many plant anthracnose and among the most important plant pathogenic fungi worldwide (
Nanping is an important P. cyrtonema-producing region with a total planting area of approximately 1,000 ha. Thus, identifying the Colletotrichum species causing P. cyrtonema anthracnose in Nanping is a key issue. Anthracnose-diseased samples of P. cyrtonema were collected from four cultivation areas in Nanping City, Fujian Province: Chongren, Yushan, Heping, and Huaqiao. Previously, 76 representative Colletotrichum strains were isolated from diseased samples based on morphological characteristics and multilocus phylogenetic analysis. 13 species were identified, with two being novel, and then 12 were first reported in P. cyrtonema. The dominant causal agents of anthracnose in P. cyrtonema were C. spaethianum (31 strains, 40.79%) and C. fructicola (20 strains, 26.32%). C. spaethianum infects leaves, stems, and fruit stalks, while C. fructicola infects leaves, stems, and fruits. The former species is more serious leaves, while the latter is more virulent on stems. So far, the genetic differences behind these species are not clear as there are no available genome resource. Here, we report the genome of one isolate of C. spaethianum Y1_DY3_A (hereafter CsY1), and one isolate of C. fructicola C1_DY2_B (hereafter CfC1) from P. cyrtonema.
Colletotrichum spaethianum : China: Fujian: Huaqiao, isolated from infected leaves of P. cyrtonema, 2021, X. Zhou (Y1_DY3_A).
Colletotrichum fructicola : China: Fujian: Chongren, isolated from infected leaves of P. cyrtonema, 2021, X. Zhou (C1_DY2_B).
All reported genome sequences and genes have been deposited in the Genome Warehouse (GWH, https://ngdc.cncb.ac.cn/gwh, accession number GWHEQUI00000000 (CfC1) and GWHEQUR00000000 (CsY1) at the National Genomics Data Center, China National Center for Bioinformation (CNCB-NGDC Members and Partners 2023). Raw sequence reads are publicly accessible at the Genome Sequence Archive (GSA, https://ngdc.cncb.ac.cn/gsa/, accession number CRA013791) of CNCB-NGDC under the BioProject PRJCA021822.
The isolates of C. spaethianum Y1_DY3_A (CsY1) and C. fructicola C1_DY2_B (CfC1) were deposited in the College of Plant Protection, Fujian Agriculture and Forestry University. The two strains were subcultured on potato dextrose agar (PDA) with 12 h light / 12 h dark at 25 °C for 4 days. The mycelia were collected by vacuum filtration and ground in liquid nitrogen. Genomic DNA was extracted using a rapid fungal genomic DNA isolation kit (Sangon Biotech, Shanghai, China). The purified DNAs were sent to Beijing Novogene Bioinformatics Technology Co. Ltd for library preparation and genome sequencing.
Genome size was estimated in GenomeScope 2.0 (
Protein-coding genes were annotated with BRAKER version 2.1.6 (
We obtained 8.51 Gb (CsY1, approximately 152×, N50 = 12.30 kb) and 18.81 Gb (CfC1, approximately 336×, N50 = 17.73 kb) long reads using the PacBio RSII platform, as well as 4.52 Gb (CsY1, approximately 81×) and 8.90 Gb (CfC1, approximately 149×) 150 bp paired-end (PE150) short reads (fragment size ~350 bp) using the Illumina HiSeq 3000 platform. K-mer based genome size estimation yielded expected genome sizes of 55.82 Mb for CfC1 and 52.97 Mb for CsY1. Genome assembling using PacBio long reads followed by 2-round base error correction using both of long reads and Illumina short reads resulted in the final genome assemblies of 56,767,350 bp for strain CfC1 (GC: 53.16%) and 54,491,462 bp for strain CsY1 (GC: 52.42%), representing 99.88% and 97.92% of the estimated genome sizes, respectively (Table
The final CfC1 genome contains 13 contigs, with N50 and N90 contig lengths of 4,938,816 bp (L50:5) and 4,303,157 bp (L90:9), respectively. Its average contig length is 7,120,155 bp and its maximum contig length is 7,595,691 bp. The final CsY1 genome contains 107 contigs, with N50 and N90 contig lengths of 1,599,232 bp (L50:12) and 319,619 bp (L90:46), respectively. Its average and maximum contig lengths are 509,266 bp and 4,068,025 bp (Table
The assembled CfC1 genome contains 747 (98.55%) complete (744 single, three duplicated) and one fragmented orthologs when compared against the fungi_odb10 lineage (n = 758), as well as 1,666 (97.66%) complete (1,660 single, six duplicated) and two fragmented orthologs compared against the ascomycota_odb10 lineage (n = 1,706). The assembled CsY1 genome contains 740 (97.63%) complete (733 single, seven duplicated) and one fragmented orthologs compared against the fungi_odb10 lineage (n = 758), as well as 1,667 (97.71%) complete (1,653 single, 14 duplicated) and two fragmented orthologs compared against the ascomycota_odb10 lineage (n = 1,706) (Table
Mapping rates of long reads were 99.36% (1,276,006 / 1,284,226 reads) for CfC1 and 99.19% (1,012,391 / 1,020,624 reads) for CsY1); for short reads, they were 99.92% (59,315,969 / 59,362,650 reads, properly paired: 99.02%) for CfC1 and 99.36% (29,999,011 / 30,191,513 reads, properly paired: 97.96%) for CsY1. Assembly quality was further estimated in Merqury revealed average base error rates of 2.09 × 10-5 (QV = 46.81) and 2.31 × 10-4 (QV = 36.36) for CfC1 and CsY1, respectively, along with 99.88% and 97.92% genome completeness (Table
We then de novo annotated and masked 1,983,026 bp (3.49% of genome size) and 4,111,949 bp (7.55%) of repeat sequences for CfC1 and CsY1, respectively. In CfC1 repeats, 1,372,796 bp (69.23%) were interspersed repeats, including short interspersed nuclear elements (SINEs, 5,009 bp), long interspersed nuclear elements (LINEs, 242,286 bp), long terminal repeats (LTRs, 241,689 bp), DNA transposons (491,988 bp), and Unclassified (391,824 bp). In CsY1 repeats, 3,483,650 bp (84.72%) were interspersed, including SINEs (3,657 bp), LINEs (125,572 bp), LTRs (2,405,105 bp), DNA transposons (148,291 bp) and Unclassified (801,025 bp) (Table
We identified 16,928 and 15,753 protein-coding genes in CfC1 and CsY1, respectively, 11,547 (68.21%, CfC1) and 11,456 (72.72%, CsY1) genes in Pfam, 6,837 (40.39%, CfC1) and 6,778 (43.03%, CsY1) genes in GO, 5,121 (30.25%, CfC1) and 6,428 (40.80%, CsY1) in KEGG, as well as 12,378 (73.12%, CfC1) and 12,680 (80.49%, CsY1) in KOG. The Pfam annotations included a set of candidate fungal pathogenicity-related genes, with CfC1 possessing more than CsY1 (Table
Phylogenetic analysis confirmed the identity of the two sequenced isolates (Fig.
Authors: Xianzhi Zhou*, Yuan Wu, Yanping Hu, Hao Yu, Jianjin Zhou, Hongli Hu*
*Contact: xianzhizhou@126.com; huhongli7905@gmail.com
Genome assembly features of Colletotrichum spaethianum strain Y1_DY3_A (CsY1) and Colletotrichum fructicola strain C1_DY2_B (CfC1).
Features | CsY1 | CfC1 |
---|---|---|
PacBio long reads (Gb) | 8.51 | 18.81 |
Illumina short reads (Gb) | 4.52 | 8.90 |
Estimated genome size (bp) | 52,970,789 | 55,821,760 |
Assembly size (bp) | 54,491,462 | 56,767,350 |
Contig number | 107 | 13 |
Contig N50 (bp) | 1,599,232 | 4,938,816 |
Contig N90 (bp) | 319,619 | 4,303,157 |
Average contig length (bp) | 509,266 | 7,595,691 |
Maximum contig length (bp) | 4,068,025 | 7,120,155 |
GC content (%) | 52.42 | 53.16 |
Completeness and accuracy of the genome assemblies of the two Colletotrichum species isolated from Polygonatum cyrtonema.
Methods | CsY1 | CfC1 | |||
---|---|---|---|---|---|
BUSCO | Database | Fungi * | Ascomycota * | Fungi | Ascomycota |
Completeness | 98.55% | 97.66% | 97.63% | 97.71% | |
Single copy BUSCOs | 744 | 1,660 | 733 | 1,653 | |
Duplicated BUSCOs | 3 | 6 | 7 | 14 | |
Fragmented BUSCOs | 1 | 2 | 1 | 9 | |
Missing BUSCOs | 8 | 34 | 9 | 30 | |
Mapping rates | PacBio long reads | 99.19% | 99.36% | ||
Illumina short reads | 99.36% | 99.92% | |||
(97.96% Properly paired) | (99.02% Properly paired) | ||||
Merqury | Average base error | 2.31 × 10-4 | 2.09 × 10-5 | ||
(QV = 36.36) | (QV = 46.81) | ||||
Genome completeness | 97.92% | 99.88% |
Repetitive sequence analysis of the genome of the two Colletotrichum species isolated from Polygonatum cyrtonema.
Repetitive sequence | Features | C. spaethianum Y1_DY3_A | C. fructicola C1_DY2_B | ||
---|---|---|---|---|---|
Length (bp) | Percentage% | Length (bp) | Percentage% | ||
Interspersed repeats | SINEs | 3,657 | 0.01 | 5,009 | 0.01 |
LINEs | 125,572 | 0.23 | 242,286 | 0.43 | |
LTR elements | 2,405,105 | 4.41 | 241,689 | 0.43 | |
DNA transposons | 148,291 | 0.27 | 491,988 | 0.87 | |
Unclassified | 801,025 | 1.47 | 391,824 | 0.69 | |
Tandem repeats | Small RNA | 14,070 | 0.03 | 8,487 | 0.01 |
Satellites | 0 | 0.00 | 14,968 | 0.03 | |
Simple repeats | 556,157 | 1.02 | 531,614 | 0.94 | |
Low complexity | 59,823 | 0.11 | 58,036 | 0.10 | |
Total repeats | 4,111,949 | 7.55 | 1,983,026 | 3.49 |
Gene functional annotation of the genome of the two Colletotrichum species isolated from Polygonatum cyrtonema.
Annotation | C. spaethianum Y1_DY3_A | C. fructicola C1_DY2_B | ||
---|---|---|---|---|
Gene numbers | Percentage% | Gene numbers | Percentage% | |
Protein-coding genes | 15,753 | 100.00 | 16,928 | 100.00 |
Pfam | 11,456 | 72.72 | 11,547 | 68.21 |
GO | 6,778 | 43.03 | 6,837 | 40.39 |
KEGG | 6,428 | 40.80 | 5,121 | 30.25 |
KOG | 12,680 | 80.49 | 12,378 | 73.12 |
CAZys | 279 | 1.77 | 253 | 1.49 |
PHIs | 3,616 | 22.95 | 3,531 | 20.86 |
Cytochrome P450 enzymes | 158 | 1.00 | 279 | 1.65 |
Membrane transport proteins | 481 | 3.05 | 337 | 1.99 |
Putative secreted proteins | 885 | 5.62 | 1,299 | 7.67 |
Effectors | 395 | 2.51 | 677 | 4.00 |
SMBGCs# | 59 | / | 90 | / |
Bayesian inference phylogenetic tree of Colletotrichum sections gloeosporioides and spaethianum indicating the two sequenced isolates (in blue). Monilochaetes infuscans (CBS 869.96) is used as outgroup. The tree was built using concatenated sequences of the ACT, CHS-1, GAPDH, HIS3, ITS and TUB2. Maximum likelihood bootstrap values ≥ 50%. Bayesian posterior probabilities ≥ 0.90 (PP /MLBS) are displayed on the phylogenetic tree.
Pestalotiopsis
, a genus in the Sordariomycetes was reclassified in 2014 into 3 genera (Pestalotiopsis, Neopestalotiopsis and Pseudopestalotiopsis) based on morphology and phylogenetic analysis using the β-tubulin (β-tub), Internal Transcribed Spacer (ITS), and Translation Elongation Factor 1-α (tef1) gene regions (
Neopestalotiopsis macadamiae
represents an important pathogen of macadamia (Macadamia integrifolia and M. tetraphylla) grown in commercial plantations in tropical and subtropical frost-free regions in various countries around the world, including South Africa (
Genome data are a valuable tool to understand the different aspects that can contribute towards developing control strategies for this pathogen. Currently, the pathogen is poorly understood and further research to understand various characteristics such as its plant host interactions, pathogenesis-related gene repertoire, and reproduction is required. Neopestalotiopsis rosae causes leaf blight and crown rot on strawberry and is the only publicly available genome for this important genus (
South Africa: Barberton: Mpumalanga, isolated from Macadamia integrifolia × M. tetraphylla inflorescences, 2022, B. Sonnekus (CMW 64591)
The Whole Genome Shotgun project of Neopestalotiopsis macadamiae CMW 64591 has been deposited at DBJ/EMBL/GenBank under the accession JBEXAB000000000. The version described in this paper is version JBEXAB000000000.
The culture of isolate CMW 64591 was grown on MEA (malt extract agar; Biolab, South Africa) at 25 °C and 90% relative humidity with a 12-h photoperiod for 10 days. Genomic DNA was extracted from mycelia using the Quick-DNA Fungal/Bacterial Miniprep Kit (Zymo Research, California, USA) and submitted to Inqaba Biotechnical Industries (Pty) Ltd (Pretoria, South Africa) for whole genome sequencing using PacBio® Hifi Sequel IIe Sequencer. The quality of the sequenced long reads was assessed using FastQC v0.11.7 (Babraham Bioinformatics, Babraham Institute, Cambridge, UK). Canu v2.0 (
Genes were predicted in the masked assembly with Funannotate predict, which uses Augustus v3.5.0 (
The assembled genome size of N. macadamiae CMW 64591 was 50.05 Mb (for 0.5 kb+ scaffolds), consisting of 10 contigs and an approximated genome coverage of 22×. At 50.05 Mb, the N. macadamiae CMW 64591 genome is slightly smaller than the N. rosae ML1664 genome (53.78 Mb) (
The BUSCO analysis showed a completeness level of 99.4% with respect to the fungal dataset. Slightly more than the 98.4% BUSCO score observed for the N. rosae ML1664 genome (
A total of 14,221 genes was predicted of which 14,012 were functionally annotated. This included 829 CAZymes of which the AA7 and AA3 CAZyme families, which are involved in lignin degradation in plants, were most abundant. ML analysis of three barcoding gene regions grouped the genome isolate close to N. macadamiae isolates obtained from Australia (Fig.
Authors: Byron Sonnekus*, Magriet A. van der Nest, Brenda D. Wingfield, Emma T. Steenkamp, Nicky Creux, Gerda Fourie
*Contact: Byron.Sonnekus@fabi.up.ac.za
Maximum-likelihood (ML) tree topology of Neopestalotiopsis inferred on an alignment of the concatenated sequences of the β-tub, ITS and tef1 gene regions. Pestalotiopsis diversiseta (MFLUCC 12-0287) was used as an outgroup taxon. The ML tree was based on the GTR substitution model with gamma-distribution rate variation. ML bootstrap support values (>75%) and Bayesian inference posterior probabilities (>90%) are displayed at the nodes.
The genus Sphaerellopsis belongs to the family Leptosphaeriaceae in the order Pleosporales. Eudarluca, the typified name for the sexual morph of this genus is treated as a synonym (
Sphaerellopsis filum
is commonly found in uredinia and telia of diverse rust species in the field (
Strikingly, S. filum is closely related to Ampelomyces spp. (
Sphaerellopsis filum : Portugal: isolated from Puccinia hordei on Ornithogalum divergens, 1951, B. d’Oliveira (CBS 235.51 = ATCC 22604).
The annotated genome sequence of Sphaerellopsis filum CBS 235.51 has been deposited at GenBank under the accession JBFTXA000000000.
Sphaerellopsis filum
strain CBS 235.51 was obtained from the Westerdijk Fungal Biodiversity Institute (Utrecht, The Netherlands) and was maintained on potato dextrose agar (Amyl Media, Australia) at 18 °C in darkness. For DNA extraction, the strain was cultured in 20 ml Czapek-Dox broth (BD, Bacto Laboratories, Australia) in a Falcon tube and incubated on a shaker-incubator at 150 rpm for 2 weeks at 18 °C. Harvested mycelia were flash-frozen in liquid nitrogen and lyophilized for 48 hours. Eighty milligrams of lyophilized mycelia were ground with stainless steel beads (2.8 mm diameter, Sigma–Aldrich, Australia) in a Qiagen TissueLyser II (Qiagen, Australia) at 30 Hz/s for 30 s, flash-frozen in liquid nitrogen again and stored at -80 °C until DNA extraction. DNA was extracted using a Qiagen DNeasy Plant Mini Kit (Qiagen, Australia) following a modified protocol as described by
The same DNA sample used for the ITS amplification was used for Illumina library construction and Illumina NovaSeq sequencing (150 bp paired-end [PE]) at the Australian Genome Research Facility (AGRF, Melbourne, Australia).
Total RNA was extracted from two-week-old cultures grown in Czapek-Dox broth and incubated at room temperature on a shaker at 150 rpm. Fresh fungal mycelia were flash-frozen and ground in liquid nitrogen, then used for RNA extraction with an RNeasy Plant Mini Kit (Qiagen, Australia) following manufacturer’s instructions and including column-based DNase treatment. The extracted RNA was assessed via agarose gel electrophoresis and quantified using a Qubit v.4.0 fluorometer (ThermoFisher Scientific, Australia). The RNA transcriptome was sequenced by the AGRF (150 bp PE).
Raw sequence data were screened and filtered for contaminating sequences using Kraken v.2.1.2 (
Using Illumina NovaSeq technology, 34,834,531 paired-end reads (10.5 Gb) were generated, 33,974,265 of which passed contamination and quality filtering (10 Gb). GenomeScope estimated a genome size of 28.9 Mb based on these DNA sequence data. De novo genome assembly of the S. filum strain included a total of 850 scaffolds larger than 250 bp and a draft genome size of 28,451,751 with 300× coverage (Table
The ITS sequence of CBS 235.51 produced using Sanger sequencing (GenBank accession PP972725) was identical to the ITS sequence of CBS 317.68 (GenBank KP170657), the ex-neotype culture of S. filum. The ITS sequence extracted from the Illumina genome sequencing of CBS 235.51 was identical to this sequence. The phylogenetic placement of strain CBS 235.51 based on the ITS sequence is provided in Fig.
This is the first genome assembly of a Sphaerellopsis strain, which revealed a smaller genome size compared to Ampelomyces spp., another group of mycoparasitic fungi within Pleosporales (Table
Authors: Niloofar Vaghefi*, Jason Risteski, Alexander Idnurm, Levente Kiss
*Contact: vaghefin@unimelb.edu.au
Strain | Host species | Sequencing technology | Assembly size (Mb) | Cova | No. of scaffolds >500 bp | N50 (Mb) | GC content (%) | Genome Completeness (%)b | Percentage of AT-rich regionsc | NCBI Accession Number | Reference |
---|---|---|---|---|---|---|---|---|---|---|---|
CBS 235.51 | Puccinia hordei | Illumina paired-end NovaSeq | 28.5 | 300× | 330 | 375,353 | 49.7 | 95.5 | 10.1 | JBFTXA000000000 | This study |
BRIP 72107 | Golovinomyces bolayi | Illumina paired-end MiSeq and Oxford Nanopore MinION | 40.4 | 400× | 24 | 2,994,887 | 45.5 | 96.6 | 33.9 | JAGTXZ000000000 |
|
HMLAC 05119 | Undetermined powdery mildew | Illumina paired-end and mate-pair | 36.8 | 103× | 464 | 258,565 | 46.5 | 96.3 | 26.6 | VOSX00000000.1 |
|
Phylogenetic tree of Sphaerellopsis species with available sequence data, based on maximum likelihood analysis of the ITS region. Analyses were performed in Geneious Prime v.2024.0.5 using RAxML v. 8.2.11 (
Pyrenopeziza brassicae
Sutton and Rawlinson is an air-borne, extracellular fungal pathogen of Brassica species. It has been taxonomically accommodated within the family Ploettnerulaceae of the order Helotiales within the phylum Ascomycota. This pathogen was also known by its anamorph, Cylindrosporium concentricum, for many years; this was initially described by
The pathogen thrives in areas with cool, wet climates and shows a wide geographic distribution, occurring in many regions of the world including the UK, continental Europe, Japan, New Zealand and North America (
Thorough understanding of the underpinning molecular mechanisms and the genetic basis of host-pathogen interactions is a key to successful management of light leaf spot. However, there is little information available on the genes involved in P. brassicae pathogenicity, and no published whole genome sequence of P. brassicae is available at present. Also, a genome sequence can provide new tools to study the population structure of this pathogen, since there may be variations in pathogenicity between isolates from different geographic regions (Karandeni Dewage et al. 2018). According to
Considering its economic and evolutionary importance, we present the de novo whole genome sequence of P. brassicae. This will enable new studies of the biology and genetics of this pathogen, providing knowledge to improve current light leaf spot management strategies and address evolutionary questions.
P. brassicae isolate 15WOSR64-SS1 was obtained from diseased leaves of oilseed rape (Brassica napus) cultivar Bristol from Hereford, UK in 2015 (Karandeni Dewage et al. 2021). The mating type of this isolate was confirmed to be MAT1-1 by mating-type PCR (Foster et al. 2002).
The whole genome sequence of P. brassicae (isolate 15WOSR64-SS1) has been deposited at the European Nucleotide Archive (ENA) under the accession number GCA_958299125.
A single conidial isolate (15WOSR64-SS1) of P. brassicae (Karandeni Dewage et al. 2021) was grown in potato dextrose broth (Oxoid Ltd., England) at 18 °C in a shaking incubator at 120 rpm. Mycelia were harvested from three-week-old cultures and freeze-dried. Genomic DNA was extracted from freeze-dried mycelia using the CTAB DNA extraction method for high molecular weight genomic DNA (
The P. brassicae genome was sequenced using both the Illumina NovaSeq sequencing platform and a single PacBio Sequel SMRT cell at GENEWIZ Europe (Leipzig, Germany). The single paired-end Illumina NovaSeq library was prepared with an insert size of c. 350–450 bp and 150 bp read length. The PacBio DNA library was prepared with an insert size of 20 kbp with Blue Pippin size selection as per the manufacturer’s protocol. The prepared PacBio library was sequenced on the PacBio Sequel platform with v2.0 chemistry. The hybrid de novo assembly was done using the MaSuRCA 3.4.0 genome assembly and analysis toolkit (
To obtain P. brassicae RNA sequencing (RNA-seq) data, fungal mycelia harvested from two liquid cultures of the isolate 15WOSR64-SS1 (grown as described above) were snap frozen with liquid nitrogen and sent to GENEWIZ Europe (Leipzig, Germany), where total RNA was extracted using the RNeasy Plant Mini Kit according to the manufacturer’s protocol. Purity of the extracted RNA was tested with a Nanodrop 2000 spectrophotometer and the concentration was measured using a Qubit™ fluorometer. Further quality assessment was done using an RNA ScreenTape assay, followed by cDNA synthesis, library preparation and sequencing using the Illumina HiSeq sequencing platform at GENEWIZ Europe. Raw Illumina reads were trimmed using skewer v0.2.2 (
Structural annotation of the genome was done at DEEP Seq (Queen’s Medical Centre, University of Nottingham, Nottingham) using the MAKER annotation pipeline v2.31.11 (
To confirm the identity of the newly sequenced genome, phylogenetic analysis was done using nucleotide sequences of the internal transcribed spacers (ITS), translation elongation factor 1 (TEF1) and the beta tubulin genes. Publicly available nucleotide sequences of related fungal species and other P. brassicae isolates were obtained from the National Center for Biotechnology Information (NCBI) database and compared with those extracted from P. brassicae isolate 15WOSR-64SS1. Multiple alignments of ITS, TEF1 and beta tubulin nucleotide sequences were produced using MUSCLE (
The genome sequence of P. brassicae was assembled into 45 scaffolds larger than 1,000 bp with a total genome size estimated to be 73.24 Mb. Of the 45 scaffolds, 38 were larger than 50,000 bp and the PacBio reads provided an estimated genome coverage of 95×. The genome had an average GC content of 42.85%, N50 value of 3,083,431 bp and L50 value of 11. BUSCO analysis against the ascomycota_odb10 dataset indicated that the assembled genome was 98.4% complete, identifying 1,684 out of 1,706 BUSCOs as present in the genome assembly (1,679 complete and single-copy, 2 complete and duplicated, 3 fragmented and 22 missing BUSCOs). BUSCO analysis against the leotiomycetes_odb10 dataset identified 3,181 out of 3,234 BUSCOs as present in the genome assembly (3,168 complete and single-copy, 1 complete and duplicated, 13 fragmented and 53 missing BUSCOs) with the genome completeness score of 97.9%. Table
This genome assembly is the first genome sequence of P. brassicae available. Considering the increased importance of this pathogen in recent years in the UK and North America, this genome resource will help to gain better understanding of P. brassicae and guide the development of more effective and efficient control strategies for light leaf spot. Control of crop diseases plays a key role in maintaining agricultural production for food security. Recently, genomic resources have accelerated the research on host-pathogen interactions, making considerable improvements to crop disease control. Regarding the hosts, the genome sequences of five Brassica species have been made available, including B. rapa (
Authors: Chinthani S. Karandeni Dewage*, Loly I. Kotta-Loizou, Henrik U. Stotz, Bruce D. L. Fitt, Yong-Ju Huang
*Contact: c.s.karandeni-dewage@herts.ac.uk
Metric | Value |
---|---|
Number of contigs | 45 |
Total length (bp) | 73,238,495 |
Largest contig (bp) | 4,725,742 |
GC content (%) | 42.81 |
Number of contigs (>= 1,000 bp) | 45 |
Number of contigs (>= 5,000 bp) | 43 |
Number of contigs (>= 10,000 bp) | 42 |
Number of contigs (>= 25,000 bp) | 39 |
Number of contigs (>= 50,000 bp) | 38 |
N50 (bp) | 3,083,431 |
N75 (bp) | 1,656,915 |
L50 | 11 |
L75 | 18 |
Number of N’s per 100 kbp | 0.14 |
Predicted gene models | 25,717 |
BUSCO completeness (%) | |
ascomycota_odb10 | 98.4 |
leotiomycetes_odb10 | 97.8 |
Phylogenetic analysis of the newly sequenced Pyrenopeziza brassicae and related fungi based on the nucleotide sequences of their (A) internal transcribed spacers (ITS), (B) beta-tubulin and (C) translation elongation factor 1 (TEF1). Phylogenetic analysis was done using nucleotide sequences of five related fungal species: Rhynchosporium commune (isolate IDs - 765.03.01, R157, UK7), Oculimacula acuformis (isolate IDs - 22-418, 22-443, 22-498, 22-499, CBS 495.80), Oculimacula yallundae (isolate IDs - 22-495, CBS 128.31, CBS 110665), Leptosphaeria maculans (isolate IDs - AFTOL-ID 277, Pk4), L. biglobosa (15PL-42, Azad4, Khal12) and of P. brassicae (isolate IDs – 233716, Cyc001, Cyc007, PC18, PC38). A multiple alignment of nucleotide sequences was produced using MUSCLE (
The authors of the section on Sphaerellopsis would like to thank Dr Alexandros Georgios Sotiropoulos (University of Southern Queensland, Australia) for constructive discussions.
The authors of the section on Colletotrichum thank the College of Plant Protection, Fujian Agriculture and Forestry University for providing the laboratory and computational facility used in our research.
Brenda D. Wingfield is a Senior Editor for IMA Fungus. The authors have declared that no competing interests exist.
No ethical statement was reported.
Funding was received from the European Union’s Horizon 2020 research and innovation program (RISE) under the Marie Skłodowska-Curie grant agreement No. 101008129, project acronym “Mycobiomics”. The present work was supported by either Regione Campania PSR Campania 2014–2020, sottomisura 16.1 Azione 2 and Test Center authorized by Ministerial Decree 36000, 4th February 2004. This project was supported by the University of Melbourne and the Centre for Crop Health of the University of Southern Queensland, partly through Discovery Project DP210103869 funded by the Australian Research Council.
The authors of the section on Pyrenopeziza brassicae acknowledge funding support from UK Biotechnology and Biological Sciences Research Council (grant/award number: BBSRC/BB/P00489X/1) and Innovate UK (grant/award number: 102641).
This work was supported by grants from the Fujian Provincial Public Welfare Research Project (2021R1034002), Science and Technology Innovation Team Construction Project of FAAS (CXTD2021014-2), and the “5511” Collaborative Innovation Project of High-quality Agricultural Development and Surpassment in Fujian Province (XTCXGC2021003).
D.D. and R.S. conceptualization; D.D. and R.S. wrote the original draft; R.S., M.V., L.C. and G.F. carried out laboratory work; D.D. performed bioinformatic analysis; E.L. and N.D revised and edited the manuscript; E.L. and N.D supervised the work; D.C. and E.L. funding acquisition.
XZ developed the project and obtained the funding, designed, and performed bioinformatic analysis, interpretation of data, wrote and reviewed the manuscript. YW, YH, HY and JZ performed the fungal subculture, genomic DNA extraction and purification. HH conceived and designed experiments, and performed bioinformatic analysis, interpretation of data, wrote and reviewed the manuscript. All authors read and approved the final manuscript.
Davide D’Angelo https://orcid.org/0009-0007-6542-6298
Tiphany Nkomo https://orcid.org/0000-0003-3355-5080
Xianzhi Zhou https://orcid.org/0000-0002-3734-075X
Niloofar Vaghefi https://orcid.org/0000-0003-0430-4856
Byron Sonnekus https://orcid.org/0000-0002-7228-4810
Tanay Bose https://orcid.org/0000-0002-2069-042X
Nicky Creux https://orcid.org/0000-0002-4179-6995
Nunzio D’Agostino https://orcid.org/0000-0001-9840-3817
Gerda Fourie https://orcid.org/0000-0003-2650-5448
Almuth Hammerbacher https://orcid.org/0000-0002-0262-2634
Alexander Idnurm https://orcid.org/0000-0001-5995-7040
Levente Kiss https://orcid.org/0000-0002-4785-4308
Emma T. Steenkamp https://orcid.org/0000-0003-0217-8219
Magriet A. van der Nest https://orcid.org/0000-0001-6914-8343
Chanthani S. Karandeni Dewage https://orcid.org/0000-0001-6621-8203
Brenda D. Wingfield https://orcid.org/0000-0002-6189-1519
All of the data that support the findings of this study are available in the main text or Supplementary Information. This includes cited data base accessions.
List of protein-coding genes identified in A. rolfsii strain CCF1 and their annotations
Data type: xlsx