Research Article |
Corresponding author: Shu Zhang ( zhangshu@sxu.edu.cn ) Corresponding author: Yong-Jie Zhang ( zhangyj2008@sxu.edu.cn ) Academic editor: Marc Stadler
© 2025 Jia-Ni Li, Shu Zhang, Yong-Jie Zhang.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Li J-N, Zhang S, Zhang Y-J (2025) Multi-omics insights into growth and fruiting body development in the entomopathogenic fungus Cordyceps blackwelliae. IMA Fungus 16: e147558. https://doi.org/10.3897/imafungus.16.147558
|
Cordyceps blackwelliae is an entomopathogenic fungus with significant potential for research and development due to its ease of cultivation. However, the lack of omics-based studies has limited our understanding of the molecular mechanisms governing its growth and fruiting body development. This study employed a multi-omics approach, integrating genomic, transcriptomic and metabolomic analyses. Utilising both Illumina and Nanopore sequencing technologies, we assembled a 31.06 Mb nuclear genome comprising 11 scaffolds, with telomere presence at one or both ends in eight scaffolds and annotated 8,138 identified genes (8,136 from genome prediction and two from local BLAST searches). Transcriptomic analysis identified 2,078 differentially expressed genes across three developmental stages: liquid culture mycelia, wheat culture mycelia and fruiting bodies. Amongst these, 745 genes were up-regulated in fruiting bodies, primarily associated with biosynthetic and catabolic pathways. Metabolomic analysis identified 1,161 metabolites, with 1,014 showing significant variations across developmental stages. Integrated transcriptomic and metabolomic analyses uncovered 17 genes positively correlated with 34 metabolites, which are likely crucial regulators of fruiting body development. These findings provide new insights into the molecular networks underlying C. blackwelliae growth and fruiting body formation.
Cordyceps blackwelliae, fruiting body development, genome, transcriptome, metabolome
The genus Cordyceps comprises approximately 180 recognised species of entomopathogenic fungi that infect insects (
Cordyceps blackwelliae is an entomopathogenic fungus first described in 2018 (
Throughout its cultivation cycle, C. blackwelliae undergoes distinct morphological and physiological adaptations across three developmental stages. During the liquid culture mycelium (LM) phase, the fungus forms dense, filamentous biomass characterised by rapid growth and high antioxidant capacity, making it ideal for industrial fermentation and metabolic investigations (
A multi-omics strategy integrating genomic, transcriptomic and metabolomic analyses provides a comprehensive framework for systematically investigating the molecular mechanisms underlying fungal growth and fruiting body development. Genomics enables the construction of high-quality reference genomes, facilitating the identification of key genes and regulatory elements involved in fungal growth and development (
Fungal growth and development are governed by complex, multi-level regulatory systems. We hypothesise that the growth and fruiting body development of C. blackwelliae are modulated at both the transcriptional and metabolic levels. To test this hypothesis, we performed whole genome sequencing of C. blackwelliae and conducted multi-omics analyses (transcriptomics and metabolomics) across three key developmental stages: LM, WM and FB. Specifically, this study aims to: 1) assemble and annotate the first nuclear genome of C. blackwelliae; 2) investigate transcriptional patterns across different developmental stages; 3) identify key metabolites associated with fungal growth and fruiting body formation; and 4) explore the relationship between differentially expressed genes (DEGs) and differential metabolites (DMs). This research provides a foundational understanding of the intrinsic molecular mechanisms regulating C. blackwelliae growth and fruiting body development, paving the way for future studies on fungal morphogenesis and bioactive compound production.
The Cordyceps blackwelliae strain ZYJ0835 used in this study was isolated from a fresh specimen collected from Guangdong, China in 2019 and deposited at the China Center for Type Culture Collection (CCTCC) under no. M2022663 (
Genome sequencing was performed by Majorbio Bio-Pharm Technology Co., Ltd. (Shanghai, China) using both Illumina NovaSeq 6000 (PE150) and Nanopore PromethION R9.4 platforms. Illumina data underwent quality control filtering by fastp v.0.23.0 (
The aforementioned annotation methodology effectively identified APN2 (encoding a DNA lyase), one of the genes commonly located adjacent to mating-type (MAT) genes. However, the MAT genes (MAT1-1-1 and MAT1-2-1) and SLA2 (encoding a cytoskeleton assembly control protein) were not annotated. To locate these genes within the genome of C. blackwelliae, local BLAST searches were conducted with corresponding genes of C. militaris (GenBank accession nos. KP721256 for MAT1-1-1, KP721273 for MAT1-2-1 and XM_006671662 for SLA2) as queries. To assess the expression of the identified MAT genes in C. blackwelliae, RNA sequencing (RNA-seq) and real-time quantitative reverse transcription PCR (RT-qPCR) analyses were conducted, following the methodologies outlined below.
Structural comparisons at the MAT locus were performed across nine additional Cordyceps species/strains, including C. cateniannulata (FRD 24 and MBC 23), C. chanhua (ZJ1611, BA-001 and CC02), C. fumosorosea (ARSEF 2679), C. javanica (F084), C. militaris (ATCC 34164) and C. tenuipes (YF17002). All sequences were retrieved from GenBank (Suppl. material
To investigate the evolution of MAT genes in Cordyceps, nucleotide sequences of MAT1-1-1, MAT1-2-1, as well as the concatenated sequences of APN2 and SLA2, were used for phylogenetic analyses. The nine Cordyceps species/strains mentioned earlier, along with C. blackwelliae ZYJ0835, were designated as the ingroup, with Beauveria asiatica ARSEF 4384 serving as the outgroup. Following sequence alignment by MUSCLE (
Two datasets were used for phylogenomic analyses. Dataset I comprised 3,488 single-copy nuclear genes, which were extracted using BUSCO v.5.8.0 with the hypocreales_odb10 database (
Complementary DNA (cDNAs) libraries were constructed by enriching mRNA from total RNA using polyT oligo-attached magnetic beads and then reverse-transcribed into cDNA. Subsequently, the cDNA libraries were sequenced by Novogene Technology Co., Ltd. (Beijing, China) on an Illumina NovaSeq platform with PE150 reads. Raw sequencing data underwent filtration using fastp v.0.23.0 (
Alternative splicing (AS) events were analysed using rMATS v.4.1.0 (
To confirm the reliability of RNA-seq data, four DEGs (gene 6087, gene 0120, gene 1814 and gene 1408) and two non-DEGs (gene 5222 and gene 0234) were randomly selected and their expression levels were quantified through RT-qPCR. Complementary DNA were synthesised from 100 ng of mRNA, enriched as described above, using the TransScript® One-Step gDNA Removal and cDNA Synthesis SuperMix (TransGen Biotech, Beijing, China) in a 20 μl reaction mixture that included 1 μl of TransScript® RT/RI Enzyme Mix, 10 μl of 2×TS Reaction Mix, 1 μl of Anchored Oligo(dT)18 Primer, 1 μl of gDNA Remover, 2 μl of RNase-free water and 5 μl of mRNA. RNA concentration and purity (i.e. OD260/OD280) were measured using NanoDropTM One Microvolume UV-Vis Spectrophotometers (Thermo Scientific, USA). RT-qPCR was performed on a CFX ConnectTM Real-Time PCR system (Bio-Rad, USA). The RT-qPCR reaction mixture (in 20 μl) consisted of 10 μl of 2× PerfectStart® Green qPCR SuperMix, 0.4 μl of each primer (10 μM), 3 μl of diluted cDNA (200 ng/μl) and 6.2 μl of nuclease-free water. The RT-qPCR programme included an initial denaturation step at 95°C for 30 s, followed by 40 cycles of 10 s at 95°C, 20 s at 56°C and 15 s at 72°C. All reactions were conducted in biological triplicates. Fluorescence signals were recorded during the extension stage and a melting curve was set to detect the specificity of the amplified products. Relative gene expression was evaluated using the 2-ΔΔCT method (Livak et al. 2001). The 18S rRNA was utilised as an internal reference, which has been previously validated as a stable internal reference gene in C. blackwelliae (
The R package WGCNA was employed to conduct a weighted correlation analysis based on all genes (Langfelder et al. 2008). An adjacency matrix was calculated with the optimal soft-thresholding power (β) (Suppl. material
Samples of LM, WM and FB were subjected to grinding in liquid nitrogen to produce fine powders suitable for Ultra-High-Performance Liquid Chromatography coupled with Tandem Mass Spectrometry (UHPLC-MS/MS) analysis. This analysis was conducted using a Vanquish UHPLC system (Thermo Fisher, Germany) in conjunction with an Orbitrap Q ExactiveTM HF mass spectrometer (Thermo Fisher, Germany). Samples were injected on to a Hypersil Gold column (100 × 2.1 mm, 1.9 μm) using a 12-min linear gradient at a flow rate of 0.2 ml/min. For a positive polarity mode, analyses were performed with (A) 0.1% formic acid in water and (B) methanol. For a negative polarity mode, mobile phases comprised (A) 5 mM ammonium acetate (pH = 9.0) and (B) methanol. The separation process followed the gradient: 2% B, 1.5 min; 2%–85% B, 3 min; 85%–100% B, 10 min; 100%–2% B, 10.1 min; and 2% B, 12 min. The Q ExactiveTM HF mass spectrometer was operated in positive/negative polarity mode with a spray voltage set at 3.5 KV, a capillary temperature of 320°C, a sheath gas flow rate of 35 psi, an aux gas flow rate of 10 l/min, an S-lens RF level of 60 and an aux gas heater temperature of 350°C.
The raw data files obtained from UHPLC-MS/MS were processed using Compound Discoverer 3.3 (Thermo Fisher, Germany) for data pretreatments, which included peak alignment, peak feature extraction and quantification. Subsequently, the identified peaks were matched against mzCloud (https://www.mzcloud.org/), mzVault (Thermo Fisher, Germany) and MassList (Novogene, China) databases. Statistical analyses were performed using R v.3.4.3 and Python v.2.7.6. The metabolites were annotated using HMDB (https://hmdb.ca/metabolites) and LIPIDMaps (http://www.lipidmaps.org/) databases. Partial Least Squares Discriminant Analysis (PLS-DA) was performed using metaX (
To investigate the correlation between DEGs and DMs, Pearson’s correlation coefficient between gene expression and metabolite abundance were calculated using the mixOmics package in R (
To establish a reference genome for C. blackwelliae, Illumina and Nanopore sequencing technologies were employed to generate short and long reads, respectively. These sequencing efforts yielded 6.55 Gb of Illumina data (21,697,168 paired reads, 211× coverage) and 5.37 Gb of Nanopore data (623,214 reads, 173× coverage). Except for a single scaffold (i.e. Scaffold10) representing the mitochondrial genome, the nuclear genome was assembled into 11 scaffolds, totally 31.06 Mb, with an N50 value of 4.14 Mb (Table
Statistics of genome assembly and gene prediction of C. blackwelliae ZYJ0835.
Item | Value | Item | Value |
---|---|---|---|
Genome size (bp) | 31,062,909 | No. of putative genes | 8,138† |
Sequence coverage | >173× | Total length of protein-coding genes (bp) | 19,606,463 |
No. of scaffolds | 11 | Gene average length (bp) | 2256.47 |
Largest scaffold size (bp) | 7,729,812 | Gene density (per Mb) | 280 |
N50 (bp) | 4,140,478 | No. of tRNA | 135 |
GC content (%) | 53.86 | Repeat rate (%) | 0.05 |
Complete BUSCOs (%) | 99.30 | No. of secondary metabolite clusters | 41 |
Missing BUSCOs (%) | 0.70 | No. of CAZymes | 255 |
Complete CEGMA (%) | 95.56 | No. of secreted proteins | 713 |
A total of 8,136 genes were predicted, including 7,969 protein-coding genes, 129 tRNA genes and 38 rRNA genes. Functional annotations using NR, GO, KEGG, COG and Pfam databases revealed that 1,585 genes were annotated across all databases and 6,435 genes had annotation in at least two databases (Suppl. material
To investigate C. blackwelliae fruiting body formation, the MAT locus structure was analysed. The genome contained a complete MAT1-2-1 gene and a truncated MAT1-1-1 gene, a pattern conserved across ten Cordyceps genomes (Fig.
Structural and evolutionary analysis of the MAT locus in C. blackwelliae: a schematic representation of the MAT1-2 locus across various Cordyceps species/strains. The APN2 gene, which encodes a DNA lyase and the SLA2 gene, which encodes a protein involved in cytoskeleton assemble control, flank the MAT genes. Gene orientations are indicated by arrows. Please refer to Suppl. material
To further clarify the potential functions of MAT1-2-1 and the truncated MAT1-1-1 gene in the regulation of fruiting body formation, gene expression profiles were examined across three different developmental stages. The MAT1-2-1 gene exhibited differential expression (read counts 0–122 and FPKM values 0–8.67), with the lowest expression levels observed at the FB stage, while expression levels increased by 8.2-fold and 4.2-fold at the LM and WM stages, respectively (Suppl. material
To ascertain whether the different MAT genes exhibit divergent evolutionary trajectories, phylogenetic analyses were conducted, based on nucleotide sequences of MAT1-2-1, the truncated MAT1-1-1 or concatenated sequences of SLA2 and APN2. The topologies of the three phylogenetic trees were found to be basically congruent, consistently placing C. blackwelliae in close relationship to C. chanhua and C. tenuipes (Fig.
To elucidate the phylogenetic placement of C. blackwelliae within the family Cordycipitaceae, phylogenetic trees were constructed, based on 3,488 nuclear single-copy orthologs and 14 mitochondrial core protein-coding genes, respectively. Both analyses robustly clustered Cordyceps species together (Fig.
Phylogenomic analyses based on both nuclear and mitochondrial genomes. The nuclear phylogeny (left) was constructed, based on 3,488 single-copy orthologs, while the mitochondrial phylogeny (right) was constructed, based on 14 core protein-coding genes. Detailed information regarding the fungal taxa and their corresponding accession numbers can be found in Suppl. material
To elucidate the molecular mechanisms underlying fruiting body development, a total of 15 RNA-seq libraries were generated from three growth stages (i.e. LM, WM and FB), each with five biological replicates. The sequencing process yielded 631 million raw reads; following quality control and cleaning procedures, 601 million clean reads were retained, resulting in an average of 40.1 million reads per replicate (Suppl. material
To identify genes that may play a crucial role in mycelial growth and fruiting body development, three comparative analyses were conducted: WM vs. LM, FB vs. WM and FB vs. LM. The analyses revealed that 2,078 genes exhibited differential expression with a |log2FoldChange| > 2 (Suppl. material
Gene expression profiles across three developmental stages: a clustering heat map of DEGs. Each column represents a sample, and each row corresponds to a gene. The FPKM values are indicated by a colour gradient, with red indicating high expression and green indicating low expression. The results of gene clustering are displayed as the dendrograms on the left; b–d volcano plot of DEGs between different developmental stages, with red dots indicating up-regulated genes, green dots indicating down-regulated genes and blue dots representing genes without statistically significant expression variation. The two vertical dashed lines represent |log2FoldChange| of 2, while the horizontal dashed lines represent the Padj value thresholds of 0.05; e C. blackwelliae samples at three different developmental stages that were used to perform RNA-seq and metabolomic analyses.
To identify pathways associated with phenotypes, GO and KEGG enrichment analysis was performed on the 2,078 DEGs. The findings indicated a significant overexpression of genes linked to DNA repair and recombination, including those involved in “Base excision repair”, “Mismatch repair” and “Homologous recombination”. This overexpression is crucial for maintaining genomic stability during rapid cell division and growth, particularly at the LM stage, where the mycelia exhibited rapid growth, forming dense and homogeneous mycelial pellets (Fig.
Alternative splicing (AS) events were examined across the three developmental stages (Suppl. material
WGCNA was utilised to construct a gene co-expression network and to identify hub genes implicated in fungal development, based on 8,136 initially predicted genes and 553 novel genes. The analysis revealed 26 distinct gene modules (Fig.
WGCNA investigation of modules associated with growth and fruiting body development: a display of original and merged modules within the clustering tree. The upper section of the diagram illustrates the gene clustering tree within the network. Each branch of the hierarchical tree or each vertical line in the accompanying colour bar corresponds to an individual gene. The middle section represents the original modules generated through dynamic tree clipping. The merged colours at the bottom indicate merged modules with a dissimilarity coefficient of less than 0.25. Genes not assigned to any modules are depicted in grey; b heatmap of correlations between modules and samples as indicated by a colour gradient, with red indicating a positive correlation and blue indicating a negative correlation between modules and samples. The turquoise, magenta and black modules are positively correlated with the LM, WM and FB stages, respectively; c–e gene expression patterns of the turquoise (c), magenta (d) and black (e) modules, with specific gene information available in Suppl. material
In order to elucidate the metabolic dynamics associated with morphogenesis and maturation, an analysis utilising UHPLC-MS/MS was performed. A comparative assessment of the total ion chromatograms indicated that, although the types of metabolites present across the three developmental stages were comparable, there were notable differences in the concentrations (Suppl. material
To characterise the global metabolic differences across the three developmental stages, PLS-DA was conducted. The first two principal components of the PLS-DA score plot accounted for over 70% of the total variance, clearly distinguishing between developmental stages. Additionally, the R2Y and Q2Y values approached one, confirming the model’s accuracy and reliability (Suppl. material
To better understand the pivotal genes and metabolites implicated in the differentiation of fruiting bodies, we analysed the similarities in gene expression and metabolite profiles across the three developmental stages. KEGG co-enrichment analysis revealed that both DMs and DEGs were significantly enriched in the pathways of “Nitrogen metabolism” and “Lysine biosynthesis” when comparing FB to LM (Fig.
KEGG pathway enrichment analysis of DEGs and DMs across the three distinct comparisons. The horizontal axes indicate gene ratios, which reflect the proportion of DEGs/DMs relative to the total number of genes/metabolites associated with a specific KEGG pathway. The size of the data points corresponds to gene counts and the colour represents the -log10 P-value.
For a comprehensive understanding of the transitions associated with fruiting body differentiation, DEGs and DMs from the FB vs. WM comparison were mapped on to the metabolic pathways using iPath. The results were consistent with the GO and KEGG enrichment analyses of DEGs, indicating that pathways related to purine, amino sugar, nucleotide sugar, glycerophospholipid, starch, sucrose, phenylalanine and steroid metabolism and biosynthesis are essential for fruiting body development (Suppl. material
Furthermore, a correlation plot was generated to illustrate the relationship between DEGs and DMs in the FB vs. WM comparison. This analysis identified two distinct groups of genes regulating metabolites. Group 1 consisted of 17 genes (including gene 0752 and gene 1433), exhibiting a similar regulatory pattern, while Group 2 contained the remaining 33 genes displaying an opposing regulatory pattern (Fig.
Correlation analysis between DEGs and DMs in the FB vs. WM comparison: a the figure shows the pairwise correlations between the top 50 DMs and the top 50 DEGs, sorted by P-values from the smallest to the largest. The intensity of the colour indicates the strength of the correlation, with red hues representing strong positive correlations and blue hues indicating strong negative correlations. Correlation values are more pronounced when the ellipses are flatter and statistically significant correlations (P < 0.05) are denoted with an asterisk (*); b, c based on the results shown in panel a, metabolite-regulating genes were categorised into two groups. Group 1 comprises 17 genes that show a positive correlation with 34 metabolites (Table
Metabolites significantly related to fruiting body development according to transcriptomic and metabolomic correlation analysis.
Metabolite name | Formula | Class | log2FC† | P-value | VIP‡ |
LPC 16:2§ | C24H46NO7P | Lipids and lipid-like molecules | 4.353 | 2.19E-13 | 1.792 |
LPC 17:2 | C25H48NO7P | Lipids and lipid-like molecules | 5.015 | 7.56E-12 | 1.840 |
MGDG O-24:4_16:0| | C49H88O9 | Lipids and lipid-like molecules | 6.441 | 1.65E-11 | 1.639 |
Trans-3-Indoleacrylic acid | C11H9NO2 | Organoheterocyclic compounds | 1.863 | 5.18E-11 | 1.764 |
Indole | C8H7N | Organoheterocyclic compounds | 1.497 | 1.51E-10 | 1.767 |
LPC 17:2-SN1 | C25H48NO7P | Lipids and lipid-like molecules | 3.635 | 1.82E-10 | 1.702 |
FAHFA 16:0/18:2¶ | C34H62O4 | Lipids and lipid-like molecules | 5.684 | 1.99E-10 | 1.652 |
Lithocholic Acid | C24H40O3 | Lipids and lipid-like molecules | 3.642 | 7.27E-10 | 1.828 |
DL-Tryptophan | C11H12N2O2 | Organoheterocyclic compounds | 2.023 | 1.43E-09 | 1.271 |
LPC 16:2-SN1 | C24H46NO7P | Lipids and lipid-like molecules | 3.806 | 2.95E-09 | 1.752 |
FAHFA 18:2/18:2 | C36H62O4 | Lipids and lipid-like molecules | 6.151 | 3.67E-09 | 1.675 |
FAHFA 18:2/17:2 | C35H60 O4 | Lipids and lipid-like molecules | 5.266 | 6.21E-09 | 1.719 |
LPC 16:1 | C24H48NO7P | Lipids and lipid-like molecules | 5.464 | 2.92E-08 | 1.817 |
HKK# | C18H33N7O4 | Organic acids and derivatives | 2.706 | 3.57E-08 | 1.728 |
LPE 16:2†† | C21H40NO7P | Lipids and lipid-like molecules | 2.545 | 3.86E-08 | 1.702 |
3-Methylindole | C9H9N | Organoheterocyclic compounds | 1.337 | 4.47E-08 | 1.708 |
2-[2-(2-furyl)vinyl]-4H-3,1-benzoxazin-4-one | C14H9NO3 | Organoheterocyclic compounds | 4.895 | 5.23E-08 | 1.810 |
FAHFA 18:1/20:3 | C38H66O4 | Lipids and lipid-like molecules | 6.412 | 9.61E-08 | 1.533 |
Aldosterone | C21H28O5 | Lipids and lipid-like molecules | 2.282 | 1.04E-07 | 1.791 |
Ethyl 1-(4-acetyl-2-aminophenyl) piperidine-4-carboxylate | C16H22N2O3 | Organoheterocyclic compounds | 4.980 | 1.08E-07 | 1.687 |
Obscurolide A1 | C15H17NO5 | Benzenoids | 4.535 | 1.13E-07 | 1.795 |
LPS 16:0‡‡ | C22H44NO9P | Lipids and lipid-like molecules | 8.025 | 1.58E-07 | 1.831 |
FAHFA 18:2/20:4 | C38H62O4 | Lipids and lipid-like molecules | 2.577 | 1.63E-07 | 1.368 |
Ursolic acid | C30H48O3 | Lipids and lipid-like molecules | 3.210 | 1.72E-07 | 1.713 |
LPS 15:0 | C21H42NO9P | Lipids and lipid-like molecules | 5.723 | 3.36E-07 | 1.830 |
Glycyl-L-leucine | C8H16N2O3 | Organic acids and derivatives | 1.279 | 3.59E-07 | 1.774 |
ST 28:1;O;Hex;FA 18:2 | C52H88O7 | Lipids and lipid-like molecules | 2.559 | 3.86E-07 | 1.333 |
Octadec-9-ynoic acid | C18H32O2 | Organic acids and derivatives | 3.265 | 4.69E-07 | 1.718 |
Oleanolic acid | C30H48O3 | Organic acids and derivatives | 2.811 | 5.30E-07 | 1.749 |
PE 18:0§§ | C23H46NO8P | Lipids and lipid-like molecules | 3.876 | 5.58E-07 | 1.687 |
15-OxoEDE | C20H34O3 | Lipids and lipid-like molecules | 6.576 | 6.57E-07 | 1.746 |
ERH|| | C17H28N8O6 | Organic nitrogen compounds | 2.566 | 6.82E-07 | 1.319 |
Lysopc 18:2 | C26H50NO7P | Lipids and lipid-like molecules | 3.318 | 7.01E-07 | 1.798 |
FAHFA 19:2/18:2 | C37H64O4 | Lipids and lipid-like molecules | 4.476 | 7.49E-07 | 1.839 |
This study provides insights into the genomic characteristics and potential evolutionary adaptations of C. blackwelliae. The genome size of 31.06 Mb, within the established range of available Cordyceps genomes (30.10–34.97 Mb) (Suppl. material
Genomic analyses indicated that the MAT1-2 mating type locus in C. blackwelliae comprises the complete MAT1-2-1 gene and a truncated version of MAT1-1-1, which has lost its N-terminal α-box domain, a pattern also reported in C. chanhua (
Comparative transcriptome analysis revealed stage-specific gene expression patterns associated with fungal development. At the LM stage, genes associated with DNA repair and recombination, as well as membrane component were highly expressed, supporting rapid cell division, biomass accumulation and adaptation to the liquid environment. The WM stage was characterised by the up-regulation of genes related to enzyme activity and co-factor binding, enhancing mycelial nutrient acquisition and supporting robust growth on solid substrates. During the FB stage, genes involved in various biosynthetic and catabolic processes were significantly overexpressed, facilitating morphological and physiological transitions required for fruiting body development. These findings align with transcriptomic studies on other Cordyceps species (
Key regulatory genes identified at each developmental stage further highlight functional specialisation. The hub genes at the LM stage were closely associated with key phenotypic traits, including rapid growth, branching, nutrient utilisation and stress adaptation. For example, the ARP3 (gene 3273) gene promotes hyphal apical formation and non-polar growth in Neurospora (
Metabolomic analysis identified 1,161 metabolites across three distinct developmental stages of C. blackwelliae. These metabolites were primarily categorised as lipids, organic acids, amino acids, nucleosides, carbohydrates, vitamins and alkaloids, aligning with previous metabolomic studies on Cordyceps (
A conjoint transcriptomic and metabolomic analysis identified 17 genes and 34 metabolites likely involved in fruiting body development. Amongst these genes, gene 0752, which encodes a putative DSBA-like thioredoxin domain protein, is associated with redox regulation (
This study provides the first reference genome for C. blackwelliae, laying a foundation for future functional studies. Integrated transcriptomic and metabolomic analyses identified 17 genes and 34 metabolites likely involved in fruiting body development. However, their precise roles in C. blackwelliae’s life cycle remain to be validated. Future research should focus on elucidating the specific functions of these genes and metabolites in fungal growth and development.
We thank Majorbio Bio-Pharm Technology Co., Ltd. (Shanghai, China) for providing Illumina and Nanopore genome sequencing and Novogene Co., Ltd. for providing transcriptomic and metabolomic services. We are also grateful to the High-Performance Simulation Platform of Shanxi University for providing computing resources.
The authors have declared that no competing interests exist.
No ethical statement was reported.
All the fungal strains used in this study have been legally obtained, respecting the Convention on Biological Diversity (Rio Convention).
This study was supported by the National Natural Science Foundation of China (31872162, 32370014), the Fundamental Research Program of Shanxi Province (202203021221036), the Open Fund Project of Key Laboratory of Microbial Resources Collection and Preservation, Ministry of Agriculture and Rural Affairs (KLMRCP2023-01) and the ‘Sanjin Talents’ Plan of Shanxi Province.
Y.J.Z. and S.Z. designed research; J.N.L. performed research and analysed data; J.N.L., S.Z. and Y.J.Z. wrote the paper. All authors have read and agreed to the published version of the manuscript.
Shu Zhang https://orcid.org/0009-0009-3779-1700
Yong-Jie Zhang https://orcid.org/0000-0002-6331-2189
The genome assembly of Cordyceps blackwelliae ZYJ0835 has been deposited at NCBI with an accession no. JBKZWQ000000000. RNA-seq data have been deposited at NCBI with a BioProject no. PRJNA1214394.
Supplementary tables S1–S16
Data type: xlsx
Explanation note: table S1. Genome information of Cordyceps species/strains with available whole genome sequences. table S2. Cordyceps species/strain and corresponding scaffolds harbouring MAT genes. table S3. Fungal taxa used in nuclear phylogenomic analyses. table S4. Fungal taxa used in mitochondrial phylogenetic analyses. table S5. Primers used in RT-qPCR assays. table S6. Statistics for scaffold length and telomere repeats in the genome assembly of Cordyceps blackwelliae. table S7. Summary of annotations for the Cordyceps blackwelliae genome. table S8. Transcriptional expression of MAT1-2-1 and the truncated MAT1-1-1 in Cordyceps blackwelliae. table S9. Summary of sequencing data of the Cordyceps blackwelliae transcriptome and mapping rates. table S10. List of novel genes identified by RNA-seq. table S11. DEGs across three groups of samples when |log2FoldChange| > 2. table S12. Description of extremely significant DEGs in volcano plot. table S13. GO and KEGG enrichment analysis of DEGs. table S14. Genes with developmental stage-specific alternative splicing. table S15. Genes of turquoise, magenta and black modules and hub genes. table S16. Statistics of metabolites present in Cordyceps blackwelliae.
Supplementary figures S1–S16
Data type: docx
Explanation note: figure S1. Soft threshold determination of WGCNA. figure S2. Statistics of functional annotations for the Cordyceps blackwelliae genome. figure S3. Functional annotation of the Cordyceps blackwelliae genome. figure S4. Principal component analysis (PCA) and Venn diagram of DEGs in RNA-seq. figure S5. Expression heatmap of development related genes in Cordyceps blackwelliae. figure S6. Validation of RNA-seq results by RT-qPCR analyses. figure S7. Detailed sashimi plots of genes exhibiting developmental stage-specific alternative splicing (FRD < 0.05). figure S8. GO term enrichment analysis for genes undergoing alternative splicing. figure S9. The total ion chromatogram of three stages in both positive (a) and negative (b) ion modes. figure S10. Functional annotation of Cordyceps blackwelliae metabolites. figure S11. Multivariate statistics in three comparisons. figure S12. Venn diagram of DMs by pairwise comparison. figure S13. Heatmap depicting the differential metabolic profile of amino acids. figure S14. Relative content of corey lactone diol across three developmental stages. figure S15. Interactive pathway analysis of Cordyceps blackwelliae in the FB vs. WM comparison. figure S16. Heatmap of 34 metabolites positively correlated with Group1 genes in three stages.