Research Article |
Corresponding author: Flávia Rogério ( flavia.rogerio@usal.es ) Corresponding author: Michael R. Thon ( mthon@usal.es ) Academic editor: Sajeewa Maharachchikumbura
© 2025 Flávia Rogério, Cock Van Oosterhout, Stéphane De Mita, Francisco Borja Cuevas-Fernández, Pablo García-Rodríguez, Sioly Becerra, Silvia Gutiérrez-Sánchez, Andrés G. Jacquat, Wagner Bettiol, Guilherme Kenichi Hosaka, Sofia B. Ulla, Jürg Hiltbrunner, Rogelio Santiago, Pedro Revilla, José S. Dambolena, José L. Vicente-Villardón, Ivica Buhiniček, Serenella A. Sukno, Michael R. Thon.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Rogério F, Van Oosterhout C, De Mita S, Cuevas-Fernández FB, García-Rodríguez P, Becerra S, Gutiérrez-Sánchez S, Jacquat AG, Bettiol W, Hosaka GK, Ulla SB, Hiltbrunner J, Santiago R, Revilla P, Dambolena JS, Vicente-Villardón JL, Buhiniček I, Sukno SA, Thon MR (2025) Long-distance gene flow and recombination shape the evolutionary history of a maize pathogen. IMA Fungus 16: e138888. https://doi.org/10.3897/imafungus.16.138888
|
The evolutionary history of crop pathogens is shaped by a complex interaction of natural and anthropogenic factors. The fungus Colletotrichum graminicola causes maize anthracnose which results in significant yield losses worldwide. We conducted a comprehensive investigation into the evolutionary genomics of C. graminicola using a collection of 212 isolates from 17 countries across five continents. Genomic analyses supported the existence of three geographically isolated genetic lineages, with a significant pattern of isolation by distance. We identified two distinct gene flow patterns, driven by short- and long-distance dispersal, likely resulting from the natural spread of the pathogen and the exchange of contaminated seeds. We present evidence of genetic introgression between lineages, suggesting a long history of recombination. We identified significant recombination events coalescing at distinct points in time, with the North American lineage displaying evidence of the most ancient recombination. Demographic modelling has indicated that North America is an intermediate between Brazil, Europe and an ancestral, unsampled source population, which is hypothesised to be Mesoamerican. Our analyses revealed that the global genomic structure of C. graminicola is shaped by geographic differentiation driven by long-distance migration and a long history of recombination and introgression. We show historical relationships amongst these lineages, identifying a potential route for fungal spread, with the North American population emerging ancestrally, followed sequentially by the Brazilian and European populations. Our research indicates that the European lineage is more virulent, which has implications for the potential emergence of new outbreaks of maize anthracnose in Europe.
Colletotrichum graminicola, genetic introgression, isolation by distance (IBD), population genomics, restriction site-associated DNA sequencing (RAD-seq), whole-genome sequencing (WGS)
Examining the processes that drive species evolution over time is a central focus of evolutionary genomics. Population genetics provides valuable insights into patterns of genetic variation within populations, enhancing our comprehension of their evolutionary history and dynamics (
The genetic composition of pathogen populations is shaped by evolutionary forces and interactions with hosts and local the environmental conditions (
The fungus Colletotrichum graminicola is an important model for studying plant-pathogen interactions, being the pioneer amongst Colletotrichum species for genome sequencing (
Population genomics studies of many crop pathogens have demonstrated that human activities (domestication, trade and migration) play pivotal roles in the emergence and spread of plant diseases (
Fungi exhibit a range of reproductive strategies involving genetic recombination outside conventional sexual reproduction, such as hyphal fusion (anastomosis) and parasexual cycles (
Gene flow and recombination lead to genomes with admixed ancestry, a process that is commonly referred to as genetic admixture (
In this study, we focused on unravelling the evolutionary dynamics shaping the genetic structure of C. graminicola. Specifically, we addressed key questions: What drives the genetic structure in C. graminicola populations? What role does genetic introgression play in its evolutionary history? What are the sources and historical relationships of its lineages? Our investigation employed a global collection of 212 isolates from 17 countries, enabling a comprehensive analysis of genetic exchanges at the genome scale and determination of the evolutionary origin of C. graminicola lineages. By addressing these questions, our study contributes to a deeper understanding of the evolutionary genomics of C. graminicola, with implications for disease management.
In total, 212 isolates of C. graminicola were obtained from field samples and public culture collections from seventeen countries (Suppl. material
The RAD-seq reads were demultiplexed and quality filtered via the process_radtags module of the software Stacks v.2.10 (
Repeated multilocus genotypes (here referred to as clones) were identified via the function mlg.filter with a threshold determined via the cutoff_predictor tool on the basis of Euclidean distance via the package poppr v.2.8.6 (
To further understand the impact of geographic subdivision on population structure, we conducted isolation-by-distance analyses. We computed pairwise genetic dissimilarity and geographical distance between samples and examined their relationships via regression analysis. The correlation between the matrices of genetic and geographic distance was also observed via a simple Mantel test with 1000 permutations, as implemented in the vegan package in R (v.2.6.4). Genome variability was assessed through nucleotide diversity statistics computed via egglib v.3.3.2 (
To examine the impact of genetic exchanges on tree topologies, we constructed a consensus tree using densitree 2 (
Recombination was analysed in multiple genome alignments by chromosome, including non-variant sites. We used the module genotypegvcs in GATK to include genotypes from all positions present in the reference genome and bcftools v.10.2 (
Recombination events were also dated through Bayesian Inference in Beast 2. We selected some recombinant blocks detected by hybridcheck and applied a Bayesian coalescent approach for dating. XML files were created using beauti, where the module bmodeltest was used to determine the optimal substitution model under a mutation rate of 10−8 and a strict clock rate. We assumed a model of a coalescent constant population with default priors and a chain length of 1 × 107 generations. Each dating analysis was replicated five times. trace was used for visualising posterior parameters. The tree height statistic, which represents the marginal posterior distribution of the age of the root of the entire tree, was considered the divergence time from the common ancestor (tMRCA) for the set of sequences evaluated.
For dating, we assumed a generation time of 1 year. Note that the reported dates are likely an overestimate of the true age of recombination. In other words, the “true” recombination event may be more recent than our inferred estimate. There are two ways dating estimates can be biased, leading to overestimating the true date of introgression. First, there is an overestimation because the descendants of the actual donor and recipient sequences are unlikely to have been sampled.
Consequently, the introgressed regions look more divergent (and thus “older”). Second, even if the direct descendants of the donor sequence were included in our analysis, they may have diverged from the block in the recipient block owing to subsequent recombination. This may have introduced multiple nucleotide polymorphisms into the introgressed region. Given that we assume that divergence is driven by the mutation rate (i.e. a molecular clock), polymorphisms introduced by recombination would make the divergence appear older (see
To characterise genes present in the introgressed regions, we extracted the transcripts and proteins from the corresponding regions in the reference genome (
We inferred the evolutionary history of C. graminicola genetic lineages using Approximate Bayesian Computation (ABC), based on random forest (RF), conducted in the Python package Pyabcranger (
Pathogenicity assays were performed by in vivo leaf blight assays, following
Additionally, the isolates tested in the virulence assay were divided into groups, based on sampling year and genetic structure. The isolates were classified into three categories, based on genetic group and virulence (less, equal or more virulent than isolate M1.001) and subjected to Fisher’s exact test. The isolates were also grouped by year into three categories (before 2012, between 2012 and 2016 and after 2016) and subjected to a Kruskal–Wallis test. The statistical tests and figures were implemented in the R.
Finally, we examined whether there was a positive correlation between pairwise genetic distance between isolates and similarity in their level of virulence. Genetic distance was calculated as the difference in the number of bases in pairwise comparisons. The similarity in virulence level was assessed by comparing the difference between lesion areas (calculated as the lesion area of isolate A minus the lesion area of isolate B). The correlation between pairwise genetic distance and similarity in virulence was analysed using a Mantel test with 1000 permutations, as implemented in the vegan package in R (v.2.6.4).
RAD-Seq Restriction site-associated DNA sequencing
WGS Whole Genome Shotgun Sequencing
NA North American
EU European
BR Brazilian
AMOVA Analysis of Molecular Variance
We characterised the global genetic structure of C. graminicola via a total of 212 isolates, based on 7,207 biallelic SNPs were distributed across 13 chromosomes, with an average of 1.3 SNPs per 10 kb (Suppl. material
Population subdivision of Colletotrichum graminicola based on 7,207 single-nucleotide polymorphisms in the clone-corrected dataset. A Neighbour-net network showing relationships between isolates, visualised in a circular tree via itol v. 6 (
Population substructuring and admixture were analysed in further detail using the non-parametric method implemented in snmf. On the basis of the cross-entropy criterion, which was confirmed by the Bayesian Information Criterion, the model with K = 3 clusters best captures the population substructure of the whole dataset (Suppl. material
The relationships amongst all the genotypes visualised in a neighbour phylogenetic network (Suppl. material
The consensus tree generated by densitree, based on 1291 SNPs on chromosome 1, revealed many different topologies. This representation further corroborated the inferred population subdivision into three groups, with individuals from different lineages consistently separated by consensus branches (Suppl. material
Isolation-by-distance (IBD) analyses revealed a positive relationship between genetic distance and geographic distance. The Mantel test revealed a significant correlation between the genetic and geographic matrices (R2 = 0.3607, P < 0.001). Regression analyses, including models with logarithmic, square root and quadratic transformations, demonstrated a positive relationship, with geographic distance explaining up to 35.80% of the genetic diversification observed (quadratic regression F2,40142 = 11196.59, P < 0.001, R2 = 0.3580) (Suppl. material
Recombination on chromosome 1 was analysed via rdp4, with a focus on fifteen isolates representative of each lineage (North America: LAR318, NRRL13649, I-618151, A-52621-1, CA-CHAT-1; Brazil: BR-85955-2, BR-98290-1, BR-85925-1, M5.001, JAB2; European: F-64330-2, ARG-X5196-1, P-7565-072-8, CR-34543-1, CBS11373). Similar results were found on the other chromosomes (Suppl. material
Recombination analysis of chromosome 1. A Left: Sequence similarity visualised through an RBG colour triangle by hybridcheck (involving the isolates BR-85955-2, P-7565-072-8 and I-61851, respectively). The significant recombination event detected by the rpd4 software is enclosed in a dashed box; right: age distribution of recombinant blocks detected by hybridcheck; B similar analyses of the BR, EU and NA lineages using the isolates M5.001, CR-34543-1 and A-52621-1, respectively.
We performed an additional recombination analysis to further investigate the variation in introgression age at the intracontinental scale. Isolates from the same lineage were organised into three further triplets: triplet 3 – M5.001:BR-98290-1:BR-85925-1 (Brazilian lineage), triplet 4 – F-64330-2:F-64330-7:F-64330-13B (European lineage) and triplet 5 – NRRL13649:I-61851:CA-CHAT-1 (North America lineage) (Suppl. material
To apply the Bayesian dating method, we selected three recombination blocks within each triplet identified by hybridcheck (for the blocks selected, refer to Suppl. material
Additionally, we characterised genes present in the introgressed regions; however, transcripts were recovered only for a few events (Suppl. material
We analysed the evolutionary history of the isolates, assuming three genetic lineages, North American (NA), Brazilian (BR) and European (EU), as independent populations. In total, we analysed 22 demographic models (Suppl. material
Worldwide sampling of Colletotrichum graminicola. A Map showing a likely colonisation route of C. graminicola reconstructed by Approximate Bayesian Computation (ABC) using 208 isolates from 17 countries. Different colours indicate distinct lineages and arrows indicate divergence events: (1) first divergence event from NA to BR; (2) second divergence event from NA to EU; B the most strongly supported demographic model was selected based on random forest population voting (see Suppl. material
We detected significant differences in virulence amongst the fifty-three isolates evaluated in an analysis of variance test (ANOVA, P < 0.0001). Sidäk’s comparison-of-means test revealed that, compared with M1.001, 16 isolates presented significantly greater virulence, whereas 14 isolates presented significantly reduced virulence compared to the reference strain M1.001 (Fig.
The isolates were also classified according to genetic lineage (NA, EU and BR) and sampling year. Differences in virulence between lineages were not statistically significant, according to a Kruskal-Wallis test, although greater variability was observed in the European group (Suppl. material
The Mantel test performed between pairwise genetic distance and pairwise similarity in virulence revealed no correlation between these variables (R = -0.006257, P = 0.47053). Interestingly, the isolates SW-8046-1 and SW-8046-2 exhibited (nearly) identical genotypes. Yet, they displayed contrasting virulence values (Fig.
A global collection of Colletotrichum graminicola field isolates from 17 countries allowed a comprehensive investigation into the evolutionary history of this important maize pathogen. We generated and analysed a broad dataset with a higher resolution than our previous study (
These lineages exhibit different levels of overall genetic diversity, as indicated by estimated genomic indices, suggesting independent evolutionary trajectories (Suppl. material
Natural gene flow in fungi tends to be mediated through the transport of spores. Members of the genus Colletotrichum typically produce asexual spores (conidia) in acervuli immersed in mucilaginous masses disseminated through water splashes, accounting for short-range dispersal (Madden 1997). In contrast, sexual spores (ascospores) are associated with medium- to long-distance dissemination, as they can be ejected upwards and dispersed by air currents, reaching longer distances (
Human-mediated transport is a major driver of the long-distance dissemination of plant pathogens, acting as an artificial gene flow mechanism in scenarios where fungal spores may not naturally reach (
We detected migration between Argentina and Europe, where isolates from Argentina were grouped within the European lineage. This suggests some form of genetic exchange between these geographically distant regions, facilitated by long-distance gene flow. The role of contaminated seeds as sources of inoculum, promoting the movement of pathogens, has been well established in this pathosystem. In this study, we proposed the hypothesis that winter nurseries in South America, which are frequently employed in breeding programmes, might serve as a potential source for the introduction of the pathogen into new regions. Furthermore, there seems to be a recurring pattern of maize germplasm movement between international breeding companies and experimental stations in Argentina. A case in point is the experimental station “La Josefina S.A”, in Mercedes, Argentina, which provided winter nursey service from 1985–2014, collaborating with numerous (10–15 companies/institutes) maize breeding programmes, including European (Murad 2023, personal communication). This station routinely handles maize seeds from international companies working with germplasm from Europe, the USA and Argentina. Consequently, it regularly receives and returns seeds to the original breeding programmes, potentially contributing to the long-distance dissemination of the pathogen.
We identified signatures of genetic introgression at the genome-wide level between lineages, supported by statistically significant recombination events. The examination of nucleotide similarities between sequences uncovered a mosaic-like genome structure, in which different segments exhibit distinct genetic ancestries. The recombination analysis performed with rdp4 revealed a few large recombinant blocks, which contrasts with the results of hybridcheck which displayed numerous smaller, fragmented blocks. Through visualising nucleotide similarities, we observed a pattern where smaller blocks appeared to be later fragmented by recurrent recombination. Multiple mutations may have accumulated after ancient recombination (i.e. before divergence), giving rise to the observed pattern of fragmented blocks. In essence, these small regions might be remnants of a common ancient introgression event, where subsequent recombination fragmented the originally larger block.
Importantly, subsequent recombination may also increase the genetic divergence between the donor and recipient sequences, making the introgressed regions appear more divergent and hence “older”. We are, therefore, cautious in our interpretation of the actual estimated divergence time dates. Nevertheless, we are confident in our interpretation that the introgression events observed in the North American lineage date further back in time than those in the other two lineages (i.e. Europe and Brazil). Our recombination analysis detected events that took place as recently as 6100 years ago, as well as potentially very old events (> 100,000 years ago). These more recent occurrences are likely indicative of genetic exchanges that have occurred in a relatively recent timeframe if not an ongoing process. Overall, our introgression investigation revealed a prolonged history of genetic exchange.
Sexual reproduction has been observed under laboratory conditions, suggesting an absence of prezygotic reproductive barriers between genetically divergent strains (
We propose that C. graminicola populations may have undergone intense recombination before genetic divergence. Repeated sexual recombination cycles over time could have led to the development of a mosaic-like genome structure, as evidenced by our analyses. Ancient hybridisation events have been observed in the wheat powdery mildew pathogen Blumeria graminis sp. tritici, characterised by admixture between divergent strains resulting in a mosaic genome. Chromosomal segments inherited from both parental sources exhibit fragmented segments, likely produced by backcrossing (
Regarding the evolutionary origin of these lineages, our demographic inference indicates that the North American lineage emerged before the Brazilian and European populations. Age estimates of recombinant blocks detected within lineages further support this finding. Dating the blocks involving isolates from North America revealed that this lineage is older compared to those blocks from other lineages. Moreover, the deeper branches in the phylogenetic tree and the higher level of genetic diversity are all consistent with the North American lineage predating the European and Brazilian lineages.
North America seems to act as an intermediate between the source population and the rest of the world, suggesting a potential colonisation route of the fungus. Our results strongly indicate a significant contribution of an unsampled population as the ancestral source. Given the widely accepted hypothesis that maize domestication originated in Mexico (
Recent studies have indicated that maize migrated out of Mexico in two waves at different time points (
Establishing the origin of emerging infectious diseases is both challenging and important. Identifying the ancestral population reveals the standing genetic variation that could introgress into the outbreak lineages. We encourage future studies to comprehensively survey across the Mesoamerica Region, as well as South America, which would further advance our understanding of how this pathogen has adapted and spread to other parts of the world. A genomic comparison between the genetic variation in the source population and population worldwide would enable the examination of genes and genomic regions that experienced strong selection in invasive strains. Such an analysis could shed light on the role of recombination and hybridisation, which become more powerful when there is a greater presence of ancestral variation.
C. graminicola shares several characteristics with another significant maize pathogen, Setosphaeria turcica, which is the cause of northern corn leaf blight. Evidence of sexual recombination in S. turcica has been reported in a population genetic study by
Finally, we observed great variability in virulence among the isolates tested, suggesting potential fungal adaptive evolution. A significant, albeit subtle, relationship was found between genetic groups and virulence levels, with the European group exhibiting increased virulence compared with the strain collected during the U.S. outbreak. This level of virulence changed over time, raising concerns about potential new outbreaks of maize anthracnose, particularly in European cultivation. We found that two pairs of isolates with nearly identical genotypes presented marked differences in virulence, emphasising that adaptive evolution may be attributed to factors other than genetic diversity, such as environmental influences or epigenetic modifications. Additionally, it is well known that minor variations in single genes such as effector encoding genes can have marked effects on virulence. Further studies using these genotypes with contrasting virulence phenotypes may be used to elucidate the genetic basis of virulence in pathogens, contributing to a better understanding of plant diseases and the development of durable control strategies.
We have observed two different gene flow patterns, which are likely the result of the natural spread of the pathogen and the exchange of contaminated seeds. Our findings indicate genetic introgression between lineages, revealing a long history of recombination. We have identified significant recombination events occurring at specific points in time, with the North American lineage showing evidence of the most ancient recombination. According to demographic modelling, North America is positioned between Brazil, Europe and an unsampled ancestral source population, which is believed to be Mesoamerican. Our analyses have shown that the global genomic structure of C. graminicola is influenced by geographic differentiation driven by long-distance migration and a long history of recombination and introgression.
The authors would like to thank the Supercomputing and Bioinnovation Center (SCBI) of the University of Malaga (http://www.scbi.uma.es/site) and the Core Cluster of the Institut Français de Bioinformatique (IFB) (ANR-11-INBS-0013) for their provision of computational resources and technical support. We also wish to thank numerous colleagues who contributed to this work by providing samples of maize showing symptoms of anthracnose. We also thank Lucía Rodríguez Mónaco for her assistance during the virulence analysis.
The authors have declared that no competing interests exist.
No ethical statement was reported.
All the fungal strains used in this study have been legally obtained, respecting the Convention on Biological Diversity (Rio Convention).
This research was supported by Grants RTI2018-093611-B-100 and PID2021-125349NB-100 from the MCIN of Spain AEI/10.13039/501100011033 and the European Regional Development Fund (ERDF). F.R. was supported by grant FJC2020-043351-I financed by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR and by a FEMS Research and Training Grant (2851—2023). C.V.O. is funded by the Earth and Life Systems Alliance (ELSA), Norwich Research Park, UK. F.B.C.F. was supported by grant BES-2016-078373, funded by MCIN/AEI/10.13039/501100011033 and by Margarita Salas fellowship R.USAL-28/07/2022 funded by the Ministry of Universities of Spain/NextGenerationEU/PRTR. S.B. and P.G.R. were supported by a fellowship programme from the regional government of Castilla y León and ERDF. S.G.-S. was supported by grant Programa Investigo BOE-B-2023-22031 funded by the Ministry of Labor and Social Economy of Spain/NextGenerationEU/PRTR.
S.A.S., M.R.T. and F.R. designed the study and designed the project; S.A.S., F.R. and S.G.-S. performed fungal isolation and DNA extraction; F.R., M.R.T., C.V.O., S.D.M., G.K.H. and J.L.V.V. performed the analyses; F.R., M.R.T., C.V.O., S.D.M. and S.A.S. wrote the manuscript; F.R., P.G.R., S.B., S.G.S., F.B.C.F. and S.A.S. performed the pathogenic characterisation; A.G.J., W.B., S.B.U., J.H., R.S., P.R., J.S.D., J.L.V. and I.B. collected the samples. All the authors reviewed the manuscript.
Flávia Rogério https://orcid.org/0000-0001-7801-5112
Cock Van Oosterhout https://orcid.org/0000-0002-5653-738X
Stéphane De Mita https://orcid.org/0000-0003-2752-865X
Francisco Borja Cuevas-Fernández https://orcid.org/0000-0002-6543-5094
Pablo García-Rodríguez https://orcid.org/0000-0001-9730-8412
Sioly Becerra https://orcid.org/0000-0003-2564-1982
Silvia Gutiérrez-Sánchez https://orcid.org/0009-0002-4389-4801
Andrés G. Jacquat https://orcid.org/0000-0001-6811-9604
Wagner Bettiol https://orcid.org/0000-0001-7724-6445
Guilherme Kenichi Hosaka https://orcid.org/0000-0001-8353-1138
Rogelio Santiago https://orcid.org/0000-0001-5036-3975
Pedro Revilla https://orcid.org/0000-0002-4826-6073
José S. Dambolena https://orcid.org/0000-0001-8690-6564
José L. Vicente-Villardón https://orcid.org/0000-0003-1416-6813
Ivica Buhiniček https://orcid.org/0000-0002-7743-8954
Serenella A. Sukno https://orcid.org/0000-0003-3248-6490
Michael R. Thon https://orcid.org/0000-0002-7225-7003
The raw Illumina RAD-seq and WGS reads were submitted to GenBank and their accession numbers are listed in Suppl. material
Supplementary tables
Data type: xlsx
Explanation note: table S1: Isolates of Colletotrichum graminicola used in this study; table S2: Repeated multilocus genotypes removed from the dataset with 212 Colletotrichum graminicola isolates; table S3: Summary of genomic diversity within Colletotrichum graminicola lineages; table S4: Membership proportion of each isolate for Colletotrichum graminicola genetic lineages considering number of cluster (K) equal to 3; table S5: List of the recombination blocks detected in the RDP4 software under chromossome 1. Table shows the assigned event number, the recombinant and minor parent, the predicted start and end positions each block relative to the referenced contig, and the p values for the different recombination detection methods; table S6: Estimated dates of introgression events from triplet1 (BR-85955-2; P-7565-072-8; I-61851), triplet 2 (M5.001; CR-34543-1; A-52621-1), triplet 3 (M5.001: BR-98290-1: BR-85925-1) , triplet 4 (F-64330-2: F-64330-7: F-64330-13), and triplet 5 (NRRL13649: I-61851: CA-CHAT-1) obtained by HYBRIDCHECK algorithm dating; table S7: Estimated dates of recombinant blocks detected by HYBRIDCHECK using BEAST software; table S8: Description of summary statistics employed on ABC demographic inference; table S9: Description of the demographic models tested; table S10: Classification of isolate in each genetic group into three categories based on their virulence compared to the control isolate M1.001. Values represent the percentage of isolates in each category; table S11: Funtional annotation of genes in the introgression regions detected between Colletotrichum graminicola lineages; table S11: Funtional annotation of genes in the introgression regions detected between Colletotrichum graminicola lineages; table S13: Untransformed virulence data.
Supplementary methods
Data type: docx
Supplementary figures
Data type: docx
Explanation note: figure S1: Manhattan plot of the distribution of the SNPs along the chromosomes; figure S2: Population subdivision of Colletotrichum graminicola; figure S3: Neighbor-net network showing relationships between isolates of Colletotrichum graminicola identified based on the clone-correct dataset; figure S4: Population subdivision on European lineage of Colletotrichum graminicola; figure S5: Densitree cloudogram based on 203 samples across chromosomes; figure S6: Isolation-by-distance plot showing the correlation between pairwise genetic dissimilarity and geographical distance (in km); figure S7: Recombination analysis across chromosomes for isolates M5.001:CR-34543-1:A-52621- and BR-85955-2:P-7565-072-8: I-61851; figure S8: Recombination analysis of chromosome 1 for isolates M5.001:BR-98290-1:BR-85925-1 (triplet 3), F-64330-2:F-64330-7:F-64330-13 (triplet 4) and NRRL13649:I-61851:CA-CHAT-1 (triplet 5); figure S9: Age distribution of recombinant blocks between lineages; figure S10: Demographic models tested; figure S11: The fit of the data used in the ABC analysis; figure S12: Results of model choice from ABC analysis; figure S13: Pathogenic characterization of Colletotrichum graminicola isolates; figure S14: Bar plot showing the distribution of the virulence for the three genetic lineages.