|
| 1 | +# Enterovirus D68 dataset with reference Fermon |
| 2 | + |
| 3 | +| Key | Value | |
| 4 | +|----------------------|-----------------------------------------------------------------------| |
| 5 | +| authors | [Nadia Neuner-Jehle](https://eve-lab.org/people/nadia-neuner-jehle/), [Alejandra González-Sánchez](https://www.vallhebron.com/en/professionals/alejandra-gonzalez-sanchez), [Emma B. Hodcroft](https://eve-lab.org/people/emma-hodcroft/), [ENPEN](https://escv.eu/european-non-polio-enterovirus-network-enpen/) | |
| 6 | +| name | Enterovirus D68 | |
| 7 | +| reference | [AY426531.1](https://www.ncbi.nlm.nih.gov/nuccore/AY426531.1) | |
| 8 | +| workflow | https://github.com/enterovirus-phylo/nextclade_d68 | |
| 9 | +| path | `enpen/enterovirus/ev-d68` | |
| 10 | +| clade definitions | A–C (D) | |
| 11 | + |
| 12 | +## Citation |
| 13 | + |
| 14 | +If you use this dataset in your research, please cite: |
| 15 | + |
| 16 | +> Neuner-Jehle, N., González Sánchez, A., Hodcroft, E. B., & European Non-Polio Enterovirus Network (ENPEN). (2025). *enterovirus-phylo/nextclade_d68: Enterovirus D68 Nextclade Dataset v1.0.0* (v1.0.0--2025-11-18). Zenodo. https://doi.org/10.5281/zenodo.17642338 |
| 17 | +
|
| 18 | +[](https://doi.org/10.5281/zenodo.17642338) |
| 19 | + |
| 20 | +## Scope of this dataset |
| 21 | + |
| 22 | +Based on full-genome sequences, this dataset uses the **Fermon reference sequence** ([AY426531.1](https://www.ncbi.nlm.nih.gov/nuccore/AY426531.1)), originally isolated in 1962. It serves as the basis for quality control, clade assignment, and mutation calling across global EV-D68 diversity. |
| 23 | + |
| 24 | +*Note: The Fermon reference differs substantially from currently circulating strains.* This is common for enterovirus datasets, in contrast to some other virus datasets (e.g., seasonal influenza), where the reference is updated more frequently to reflect recent lineages. |
| 25 | + |
| 26 | +To address this, the dataset is *rooted* on a Static Inferred Ancestor — a phylogenetically reconstructed ancestral sequence near the tree root. This provides a stable reference point that can be used, optionally, as an alternative for mutation calling. |
| 27 | + |
| 28 | +## Features |
| 29 | + |
| 30 | +This dataset supports: |
| 31 | + |
| 32 | +- Assignment of subgenotypes |
| 33 | +- Phylogenetic placement |
| 34 | +- Sequence quality control (QC) |
| 35 | + |
| 36 | +## Subgenogroups of Enterovirus D68 |
| 37 | + |
| 38 | +Clade designations follow the global diversity of EV-D68: A (A1–A2/D), B (B1–B3), and C. The label "pre-ABC" indicates old, basal lineages that are no longer circulating. Sequences labeled "pre-ABC" or "unassigned" may indicate sequencing or assembly issues and should be assessed carefully. |
| 39 | + |
| 40 | +These designations are based on the phylogenetic structure and mutations, and are widely used in molecular epidemiology, similar to subgenotype systems for other enteroviruses. Unlike influenza (H1N1, H3N2) or SARS-CoV-2, there is no universal, standardized global lineage nomenclature for enteroviruses. Naming follows conventions from published studies and surveillance practices. |
| 41 | + |
| 42 | +## Reference types |
| 43 | + |
| 44 | +This dataset includes several reference points used in analyses: |
| 45 | +- *Reference:* RefSeq or similarly established reference sequence. Here Fermon. |
| 46 | + |
| 47 | +- *Parent:* The nearest ancestral node of a sample in the tree, used to infer branch-specific mutations. |
| 48 | + |
| 49 | +- *Clade founder:* The inferred ancestral node defining a clade (e.g., A2, B3). Mutations "since clade founder" describe changes that define that clade. |
| 50 | + |
| 51 | +- *Static Inferred Ancestor:* Reconstructed ancestral sequence inferred with an outgroup, representing the likely founder of EV-D68. Serves as a stable reference. |
| 52 | + |
| 53 | +- *Tree root:* Corresponds to the root of the tree, it may change in future updates as more data become available. |
| 54 | + |
| 55 | +All references use the coordinate system of the Fermon sequence. |
| 56 | + |
| 57 | +## Issues & Contact |
| 58 | +- For questions or suggestions, please [open an issue](https://github.com/enterovirus-phylo/nextclade_d68/issues) or email: eve-group[at]swisstph.ch |
| 59 | + |
| 60 | +## What is a Nextclade dataset? |
| 61 | + |
| 62 | +A Nextclade dataset includes the reference sequence, genome annotations, tree, clade definitions, and QC rules. Learn more in the [Nextclade documentation](https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html). |
0 commit comments