Include vaccine strains in trees
Explicitly add vaccine strains to genome tree and N450 tree, following up on #23

These strains currently end up in the trees due to our subsampling parameters and lack of other sequences from 1954, but this commit explicitly adds them.
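The difference between "kept by chance through subsampling" and "explicitly included" can be sketched in a few lines of Python. This is a simplified model only, assuming that a force-include list is applied after subsampling and exclusion; it is not the filtering tool's actual implementation, and `filter_strains` and its parameters are hypothetical names used for illustration.

```python
def filter_strains(candidates, subsampled, exclude, include):
    """Simplified model of filtering with a force-include list.

    candidates: all strain IDs present in the metadata
    subsampled: strain IDs picked by subsampling
    exclude:    strain IDs to drop
    include:    strain IDs to keep regardless of subsampling
                (assumption: include takes precedence over the other filters)
    """
    available = set(candidates)
    # Strains that survive ordinary subsampling and exclusion.
    kept = (set(subsampled) & available) - set(exclude)
    # Force-included strains are added back, but only if they exist at all.
    forced = set(include) & available
    return sorted(kept | forced)


if __name__ == "__main__":
    result = filter_strains(
        candidates=["AF266288", "AF266287", "X1", "X2"],
        subsampled=["X1", "X2"],  # the vaccine strains were not drawn this time
        exclude=["X2"],
        include=["AF266288", "AF266287"],
    )
    print(result)  # ['AF266287', 'AF266288', 'X1']
```

In this model the vaccine strains reach the tree even when a different subsampling draw, or additional 1954 sequences, would otherwise have displaced them.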
kimandrews committed Apr 19, 2024
1 parent 6dc1603 commit a81915e
Showing 5 changed files with 24 additions and 2 deletions.
2 changes: 2 additions & 0 deletions phylogenetic/defaults/config.yaml
@@ -1,6 +1,8 @@
 strain_id_field: "accession"
 files:
   exclude: "defaults/dropped_strains.txt"
+  include_genome: "defaults/include_strains_genome.txt"
+  include_N450: "defaults/include_strains_N450.txt"
   reference: "defaults/measles_reference.gb"
   reference_N450: "defaults/measles_reference_N450.gb"
   reference_N450_fasta: "defaults/measles_reference_N450.fasta"
9 changes: 9 additions & 0 deletions phylogenetic/defaults/include_strains_N450.txt
@@ -0,0 +1,9 @@
+# Vaccine strain information from Parks et al. Comparison of predicted amino acid
+# sequences of measles virus strains in the Edmonston vaccine lineage
+# https://doi.org/10.1128/jvi.75.2.910-920.2001
+AF266288
+AF266287
+AF266290
+AF266289
+AF266291
+AF266286
9 changes: 9 additions & 0 deletions phylogenetic/defaults/include_strains_genome.txt
@@ -0,0 +1,9 @@
+# Vaccine strain information from Parks et al. Comparison of predicted amino acid
+# sequences of measles virus strains in the Edmonston vaccine lineage
+# https://doi.org/10.1128/jvi.75.2.910-920.2001
+AF266288
+AF266287
+AF266290
+AF266289
+AF266291
+AF266286
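Both include files share the same layout: `#` comment lines followed by one accession per line. A minimal Python sketch for reading such a list and checking it against the metadata is shown below. It assumes that `#` starts a comment and that blank lines are ignored; `read_strain_list` and `missing_from_metadata` are hypothetical helpers, not part of the workflow.

```python
def read_strain_list(text):
    """Return strain IDs from an include/exclude list.

    Assumption: '#' starts a comment (whole-line or trailing) and
    blank lines are ignored.
    """
    strains = []
    for line in text.splitlines():
        entry = line.split("#", 1)[0].strip()
        if entry:
            strains.append(entry)
    return strains


def missing_from_metadata(strains, metadata_ids):
    """Return listed strains that do not appear in the metadata IDs."""
    known = set(metadata_ids)
    return [s for s in strains if s not in known]


if __name__ == "__main__":
    include_text = """\
# Vaccine strain information from Parks et al.
AF266288
AF266287
AF266290
AF266289
AF266291
AF266286
"""
    strains = read_strain_list(include_text)
    print(len(strains))  # 6
    # Hypothetical metadata IDs, for illustration only.
    print(missing_from_metadata(strains, ["AF266288", "AF266287"]))
    # ['AF266290', 'AF266289', 'AF266291', 'AF266286']
```

A check like this would flag an accession in the include list that is absent from `data/metadata.tsv`, which could otherwise go unnoticed.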
3 changes: 2 additions & 1 deletion phylogenetic/rules/prepare_sequences.smk
@@ -42,7 +42,8 @@ rule filter:
     input:
         sequences = "data/sequences.fasta",
         metadata = "data/metadata.tsv",
-        exclude = config["files"]["exclude"]
+        exclude = config["files"]["exclude"],
+        include = config["files"]["include_genome"]
     output:
         sequences = "results/genome/filtered.fasta"
     params:
3 changes: 2 additions & 1 deletion phylogenetic/rules/prepare_sequences_N450.smk
@@ -34,7 +34,8 @@ rule filter_N450:
     input:
         sequences = "results/N450/sequences.fasta",
         metadata = "data/metadata.tsv",
-        exclude = config["files"]["exclude"]
+        exclude = config["files"]["exclude"],
+        include = config["files"]["include_N450"]
     output:
         sequences = "results/N450/aligned.fasta"
     params: