Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Copy ingest #13

Merged
merged 8 commits into from
Dec 5, 2023
Merged

Copy ingest #13

merged 8 commits into from
Dec 5, 2023

Commits on Nov 7, 2023

  1. Ingest: Copy ingest from mpox repo

    This is a copy of the ingest directory from the mpox repo:
    
    https://github.com/nextstrain/mpox/tree/c9b8282e2d56056c8b3c45fa860d931b320acd63
    
    However, `ingest/vendored` subdirectory is not copied over since that folder should be
    added with `git subtree`.
    
    https://github.com/nextstrain/mpox/tree/c9b8282e2d56056c8b3c45fa860d931b320acd63/ingest/vendored#ingest
    
    Future commits will change this to work with Dengue data.
    j23414 committed Nov 7, 2023
    Configuration menu
    Copy the full SHA
    11b67b1 View commit details
    Browse the repository at this point in the history
  2. git subrepo clone (merge) https://github.com/nextstrain/ingest ingest…

    …/vendored
    
    subrepo:
      subdir:   "ingest/vendored"
      merged:   "a0faef5"
    upstream:
      origin:   "https://github.com/nextstrain/ingest"
      branch:   "main"
      commit:   "a0faef5"
    git-subrepo:
      version:  "0.4.6"
      origin:   "https://github.com/ingydotnet/git-subrepo"
      commit:   "110b9eb"
    j23414 committed Nov 7, 2023
    Configuration menu
    Copy the full SHA
    71d2f56 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    b64198c View commit details
    Browse the repository at this point in the history
  4. Remove Nextclade related rules

    Temporary removal of Nextclade-related rules, pending the compilation of
    a Nextclade dengue dataset and potential v3 changes.
    
    May be added back in later.
    j23414 committed Nov 7, 2023
    Configuration menu
    Copy the full SHA
    50e6083 View commit details
    Browse the repository at this point in the history
  5. Remove unused reverse column

    j23414 committed Nov 7, 2023
    Configuration menu
    Copy the full SHA
    e1007c7 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    109f2eb View commit details
    Browse the repository at this point in the history

Commits on Nov 8, 2023

  1. eradicate our confounding of release and submission

    Co-authored-by: Cornelius Roemer <cornelius.roemer@gmail.com>
    j23414 and corneliusroemer committed Nov 8, 2023
    Configuration menu
    Copy the full SHA
    94b0113 View commit details
    Browse the repository at this point in the history

Commits on Dec 5, 2023

  1. NCBI Dataset field name transformations

    Originally the field map was created to keep mpox NDJSON backward compatible
    with field names used from NCBI Virus. However, this constraint is not
    applicable to dengue.¹
    
    This commit organizes field renaming into two parts.
    
    1. Rename the NCBI output columns to match the NCBI mnemonics²
       (see "ncbi_field_map:" in `config/config.yaml`)
    2. Where necessary, rename the NCBI mnemonics to match Nextstrain expected column names³
       (see "transform: fieldmap:" in `config/config.yaml`)
    
    ¹ #13 (comment)
    ² https://www.ncbi.nlm.nih.gov/datasets/docs/v2/reference-docs/command-line/dataformat/tsv/dataformat_tsv_virus-genome/#fields
    ³ https://docs.nextstrain.org/projects/ncov/en/latest/reference/metadata-fields.html
    j23414 committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    2684e24 View commit details
    Browse the repository at this point in the history