Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Display sequencing metadata from the run_dir #4

Open
matrulda opened this issue Mar 19, 2024 · 2 comments
Open

Display sequencing metadata from the run_dir #4

matrulda opened this issue Mar 19, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@matrulda
Copy link

Description of feature

It would be nice if metadata regarding the sequencing run was displayed in the reports. In the case of an Illumina run this could include:

  • Control software version
  • Reagent kit version
  • RTA version
  • Flowcell type
  • and more...

This should be data that can be scraped from sequencing output files, like RunInfo.xml and RunParameters.xml.

This information could be shown in its own section in the reports or be part of the Software versions table.

This is how this feature has been implemented in seqreports: https://github.com/Molmed/seqreports/blob/main/bin/get_metadata.py

@matrulda matrulda added the enhancement New feature or request label Mar 19, 2024
@mahesh-panchal
Copy link
Member

A possible method that might help with speed is to implement this natively in Groovy. Processes using exec are executed on the head node and don't need to be distributed to a node like a python script does.

The XML slurper lib should make it easy-ish to parse out the relevant meta data and add it to the meta map.
https://groovy-lang.org/processing-xml.html#_simply_traversing_the_tree

@matrulda
Copy link
Author

Good point, something to keep in mind.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants