Adding a bit of a walkthrough for parsing S3 results via presigned URLs, focusing on umccrise here.
- `s3_files_list_filter_relevant`: equivalent to `gds_files_list_filter_relevant`, but it takes an S3 object directory as input instead of GDS. Surprisingly, `aws s3 ls --output json` cannot generate JSON output (bad form from AWS here, see "Unable to get JSON output from `aws s3 ls` command", aws/aws-cli#709), so I had to go with `aws --output json s3api list-objects-v2`.
- `s3_file_presignedurl`: generates a presigned URL for the given S3 object via `aws s3 presign`.
- `s3_search`: uses the portal API to search for the given file pattern under `s3://umccr-primary-data-prod`, e.g. `s3_search("multiqc_data.json")`, and returns the results in a tidy tibble.

The umccrise multi-sample reporter template now generates interactive plots for signature contributions and for HRD across CHORD and HRDetect, and summarises the cancer report summary table across all samples (449 successful umccrise workflows with the required results on S3).
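For readers unfamiliar with the S3 listing workaround, here is a rough Python sketch of the idea behind the listing and presigning helpers. It is not the code in this PR: the function names, the `example-bucket` bucket, the sample keys, and the returned columns are all hypothetical. It only assumes the documented shape of `aws s3api list-objects-v2` JSON output (a `Contents` array with `Key` and `Size`) and the `--expires-in` option of `aws s3 presign`.

```python
import json
import re


def list_objects_command(bucket, prefix):
    """Build the `aws s3api list-objects-v2` call as an argument list."""
    return [
        "aws", "--output", "json", "s3api", "list-objects-v2",
        "--bucket", bucket, "--prefix", prefix,
    ]


def presign_command(s3_path, expires_in=3600):
    """Build the `aws s3 presign` call for one object (expiry in seconds)."""
    return ["aws", "s3", "presign", s3_path, "--expires-in", str(expires_in)]


def filter_relevant(listing_json, bucket, pattern):
    """Keep only the listed objects whose key matches `pattern`.

    `listing_json` is the JSON text printed by list-objects-v2.
    The `Contents` field is absent when nothing matches the prefix.
    """
    listing = json.loads(listing_json)
    regex = re.compile(pattern)
    rows = []
    for obj in listing.get("Contents", []):
        key = obj["Key"]
        if regex.search(key):
            rows.append({
                "path": f"s3://{bucket}/{key}",
                "basename": key.rsplit("/", 1)[-1],
                "size": obj["Size"],
            })
    return rows


# Hypothetical listing, shaped like list-objects-v2 output.
SAMPLE_LISTING = json.dumps({
    "Contents": [
        {"Key": "run1/umccrise/multiqc_data.json", "Size": 1024},
        {"Key": "run1/umccrise/cancer_report.html", "Size": 2048},
    ]
})

relevant = filter_relevant(SAMPLE_LISTING, "example-bucket", r"multiqc_data\.json$")
presign_calls = [presign_command(row["path"]) for row in relevant]
```

In a real run, each argument list could be passed to `subprocess.run(cmd, capture_output=True, text=True, check=True)` and the listing read from `stdout`. Note that a single `list-objects-v2` response may be paginated for large prefixes, which this sketch does not handle.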