Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] GATK GenotypeGVCFs option? #392

Closed
mvelinder opened this issue Jun 22, 2021 · 6 comments
Closed

[FEATURE] GATK GenotypeGVCFs option? #392

mvelinder opened this issue Jun 22, 2021 · 6 comments
Assignees
Labels
enhancement New feature or request Low priority Low priority for release
Milestone

Comments

@mvelinder
Copy link

It doesn't appear GenotypeGVCFs included in --tools? Could it be included in a future version? It's an important step for some larger scale studies. GATK documentation and description: https://gatk.broadinstitute.org/hc/en-us/articles/360037057852-GenotypeGVCFs

Thanks!

@mvelinder mvelinder added the enhancement New feature or request label Jun 22, 2021
@mvelinder
Copy link
Author

mvelinder commented Jun 24, 2021

Nevermind, it seems this is included here https://nf-co.re/sarek/2.7.1/output#gatk-genotypegvcfs Sorry for the confusion

@mvelinder
Copy link
Author

So while it is listed in the link I gave previously, https://nf-co.re/sarek/2.7.1/output#gatk-genotypegvcfs - it does not work with --tools . What is the usage for GenotypeGVCFs then?

@mvelinder mvelinder reopened this Jun 24, 2021
@maxulysse
Copy link
Member

Hi @mvelinder --tools haplotypecaller --generate_gvcf should do the trick.

@mvelinder
Copy link
Author

mvelinder commented Jun 29, 2021

Thanks @maxulysse - from the GATK tool, the output is "A final VCF in which all samples have been jointly genotyped." https://gatk.broadinstitute.org/hc/en-us/articles/360037057852-GenotypeGVCFs

My latest run using --generate_gvcf (starting from 3 sets of FASTQ files as a trio exome) did indeed result in a g.vcf.gz for each sample. Have these individual GVCFs been jointly genotyped but just not merged into a single multi-sample output VCF? Would it be appropriate to just bcftools merge the 3 individual GVCFs to create a multiple sample joint-genotyped VCF?

Appreciate your help and clarification!

@maxulysse
Copy link
Member

Hi @mvelinder.
At the moment in sarek the g.vcf are not merged yet, so each g.vcf file you have is from one set of FASTQs.
I'd look more into GenomicsDBImport for merging the resulting g.vcf files.
Hopefully it's something that we'll be able to add to the next sarek release.

@FriederikeHanssen
Copy link
Contributor

the GATK recommended workflow for joint genotyping is added here #595 including genomicsdbimport, genotypegvcf, and vqsr

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Low priority Low priority for release
Projects
None yet
Development

No branches or pull requests

4 participants