The "REF prefixes differ" followed by Failed to merge alleles #47

Open
jjfarrell opened this issue Mar 31, 2022 · 2 comments
@jjfarrell

Any suggestions for this error?

 tail smoove_duphold_ont_manta.sh.o3870689
[smoove] 2022/03/31 11:14:06 [duphold] finished
[smoove] 2022/03/31 11:14:07 [duphold] finished
[smoove] 2022/03/31 11:14:07 [duphold] finished
[smoove] 2022/03/31 11:14:07 [duphold] finished
[smoove] 2022/03/31 11:14:08 [duphold] finished
[smoove] 2022/03/31 11:14:08 starting bcftools merge
[smoove] 2022/03/31 11:35:58 The REF prefixes differ: T vs N (1,1)
Failed to merge alleles at chr22:17756433 in tmp.22/tempclean-539213974/dh132899912smoove-duphold.bcf

Below are the variants at that location. Three records have REF N and one has REF T, and that mismatch appears to break the merge step.

chr22   17756433        chr22:17756433:FG       N       <INS:SVSIZE=74:AGGREGATED>
chr22   17756433        chr22:17756433:FG.0     N       <INS:SVSIZE=74:BREAKPOINT1>
chr22   17756433        chr22:17756433:FG.1     N       <INS:SVSIZE=74:BREAKPOINT2>
chr22   17756433        chr22:17756433:OG       T       ]chr2:102886770]T
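
To locate other sites with the same problem before running the merge, the records can be grouped by position and checked for disagreeing REF alleles. The following is a minimal sketch, not part of smoove or bcftools: the function name `find_mixed_ref_sites` is hypothetical, and it assumes that comparing the first REF base per (CHROM, POS) is enough to predict the "REF prefixes differ" error.

```python
from collections import defaultdict


def find_mixed_ref_sites(vcf_lines):
    """Return {(chrom, pos): {first REF bases}} for sites whose records disagree.

    Assumption: only the first base of REF is compared, mirroring the
    "REF prefixes differ" wording of the error message.
    """
    refs_by_site = defaultdict(set)
    for line in vcf_lines:
        # Skip blank lines and VCF header lines.
        if not line.strip() or line.startswith("#"):
            continue
        # split() handles tabs as well as the spaces in the pasted records.
        fields = line.split()
        chrom, pos, ref = fields[0], int(fields[1]), fields[3]
        refs_by_site[(chrom, pos)].add(ref[0].upper())
    return {site: refs for site, refs in refs_by_site.items() if len(refs) > 1}


# The four records from this issue (first five VCF columns only).
records = [
    "chr22 17756433 chr22:17756433:FG N <INS:SVSIZE=74:AGGREGATED>",
    "chr22 17756433 chr22:17756433:FG.0 N <INS:SVSIZE=74:BREAKPOINT1>",
    "chr22 17756433 chr22:17756433:FG.1 N <INS:SVSIZE=74:BREAKPOINT2>",
    "chr22 17756433 chr22:17756433:OG T ]chr2:102886770]T",
]

mixed = find_mixed_ref_sites(records)
for (chrom, pos), refs in sorted(mixed.items()):
    print(f"{chrom}:{pos} has mixed REF bases: {sorted(refs)}")
```

For a full VCF, the same function could be fed the lines of an uncompressed file; BCF input would first need converting to text.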
@brentp
Owner

brentp commented Apr 1, 2022

Hmm, that is odd. You could use tiwih setref to set the 'N' to 'T' and see if that helps. Are these manta calls?
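
The idea behind that suggestion, replacing a placeholder REF of N with the actual reference base, can be sketched as follows. This is not how tiwih is implemented and does not use its command line; `set_ref_from_reference` is a hypothetical helper, and the reference is assumed to be an in-memory dict of sequences rather than an indexed FASTA. The sketch only touches a single-base N in REF and leaves ALT unchanged.

```python
def set_ref_from_reference(fields, reference):
    """Return a copy of the VCF columns with REF 'N' replaced by the reference base.

    fields:    list of VCF columns (CHROM, POS, ID, REF, ALT, ...)
    reference: dict mapping chromosome name to its sequence string
               (hypothetical stand-in for an indexed FASTA)
    """
    chrom, pos, ref = fields[0], int(fields[1]), fields[3]
    if ref.upper() != "N":
        # REF is already a real base (or longer allele): leave it alone.
        return list(fields)
    fixed = list(fields)
    # VCF positions are 1-based; Python strings are 0-based.
    fixed[3] = reference[chrom][pos - 1].upper()
    return fixed


# Toy reference and record, for illustration only.
toy_reference = {"chrToy": "acgt"}
record = ["chrToy", "3", "sv1", "N", "<INS>"]

fixed_record = set_ref_from_reference(record, toy_reference)
print("\t".join(fixed_record))
```

With every record at a site carrying the same reference base, the REF prefixes no longer differ between records.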

@jjfarrell
Author

I tried it both ways (all N and all T) for just those 4 variants and it worked fine. If I give the variants a mix of 3 Ts and 1 N, it also fails. So I will try tiwih setref on the full VCF to see if that cleans up the issue. FYI, duphold does run fine if I filter out the BND sites and run it on the other variant types (INS, DEL, DUP).
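
The BND filtering workaround can be sketched as below. This is an illustrative approach under stated assumptions, not the command actually used in this thread: `split_breakends` is a hypothetical helper, and it recognises breakends only by the bracket notation in ALT (as in `]chr2:102886770]T`), so single-breakend alleles written with a dot would not be caught.

```python
def is_breakend(alt):
    """True if an ALT allele uses the bracketed breakend notation."""
    return "[" in alt or "]" in alt


def split_breakends(vcf_lines):
    """Split VCF lines into (headers, breakend records, other records)."""
    headers, bnd, other = [], [], []
    for line in vcf_lines:
        if line.startswith("#"):
            headers.append(line)
            continue
        if not line.strip():
            continue
        # Column 5 is ALT; multiple alleles are comma-separated.
        alts = line.split()[4].split(",")
        if any(is_breakend(a) for a in alts):
            bnd.append(line)
        else:
            other.append(line)
    return headers, bnd, other


# The four records from this issue (first five VCF columns only).
records = [
    "chr22 17756433 chr22:17756433:FG N <INS:SVSIZE=74:AGGREGATED>",
    "chr22 17756433 chr22:17756433:FG.0 N <INS:SVSIZE=74:BREAKPOINT1>",
    "chr22 17756433 chr22:17756433:FG.1 N <INS:SVSIZE=74:BREAKPOINT2>",
    "chr22 17756433 chr22:17756433:OG T ]chr2:102886770]T",
]

headers, bnd_records, other_records = split_breakends(records)
print(f"{len(bnd_records)} BND record(s), {len(other_records)} other record(s)")
```

Writing `headers + other_records` to a new file would give a BND-free input, at the cost of dropping the breakend calls from the duphold run.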

The input VCF is graphtyper output, genotyped at sites that svimmer selected from VCFs called primarily with manta.
