Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ask forhelp : error when run run-asm-pipeline.sh #180

Open
Jiangjiangzhang6 opened this issue Feb 24, 2024 · 5 comments
Open

ask forhelp : error when run run-asm-pipeline.sh #180

Jiangjiangzhang6 opened this issue Feb 24, 2024 · 5 comments

Comments

@Jiangjiangzhang6
Copy link

hello friend
I assembly the genome with juicer and 3d-dna, when run the command "run-asm-pipeline.sh -r 0 raw_scaffold_2.fasta ./aligned/merged_nodups.txt"
its error
image

@Jiangjiangzhang6
Copy link
Author

version: 180922
-r|--rounds flag was triggered, will run 0 round(s) of misjoin correction.
###############
Starting iterating scaffolding with editing:
...starting round 0 of scaffolding:
:) -p flag was triggered. Running LIGer with GNU Parallel support parameter set to true.
:) -s flag was triggered, starting calculations with 15000 threshold starting contig/scaffold size
:) -q flag was triggered, starting calculations with 1 threshold mapping quality
...Using cprops file: raw_scaffold_2.0.cprops
...Using merged_nodups file: raw_scaffold_2.mnd.0.txt
...Scaffolding all scaffolds and contigs greater or equal to 15000 bp.
...Starting iteration # 1
...Starting iteration # 2
...Starting iteration # 3
...Starting iteration # 4
...Starting iteration # 5
...Starting iteration # 6
...Starting iteration # 7
...Starting iteration # 8
...Starting iteration # 9
...Starting iteration # 10
:) DONE!
...visualizing round 0 results:
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -q flag was triggered, starting calculations for 1 threshold mapping quality
:) -i flag was triggered, building mapq without
:) -c flag was triggered, will remove temporary files after completion
...Remapping contact data from the original contig set to assembly
...Building track files
...Building the hic file
Not including fragment map
Start preprocess
Writing header
Writing body
.###############
Starting polish:
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -q flag was triggered, performing polishing taking into account signal with minimum 1 mapping quality
...-j flag was triggered, will use Juicebox map raw_scaffold_2.resolved.hic
...-a flag was triggered, will use scaffold annotation file raw_scaffold_2.resolved_asm.scaffold_track.txt
...-b flag was triggered, will use superscaffold annotation file raw_scaffold_2.resolved_asm.superscaf_track.txt
...-s flag was triggered, will ignore all scaffolds shorter than 1000000 for polishing
:) -c flag was triggered, starting calculations with 5% saturation level
:) -w flag was triggered, performing cursory search for mismatches at 50000 resolution
:) -d flag was triggered, depletion score will be averaged across a region bounded by 1500000 superdiagonal
:) -k flag was triggered, starting calculations with 55% depletion as mismatch threshold
:) -n flag was triggered, performing mismatch region thinning at 500 resolution
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -c flag was triggered, starting calculations with 5% saturation level
:) -w flag was triggered, performing cursory search for mismatches at 50000 resolution
:) -k flag was triggered, starting calculations with 55% depletion as mismatch threshold
:) -d flag was triggered, depletion score will be averaged across a region bounded by 1500000 superdiagonal
:) -n flag was triggered, performing mismatch region thinning at 500 resolution
...Dumping 50000 resolution matrix
HiC file version: 8
ERROR [2024-02-21 23:00:24,820] [DatasetReaderV2.java:229] [main] Error reading dataset
java.io.EOFException
at htsjdk.tribble.util.LittleEndianInputStream.readString(LittleEndianInputStream.java:119)
at juicebox.data.DatasetReaderV2.readFooter(DatasetReaderV2.java:435)
at juicebox.data.DatasetReaderV2.read(DatasetReaderV2.java:225)
at juicebox.data.HiCFileTools.extractDatasetForCLT(HiCFileTools.java:64)
at juicebox.tools.clt.old.Dump.readArguments(Dump.java:353)
at juicebox.tools.HiCTools.main(HiCTools.java:85)
...-p flag was triggered. Running with GNU Parallel support parameter set to true.
...-r flag was triggered, output will be labeled as .polish.
...applying edits to cprops file
...applying edits to mnd file
:) -p flag was triggered. Running LIGer with GNU Parallel support parameter set to true.
:) -q flag was triggered, starting calculations with 1 threshold mapping quality
:) -s flag was triggered, starting calculations with 1000000 threshold starting contig/scaffold size
...Using cprops file: raw_scaffold_2.resolved.polish.cprops
...Using merged_nodups file: raw_scaffold_2.resolved.polish.mnd.txt
...Explicit scaffold set has been listed as input. Using set as a first iteration.
...Starting iteration # 1
:) DONE!
:) -p flag was triggered. Running LIGer with GNU Parallel support parameter set to true.
:) -q flag was triggered, starting calculations with 1 threshold mapping quality
:) -s flag was triggered, starting calculations with 1000000 threshold starting contig/scaffold size
...Using cprops file: raw_scaffold_2.resolved.polish.cprops
...Using merged_nodups file: raw_scaffold_2.resolved.polish.mnd.txt
...Explicit scaffold set has been listed as input. Using set as a first iteration.
...Starting iteration # 1
:) DONE!
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -q flag was triggered, starting calculations for 1 threshold mapping quality
:) -i flag was triggered, building mapq without
:) -c flag was triggered, will remove temporary files after completion
...Remapping contact data from the original contig set to assembly
...Building track files
...Building the hic file
Not including fragment map
Start preprocess
Writing header
Writing body
.###############
Starting split:
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
...-q flag was triggered, will ignore all reads with mapping quality less then 1 for polishing
...-j flag was triggered, will use Juicebox map raw_scaffold_2.polished.hic
...-a flag was triggered, will use scaffold annotation file raw_scaffold_2.polished_asm.scaffold_track.txt
...-b flag was triggered, will use superscaffold annotation file raw_scaffold_2.polished_asm.superscaf_track.txt
...-s flag was triggered, will ignore all scaffolds shorter than 100000 for polishing
:) -c flag was triggered, starting calculations with 5% saturation level
:) -w flag was triggered, performing cursory search for mismatches at 50000 resolution
:) -d flag was triggered, depletion score will be averaged across a region bounded by 1500000 superdiagonal
:) -k flag was triggered, starting calculations with 55% depletion as mismatch threshold
:) -n flag was triggered, performing mismatch region thinning at 500 resolution
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -c flag was triggered, starting calculations with 5% saturation level
:) -w flag was triggered, performing cursory search for mismatches at 50000 resolution
:) -k flag was triggered, starting calculations with 55% depletion as mismatch threshold
:) -d flag was triggered, depletion score will be averaged across a region bounded by 1500000 superdiagonal
:) -n flag was triggered, performing mismatch region thinning at 500 resolution
...Dumping 50000 resolution matrix
HiC file version: 8
ERROR [2024-02-22 05:19:39,071] [DatasetReaderV2.java:229] [main] Error reading dataset
java.io.EOFException
at htsjdk.tribble.util.LittleEndianInputStream.readString(LittleEndianInputStream.java:119)
at juicebox.data.DatasetReaderV2.readFooter(DatasetReaderV2.java:435)
at juicebox.data.DatasetReaderV2.read(DatasetReaderV2.java:225)
at juicebox.data.HiCFileTools.extractDatasetForCLT(HiCFileTools.java:64)
at juicebox.tools.clt.old.Dump.readArguments(Dump.java:353)
at juicebox.tools.HiCTools.main(HiCTools.java:85)
...-p flag was triggered. Running with GNU Parallel support parameter set to true.
...-r flag was triggered, output will be labeled as .split.
...applying edits to cprops file
...applying edits to mnd file
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -q flag was triggered, starting calculations for 1 threshold mapping quality
:) -i flag was triggered, building mapq without
:) -c flag was triggered, will remove temporary files after completion
...Remapping contact data from the original contig set to assembly
...Building track files
...Building the hic file
Not including fragment map
Start preprocess
Writing header
Writing body
.###############
Starting sealing:
... -i flag was triggered, will attempt to place back only debris contigs/scaffolds and those less than 15000
:) -p flag was triggered. Running with GNU Parallel support parameter set to true.
:) -q flag was triggered, starting calculations for 1 threshold mapping quality
:) -i flag was triggered, building mapq without
:) -c flag was triggered, will remove temporary files after completion
...Remapping contact data from the original contig set to assembly
...Building track files
...Building the hic file
Not including fragment map
Start preprocess
Writing header
Writing body
.###############
Finilizing output:
... -s flag was triggered, treating all contigs/scaffolds shorter than 15000 as unattempted.
... -l flag was triggered. Output will appear with headers of the form raw_scaffold_2_hic_scaffold_#.
... -g flag was triggered, making gap size between scaffolded draft sequences to be equal to 500.
Analyzing the merged assembly
...trimming N overhangs
...adding gaps

how to debug this, lookiing forward for you reply.

@Jiangjiangzhang6
Copy link
Author

just the hic file was too small size, the fasta was 2.2gb and the hic file was just 373kb

@Djangodu
Copy link

hello friend I assembly the genome with juicer and 3d-dna, when run the command "run-asm-pipeline.sh -r 0 raw_scaffold_2.fasta ./aligned/merged_nodups.txt" its error image

hello, friend
I miss this same issue, how about you now? did you solve it?

@Jiangjiangzhang6
Copy link
Author

hello friend I assembly the genome with juicer and 3d-dna, when run the command "run-asm-pipeline.sh -r 0 raw_scaffold_2.fasta ./aligned/merged_nodups.txt" its error image

hello, friend I miss this same issue, how about you now? did you solve it?
hello ,you should check your fasta, you could seqkit or other tools to check the , maybe your fasta with the information that not ATGC, like X, or space

@dudcha
Copy link
Collaborator

dudcha commented Mar 22, 2024

Hi all,

Please see this discussion over here: https://groups.google.com/g/3d-genomics/c/OoCwQdL6yX0

You can exit 3d-dna at the point which is causing the problem and see if running with an updated jar will help. If you confirm that this is it you can just replace the jar in the 3d-dna/visualize folder with the one that was updated to address a similar issue.

Best,
Olga

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants