Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questionable FASTQ format in test dataset #29

Open
VanOverbeeke opened this issue Jan 29, 2024 · 0 comments
Open

Questionable FASTQ format in test dataset #29

VanOverbeeke opened this issue Jan 29, 2024 · 0 comments

Comments

@VanOverbeeke
Copy link

VanOverbeeke commented Jan 29, 2024

Hi,

I ran into an error doing the test:

Started analysis of individual_99_R1.fastq
Failed to process file individual_99_R1.fastq
uk.ac.babraham.FastQC.Sequence.SequenceFormatException: Midline 'CCCFFFFFHHHHHJJJHGIJJJIJJGJGIJJJJIGIJJJIJJIFHGIIJIGJJFHEHI=DGGEEHHFFDFFFDEDEEDDBDBEFEEEEDDDDDDDDDDDDDDBDDDDDDCDDDDDDDDDDDDDADDDDDDBDDDDDDDCDDDDEDDEDEDE' didn't start with '+'
        at uk.ac.babraham.FastQC.Sequence.FastQFile.readNext(FastQFile.java:172)
        at uk.ac.babraham.FastQC.Sequence.FastQFile.next(FastQFile.java:125)
        at uk.ac.babraham.FastQC.Analysis.AnalysisRunner.run(AnalysisRunner.java:77)
        at java.base/java.lang.Thread.run(Thread.java:1583)

When I inspected the FASTQ files manually, I saw mismatching line counts (not all files contain a multiple of 4 lines), leading to unexpected inputs for the pipeline as seen in the error message. See the 1018 line count of this file at https://github.com/akcorut/kGWASflow/blob/main/.test/data/test_reads/individual_99_R1.fastq.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant