Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed slow COO SpMV for the OpenMP executor #340

Merged
merged 2 commits into from
Aug 29, 2019
Merged

Commits on Aug 29, 2019

  1. Fixed slow COO SpMV for the OpenMP executor

    Moved the `omp parallel for` to the most outer loop of the apply,
    so it is parallelized over the matrix entries instead over the number
    of right hand sides for every single matrix entry.
    Thomas Grützmacher committed Aug 29, 2019
    Configuration menu
    Copy the full SHA
    b187da0 View commit details
    Browse the repository at this point in the history
  2. Changed loop order in OpenMP COO SpMV

    Since the previous version contained race conditions, the order of the
    for-loops was swapped (first over number of right hand sides, then
    matrix values). For a single right hand side, it performs close to the
    reference implementation, while for multiple right hand sides, it is
    faster in most cases (at least in the ones I tested).
    Thomas Grützmacher committed Aug 29, 2019
    Configuration menu
    Copy the full SHA
    704e008 View commit details
    Browse the repository at this point in the history