Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Accelerate CSR->Ell,Hybrid conversions on CUDA #318

Merged
merged 1 commit into from
Jun 24, 2019
Merged

Commits on Jun 21, 2019

  1. Accelerate CSR->Ell,Hybrid conversions on CUDA.

    + The previous grid dimensions for `initialize_zero_ell` were `stride *
      num_rows`, i.e. roughly the dense matrix dimension.
    + Using `max_nnz_per_row * num_rows` reduces significantly the amount of threads
      created which makes this kernel call more efficient (less useless thread
      creation).
    tcojean committed Jun 21, 2019
    Configuration menu
    Copy the full SHA
    646c2cc View commit details
    Browse the repository at this point in the history