Added ParILU for CUDA #324
Commits on Jul 12, 2019
Started implementing CUDA kernels for ParILU
Thomas Grützmacher committed Jul 12, 2019 (0ac57a8)
Additionally, separated the storage of the matrices to a single location to prevent matrix file duplication between CUDA and OpenMP tests.
Thomas Grützmacher committed Jul 12, 2019 (12dd8d7)
Fixed kernel and test mistake in CUDA, ParILU
Thomas Grützmacher committed Jul 12, 2019 (922d064)
Added initialize kernel for ParILU on CUDA
Thomas Grützmacher committed Jul 12, 2019 (8052ac0)
Added remaining kernel (compute) for CUDA ParILU
Thomas Grützmacher committed Jul 12, 2019 (a2348c0)
Added test matrix to use it in CUDA ParILU test
Additionally, added more iterations for the default CUDA compute kernel for ParILU.
Thomas Grützmacher committed Jul 12, 2019 (fa638ee)
Added check for inf and NaN in ParILU CUDA
Also reduced default number of iterations for CUDA compute kernel
Thomas Grützmacher committed Jul 12, 2019 (2f47596)
All ParILU compute kernels have inf & NaN checks
Additionally, raised the allowed tolerance for the OpenMP compute kernel, because the CI system otherwise fails when running on all cores.
Thomas Grützmacher committed Jul 12, 2019 (55fab55)
Removed compile issues for older CUDA compilers
Also switched the NaN and inf check to the appropriate function, `isfinite`.
Thomas Grützmacher committed Jul 12, 2019 (2f4dba5)
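For readers unfamiliar with why the compute kernels need an inf and NaN check, the following is a minimal host-side C++ sketch, not the code from this PR. It assumes a dense, row-major matrix instead of the sparse storage the real kernels use, and the names `keep_if_finite` and `parilu_sweep` are hypothetical. It shows one in-place sweep of the ParILU fixed-point update in which a new factor entry is accepted only if `std::isfinite` reports it as finite; otherwise the previous value is kept.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not taken from the PR): accept a freshly computed
// factor entry only if it is finite, otherwise keep the previous value.
inline double keep_if_finite(double candidate, double previous)
{
    return std::isfinite(candidate) ? candidate : previous;
}

// One in-place ParILU-style sweep on a dense n x n matrix in row-major
// layout (an assumption made for brevity). `l` is expected to carry a unit
// diagonal, `u` the upper triangle including the diagonal.
inline void parilu_sweep(const std::vector<double>& a, std::vector<double>& l,
                         std::vector<double>& u, std::size_t n)
{
    assert(a.size() == n * n && l.size() == n * n && u.size() == n * n);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            // Sum runs over k < min(i, j).
            const std::size_t last = (i < j) ? i : j;
            double sum = a[i * n + j];
            for (std::size_t k = 0; k < last; ++k) {
                sum -= l[i * n + k] * u[k * n + j];
            }
            if (i > j) {
                // A zero pivot would produce inf or NaN here; the guard
                // then leaves the old entry of l untouched.
                l[i * n + j] = keep_if_finite(sum / u[j * n + j], l[i * n + j]);
            } else {
                u[i * n + j] = keep_if_finite(sum, u[i * n + j]);
            }
        }
    }
}
```

In a CUDA kernel the same guard would typically be written with the device-side `isfinite`, and each thread would update one nonzero instead of looping over the whole matrix.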
ParILU: Fix for the clang compiler
Thomas Grützmacher committed Jul 12, 2019 (ca5129e)
Thomas Grützmacher committed Jul 12, 2019 (3f25c9f)
Thomas Grützmacher committed Jul 12, 2019 (664d978)