Avoid transpose in symbolic Cholesky benchmark #1312

upsj · 2023-03-23T15:42:37Z

The transposition is pretty slow on OpenMP, so to have a fair performance baseline, I want to skip all the operations needed to symmetrize the matrix.

MarcelKoch · 2023-03-23T15:54:23Z

core/factorization/symbolic.hpp

-void symbolic_cholesky(const matrix::Csr<ValueType, IndexType>*,
-                       std::unique_ptr<matrix::Csr<ValueType, IndexType>>&,
-                       std::unique_ptr<elimination_forest<IndexType>>&);
+void symbolic_cholesky(


perhaps it makes sense to always just return the pattern of L? Then you would have to add the symmetrization to the places where you call this, but IMO it would be clearer what the function does.

I'd like to have this in quickly, so I can always benchmark off develop, clean-up comes later :)

TBH, I don't think that is the purpose of develop. Why not use a branch that has everything you need in it?

Because I want things to be reproducible, if I mix performance tuning with benchmark changes, I could easily mess up things.
And in general, all uses of the symbolic factorization in Ginkgo need the symmetric version ATM (maybe with some changes in the future for Cholesky, but that's not yet clear). This is not a public function, so it makes sense to choose the most useful semantics.

yhmtsai

Maybe still need to keep one version including transpose time?

upsj · 2023-03-23T17:34:38Z

@yhmtsai agreed, I added it

yhmtsai

I agree with @MarcelKoch.
develop should not have a short change and revert afterwards.
At least, if we add the performance test without transpose, we should keep it for a while.
Maybe not complete related to this, but I would like to bring #895

thoasm

~~I am not a fan of renaming an old functionality instead of renaming the new one, as this makes it difficult to re-generate old benchmark results.~~
Edit: Since the benchmark was added 2 days ago in #1302, I am fine with renaming this functionality.

core/factorization/symbolic.hpp

benchmark/sparse_blas/operations.cpp

upsj · 2023-03-23T19:50:46Z

@yhmtsai To clarify: I want the code I am benchmarking on to be as close as possible to develop, it doesn't necessarily need to be merged, but it should be reviewed (I won't merge this now, because there seem to have been some compiler changes in MSVC which break the overflow check)

upsj · 2023-03-23T20:53:11Z

Found the MSVC issue, we were reading uninitialized data in the prefix_sum kernel, I fixed it and added tests

ginkgo-bot · 2023-03-23T21:10:35Z

Note: This PR changes the Ginkgo ABI:

Functions changes summary: 8 Removed, 0 Changed, 8 Added functions
Variables changes summary: 0 Removed, 0 Changed, 0 Added variable

For details check the full ABI diff under Artifacts here

sonarcloud · 2023-03-24T02:59:49Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
1 Code Smell

41.7% Coverage
0.0% Duplication

codecov · 2023-03-24T03:35:50Z

Codecov Report

Patch coverage: 83.33% and project coverage change: -0.01 ⚠️

Comparison is base (e73ea67) 91.27% compared to head (8228f8d) 91.26%.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1312      +/-   ##
===========================================
- Coverage    91.27%   91.26%   -0.01%     
===========================================
  Files          577      577              
  Lines        48721    48739      +18     
===========================================
+ Hits         44468    44484      +16     
- Misses        4253     4255       +2

Impacted Files	Coverage Δ
test/factorization/cholesky_kernels.cpp	`0.00% <0.00%> (ø)`
reference/test/factorization/cholesky_kernels.cpp	`92.30% <71.42%> (-0.80%)`	⬇️
core/factorization/cholesky.cpp	`100.00% <100.00%> (ø)`
core/factorization/lu.cpp	`98.14% <100.00%> (ø)`
core/factorization/symbolic.cpp	`100.00% <100.00%> (ø)`
reference/components/prefix_sum_kernels.cpp	`100.00% <100.00%> (ø)`
reference/test/components/prefix_sum_kernels.cpp	`100.00% <100.00%> (ø)`
reference/test/factorization/lu_kernels.cpp	`95.45% <100.00%> (ø)`
test/components/prefix_sum_kernels.cpp	`100.00% <100.00%> (ø)`

... and 1 file with indirect coverage changes

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

avoid transpose in symbolic cholesky benchmark

08f7bd9

upsj added the 1:ST:ready-for-review This PR is ready for review label Mar 23, 2023

upsj self-assigned this Mar 23, 2023

ginkgo-bot added reg:testing This is related to testing. mod:core This is related to the core module. mod:reference This is related to the reference module. reg:benchmarking This is related to benchmarking. type:factorization This is related to the Factorizations labels Mar 23, 2023

upsj added the 1:ST:no-changelog-entry Skip the wiki check for changelog update label Mar 23, 2023

add lower-only test

29793e1

upsj added the 1:ST:skip-full-test label Mar 23, 2023

MarcelKoch reviewed Mar 23, 2023

View reviewed changes

yhmtsai reviewed Mar 23, 2023

View reviewed changes

upsj added 2 commits March 23, 2023 18:33

add benchmark with symmetrization

51d23cf

fix LU test

01df4d7

yhmtsai approved these changes Mar 23, 2023

View reviewed changes

thoasm approved these changes Mar 23, 2023

View reviewed changes

core/factorization/symbolic.hpp Outdated Show resolved Hide resolved

benchmark/sparse_blas/operations.cpp Show resolved Hide resolved

upsj added 2 commits March 23, 2023 20:52

improve documentation

1cc34c4

fix reading uninitialized values in prefix_sum

8228f8d

upsj removed the 1:ST:skip-full-test label Mar 23, 2023

upsj added 1:ST:ready-to-merge This PR is ready to merge. and removed 1:ST:ready-for-review This PR is ready for review labels Mar 23, 2023

upsj merged commit 786d580 into develop Mar 24, 2023

upsj deleted the symbolic_cholesky_no_transpose branch March 24, 2023 06:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid transpose in symbolic Cholesky benchmark #1312

Avoid transpose in symbolic Cholesky benchmark #1312

upsj commented Mar 23, 2023

MarcelKoch Mar 23, 2023

upsj Mar 23, 2023

MarcelKoch Mar 23, 2023

upsj Mar 23, 2023

yhmtsai left a comment

upsj commented Mar 23, 2023

yhmtsai left a comment

thoasm left a comment •

edited

Loading

upsj commented Mar 23, 2023

upsj commented Mar 23, 2023

ginkgo-bot commented Mar 23, 2023

sonarcloud bot commented Mar 24, 2023

codecov bot commented Mar 24, 2023 •

edited

Loading

Avoid transpose in symbolic Cholesky benchmark #1312

Avoid transpose in symbolic Cholesky benchmark #1312

Conversation

upsj commented Mar 23, 2023

MarcelKoch Mar 23, 2023

Choose a reason for hiding this comment

upsj Mar 23, 2023

Choose a reason for hiding this comment

MarcelKoch Mar 23, 2023

Choose a reason for hiding this comment

upsj Mar 23, 2023

Choose a reason for hiding this comment

yhmtsai left a comment

Choose a reason for hiding this comment

upsj commented Mar 23, 2023

yhmtsai left a comment

Choose a reason for hiding this comment

thoasm left a comment • edited Loading

Choose a reason for hiding this comment

upsj commented Mar 23, 2023

upsj commented Mar 23, 2023

ginkgo-bot commented Mar 23, 2023

sonarcloud bot commented Mar 24, 2023

codecov bot commented Mar 24, 2023 • edited Loading

Codecov Report

thoasm left a comment •

edited

Loading

codecov bot commented Mar 24, 2023 •

edited

Loading