Use `nano-gemm` instead of `matrixmultiply` #292

cschwan · 2024-06-06T14:45:51Z

Here are some benchmarks of the existing code:

cschwan · 2024-06-30T07:01:25Z

Using the 22 GB-sized flavour-basis EKO ATLAS_1JET_8TEV_R06 for a strong coupling of 0.119 at the Z-boson mass, the old code takes

real    160m21.133s
user    159m53.132s
sys     0m27.700s

to evolve the corresponding grid. With nano-gemm it only takes

real    121m36.539s
user    121m9.532s
sys     0m26.784s

That's a wonderful 25% reduction in runtime!

BTW: the old runtime (with OpenBLAS even) was

real    301m45.642s
user    300m24.360s
sys     0m54.963s

and the difference comes from the fact that I used an evolution-basis EKO that's much bigger due to the rotation: 45 GB.

mert-kurttutan · 2024-07-03T10:51:07Z

Hi,
Your problem seems interesting and practical. Could please you share the steps to reproduce your results (along with the specs of the hardware you used) if possible?
I might be able to contribute.

Edit0: Your benchmark seems to be too long for experiment. I would appreciate, if you provide steps for smaller version of your benchmark test
Thanks

cschwan · 2024-07-04T07:57:55Z

Hi @mert-kurttutan,

The linear algebra routines are used in an operation that we call 'evolution', and some faster running evolutions are used in our integration tests. See this file: pineappl_cli/tests/evolve.rs. These tests run the binary that we call the 'PineAPPL CLI', and for these cases we always run

pineappl evolve <INPUT> <EKO> <OUTPUT> <CONV_FUNS>

The integration tests simply verify the output.

For examples of how the arguments are used have a look at the tests. To run them successfully, you'll need the test data that is passed to the CLI, which you can download here. It's probably easier to copy and run the wget calls from maintainer/generate-coverage.sh. The files must be placed into a folder test-data at top level of the repository.

The installation is probably a bit tricky, please read https://nnpdf.github.io/pineappl/docs/installation.html#cli-pineappl-for-your-shell for instructions (you will need the evolution feature, all other optional features are not needed). However, before compiling the Rust code, you'll need to install LHAPDF 6.5.4; without this C++ library nothing will compile/run unfortunately. The installation instructions for this library are here.

If you have suggestion on how to improve these documents we'd be happy to take your comments into account (best in a separate Issue). While I'm writing this, I realized that in our installation documents we never mention LHAPDF, probably because practically everyone in our community has it installed. This has to be improved.

cschwan · 2024-07-22T13:01:36Z

Starting from commit 9ca3022 the new baseline is now:

real    4m54.425s
user    4m20.387s
sys     0m32.138s

Apparently we did a lot of linear algebra with zeros 😞.

Use nano-gemm instead of matrixmultiply for non-DIS grids

a4e96a2

cschwan self-assigned this Jun 6, 2024

cschwan linked an issue Jun 6, 2024 that may be closed by this pull request

Investigate nano-gemm crate to improve speed of linear algebra #290

Open

Merge branch 'master' into test-nano-gemm

d20fbb9

cschwan mentioned this pull request Jun 30, 2024

Add documentation on how to use this crate sarah-ek/nano-gemm#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `nano-gemm` instead of `matrixmultiply` #292

Use `nano-gemm` instead of `matrixmultiply` #292

cschwan commented Jun 6, 2024 •

edited

Loading

cschwan commented Jun 30, 2024

mert-kurttutan commented Jul 3, 2024 •

edited

Loading

cschwan commented Jul 4, 2024

cschwan commented Jul 22, 2024

Use nano-gemm instead of matrixmultiply #292

Are you sure you want to change the base?

Use nano-gemm instead of matrixmultiply #292

Conversation

cschwan commented Jun 6, 2024 • edited Loading

cschwan commented Jun 30, 2024

mert-kurttutan commented Jul 3, 2024 • edited Loading

cschwan commented Jul 4, 2024

cschwan commented Jul 22, 2024

Use `nano-gemm` instead of `matrixmultiply` #292

Use `nano-gemm` instead of `matrixmultiply` #292

cschwan commented Jun 6, 2024 •

edited

Loading

mert-kurttutan commented Jul 3, 2024 •

edited

Loading