Skip to content

v0.2.7

Latest
Compare
Choose a tag to compare
@reuterbal reuterbal released this 04 Oct 13:07
· 25 commits to main since this release
477c56d

What's New

  • Experimental Fortran-to-CUDA transpilation demonstrated on CLOUDSC (#328)
  • A new SplitReadWriteTransformation that allows user-guided GPU optimisation to make loads independent from stores (#329)
  • A new LowerConstantArrayIndices transformation to pass full arrays instead of constant slices in kernel calls (#348)
  • New transformation utilities to introduce loop blocking for driver loops (#362)
  • A new string-based substitution mechanism for expressions (#366)
  • Refactoring of SCC tests (#353) and transformation utilities (#354)
  • And many small improvements and bug fixes (see below)

All Changes

  • IR: Automatic sanitisation of tuples in IR constructors by @mlange05 in #350
  • Run pytest on macos in GH actions by @reuterbal in #262
  • SCC test reshuffle by @mlange05 in #353
  • Transformations: Move common SCC utility routines to utilities by @mlange05 in #354
  • Transformations: Test and fix corner case in get_local_arrays by @mlange05 in #355
  • Tools: Disable timeout utility test on MacOS due to sporadic failures by @mlange05 in #356
  • Fixed logical evaluation of PRESENT intrinsics on Array variables by @JoeffreyLegaux in #341
  • ecWAM regression tests: switch to develop-1.3 branch by @awnawab in #358
  • Split reads and writes for certain accumulation patterns by @awnawab in #329
  • fix for 'resolve_vector_notation' utility by @MichaelSt98 in #361
  • Transformations: Internalise IdemTransformation by @mlange05 in #360
  • New transformation 'LowerConstantArrayIndices' to allow to … by @MichaelSt98 in #348
  • OMNI: Fix dimension range-indexing in frontend by @mlange05 in #363
  • Loki-transform: Pass cuf option to FilewriteTrafo by @mlange05 in #364
  • Filter out globals in get_local_arrays by @awnawab in #370
  • extend hoist variables functionality by @MichaelSt98 in #357
  • Change/fix pipeline for mode 'scc-raw-stack' by @MichaelSt98 in #371
  • Minimal padding in pool allocator by @awnawab in #365
  • CLOUDSC low-level GPU (transpilation) via Loki (CUF/CUDA) by @MichaelSt98 in #328
  • Loop splitting/blocking of block loops by @wertysas in #362
  • String-based expression substitution and moar expression tests! by @mlange05 in #366
  • SCC: Add vectorisation annotations in SCCRevector and translate in SCCAnnotate by @mlange05 in #359
  • Update VERSION to 0.2.7 by @reuterbal in #381

New Contributors

Full Changelog: v0.2.6...v0.2.7