(DO NOT MERGE) - Tpetra: performance improvements to CrsMatrix::copyAndPermute - working branch #13598

Draft
wants to merge 5 commits into
base: develop

Commits on Sep 20, 2024

  1. treat copy and permute for the special case that both the source and target matrices are locally indexed
    tjfulle committed Sep 20, 2024
    35785c2

Commits on Nov 7, 2024

  1. b7ce02c

Commits on Nov 13, 2024

  1. Address performance issues in CrsMatrix copyAndPermute:

      1) timers
      2) batch version of Map::getGlobalElement
      3) improved replaceGlobalValues
      4) Kokkos parallel_reduce/parallel_for for the main loop over rows in copyAndPermute (CAP)
      5) small improvements using if constexpr
    skennon10 committed Nov 13, 2024
    53ae2cc
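
Commit 53ae2cc mentions a batch version of Map::getGlobalElement. The page does not show its signature, so the block below is only a minimal standalone sketch of the general idea: translate all local indices of a row in one call instead of one call per entry. `ToyMap`, its contiguous `offset` layout, and the method name `getGlobalElements` are hypothetical stand-ins for illustration, not Tpetra's API, and the plain loop stands in for whatever parallel dispatch the real code uses.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a contiguous map: global index = offset + local index.
// This is NOT Tpetra::Map; it only illustrates per-element vs. batch lookup.
struct ToyMap {
  std::int64_t offset;   // first global index owned by this process (assumed layout)
  std::size_t numLocal;  // number of locally owned indices

  // Per-element lookup: one call (and one bounds check) per index.
  std::int64_t getGlobalElement(std::size_t lid) const {
    assert(lid < numLocal);
    return offset + static_cast<std::int64_t>(lid);
  }

  // Hypothetical batch lookup: translate a whole row of local indices in one call,
  // so the per-call overhead is paid once per row rather than once per entry.
  void getGlobalElements(const std::vector<std::size_t>& lids,
                         std::vector<std::int64_t>& gids) const {
    gids.resize(lids.size());
    for (std::size_t k = 0; k < lids.size(); ++k) {
      assert(lids[k] < numLocal);
      gids[k] = offset + static_cast<std::int64_t>(lids[k]);
    }
  }
};
```

In this sketch a caller would fill `lids` with the column indices of one row, call `getGlobalElements` once, and pass the resulting `gids` on to the value-replacement step.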

Commits on Nov 14, 2024

  1. 7c7a518
  2. cbce5bb