Add weighting function for several scenarios #567

veni-vidi-vici-dormivi · 2024-11-26T14:24:53Z

Add a function go generate weights such that each scenario contributes equally to some fitting routine. This means that a member of a scenario with a lot of members weighs less than a member of a scenario with few members.

Closes get_scenario_weights for new data structures #356
Tests added
Fully documented, including CHANGELOG.rst

codecov · 2024-11-26T14:39:26Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 78.10%. Comparing base (456776d) to head (693562e).
Report is 2 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #567      +/-   ##
==========================================
+ Coverage   77.93%   78.10%   +0.17%     
==========================================
  Files          49       49              
  Lines        2986     3010      +24     
==========================================
+ Hits         2327     2351      +24     
  Misses        659      659

Flag	Coverage Δ
unittests	`78.10% <100.00%> (+0.17%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

for more information, see https://pre-commit.ci

mathause

Oh - I just realized that the text and code of Lea's weight generation does not align:

mesmer/mesmer/calibrate_mesmer/train_utils.py

Lines 14 to 15 in 456776d

    
               derive scenario weights such that each has equal weight, i.e., 1 / number of samples 
        
               (= nr_runs * nr_ts)

mesmer/mesmer/calibrate_mesmer/train_utils.py

Line 39 in 456776d

weights.append(np.full(nr_samples_scen, 1 / nr_runs))

Let me check if I introduced this error or if this was always the case. -> See #569

Given your tests: should exclude be all dims that are not in ens_dim (i.e. remove it from the input params)? I am a bit afraid that the user will forget to add the correct exclude. Does it actually work if you don't exclude all the remaining dims?

mesmer/core/weighted.py

tests/unit/test_weighted.py

veni-vidi-vici-dormivi · 2024-11-27T12:03:40Z

Hm given #569 we could also extend the weighting function to have several options, something like "by_ens", "by_ens_ts" or even pass a function to do it... I have to think about it some more.

Co-authored-by: Mathias Hauser <[email protected]>

mathause · 2024-11-27T13:16:37Z

Hm given #569 we could also extend the weighting function to have several options, something like "by_ens", "by_ens_ts" or even pass a function to do it... I have to think about it some more.

I would probably go via ens_dim or maybe dims by allowing to pass more than one dimension. I suggest to make our lives easier and keep the simple case for now.

veni-vidi-vici-dormivi · 2024-11-27T13:35:54Z

should exclude be all dims that are not in ens_dim

No, in practice all dims except for ens_dim and time should be excluded. The test does not reflect the use case in practice... I agree that if the outcome should always be the same we might as well do it internally. I wrote it because I was not sure if there might be another use case for the function but actually in find_localized_empirical_covariance takes the same weights and I am not sure yet how we will handle weights in the autoregression. I guess we can cross that bridge when we come to it.

Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 4 to 5. - [Release notes](https://github.com/codecov/codecov-action/releases) - [Changelog](https://github.com/codecov/codecov-action/blob/main/CHANGELOG.md) - [Commits](codecov/codecov-action@v4...v5) --- updated-dependencies: - dependency-name: codecov/codecov-action dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

) * implement datatree and dataset in linear regression * tests * extend tests to root dt --------- Co-authored-by: Mathias Hauser <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

mathause

Looks good, thanks. map_over_subtree skips empty nodes, right?

mesmer/core/weighted.py

Co-authored-by: Mathias Hauser <[email protected]>

veni-vidi-vici-dormivi · 2024-12-02T17:38:52Z

skips empty nodes, right?

Yes.

* implement weighting for several scnearios and members * implement tests * work around datatree.testing * extend tests to root dt * docs --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Mathias Hauser <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

veni-vidi-vici-dormivi added 2 commits November 26, 2024 10:49

implement weighting for several scnearios and members

ff105e8

implement tests

073330c

veni-vidi-vici-dormivi and others added 3 commits November 27, 2024 10:30

work around datatree.testing

7fd3452

[pre-commit.ci] auto fixes from pre-commit.com hooks

513768f

for more information, see https://pre-commit.ci

use .data_vars

f14b16a

veni-vidi-vici-dormivi requested a review from mathause November 27, 2024 09:48

mathause reviewed Nov 27, 2024

View reviewed changes

mathause mentioned this pull request Nov 27, 2024

weighting of scenarios? #569

Open

veni-vidi-vici-dormivi and others added 2 commits November 27, 2024 13:04

Update mesmer/core/weighted.py

ab74c11

Co-authored-by: Mathias Hauser <[email protected]>

Update mesmer/core/weighted.py

9f1c67c

Co-authored-by: Mathias Hauser <[email protected]>

dependabot bot and others added 7 commits November 27, 2024 16:23

fix

e195f75

nits

0871ac6

remove exclude

e6c8776

precommit

21cdeec

Merge branch 'main' into scenweights

171ff63

veni-vidi-vici-dormivi requested a review from mathause November 30, 2024 13:11

mathause approved these changes Dec 2, 2024

View reviewed changes

mesmer/core/weighted.py Outdated Show resolved Hide resolved

mesmer/core/weighted.py Outdated Show resolved Hide resolved

mesmer/core/weighted.py Outdated Show resolved Hide resolved

mesmer/core/weighted.py Outdated Show resolved Hide resolved

veni-vidi-vici-dormivi and others added 3 commits December 2, 2024 18:33

Update mesmer/core/weighted.py

c422ae4

Co-authored-by: Mathias Hauser <[email protected]>

Update mesmer/core/weighted.py

a7e960e

Co-authored-by: Mathias Hauser <[email protected]>

doctest

88153eb

veni-vidi-vici-dormivi added 3 commits December 2, 2024 18:39

comment

c0242b5

docs

aee081a

docfix

693562e

veni-vidi-vici-dormivi merged commit 6552d66 into MESMER-group:main Dec 3, 2024
11 checks passed

veni-vidi-vici-dormivi deleted the scenweights branch December 3, 2024 15:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add weighting function for several scenarios #567

Add weighting function for several scenarios #567

veni-vidi-vici-dormivi commented Nov 26, 2024

codecov bot commented Nov 26, 2024 •

edited

Loading

mathause left a comment •

edited

Loading

veni-vidi-vici-dormivi commented Nov 27, 2024

mathause commented Nov 27, 2024

veni-vidi-vici-dormivi commented Nov 27, 2024

mathause left a comment

veni-vidi-vici-dormivi commented Dec 2, 2024

	derive scenario weights such that each has equal weight, i.e., 1 / number of samples
	(= nr_runs * nr_ts)

Add weighting function for several scenarios #567

Add weighting function for several scenarios #567

Conversation

veni-vidi-vici-dormivi commented Nov 26, 2024

codecov bot commented Nov 26, 2024 • edited Loading

Codecov Report

mathause left a comment • edited Loading

Choose a reason for hiding this comment

veni-vidi-vici-dormivi commented Nov 27, 2024

mathause commented Nov 27, 2024

veni-vidi-vici-dormivi commented Nov 27, 2024

mathause left a comment

Choose a reason for hiding this comment

veni-vidi-vici-dormivi commented Dec 2, 2024

codecov bot commented Nov 26, 2024 •

edited

Loading

mathause left a comment •

edited

Loading