weighting of scenarios? #569

mathause · 2024-11-27T11:11:32Z

I always thought that the scenario weights applied to the linear regression is given by 1 / (n_ens * n_ts). However it's 1 / n_ens. I probably miss-interpreted this. The original code (v0.8.0) is here:

mesmer/mesmer/calibrate_mesmer/train_utils.py

Lines 50 to 58 in 13f048b

    
           # assumption: nr_runs per scen and nr_ts for these runs can vary 
        
           # derive weights such that each scenario receives same weight (divide by nr samples) 
        
           nr_samples = 0 
        
           wgt_scen_eq = [] 
        
           for scen in scens: 
        
               nr_runs, nr_ts, nr_gps = targ[scen].shape 
        
               nr_samples_scen = nr_runs * nr_ts 
        
               wgt_scen_eq = np.append(wgt_scen_eq, np.repeat(1 / nr_runs, nr_samples_scen)) 
        
               nr_samples += nr_samples_scen

I refactored this in #143 and adapted the comment to

mesmer/mesmer/calibrate_mesmer/train_utils.py

Lines 14 to 15 in 456776d

    
               derive scenario weights such that each has equal weight, i.e., 1 / number of samples 
        
               (= nr_runs * nr_ts)

but importantly the code stayed the same:

mesmer/mesmer/calibrate_mesmer/train_utils.py

Line 39 in 456776d

weights.append(np.full(nr_samples_scen, 1 / nr_runs))

From Beusch et al. (2022):

"To obtain robust MESMER parameter estimates for each ESM, MESMER is trained on all available ensemble members of each available scenario and equal weight is given to each scenario."

I think it's not 100% clear - you could argue that the historical scenario does get a bit more weight as it has more time steps. But saying the weight is 1 / n_ens is a just-as-valid interpretation of "equal weight for each scenario". So in conclusion there is nothing to do here (except maybe to adapt my comment).

Originally commented in #567 (review)

edit: corrected n_scen -> n_ens

The text was updated successfully, but these errors were encountered:

veni-vidi-vici-dormivi · 2024-11-27T11:58:57Z

Shouldn't n_scen be n_ens or n_runs? Or do you actually mean n_scen because if you were to weigh each sample by 1/n_scen scenarios with more members would be overrepresented.

mathause · 2024-11-27T13:10:57Z

Shouldn't n_scen be n_ens or n_runs? Or do you actually mean n_scen because if you were to weigh each sample by 1/n_scen scenarios with more members would be overrepresented.

Yes you are right - I mean n_ens. I'll correct it above.

mathause mentioned this issue Nov 27, 2024

Add weighting function for several scenarios #567

Merged

3 tasks

veni-vidi-vici-dormivi mentioned this issue Dec 3, 2024

Add MESMER-M example for multiple scenarios and members #572

Draft

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

weighting of scenarios? #569

weighting of scenarios? #569

mathause commented Nov 27, 2024 •

edited

Loading

veni-vidi-vici-dormivi commented Nov 27, 2024

mathause commented Nov 27, 2024

weighting of scenarios? #569

weighting of scenarios? #569

Comments

mathause commented Nov 27, 2024 • edited Loading

veni-vidi-vici-dormivi commented Nov 27, 2024

mathause commented Nov 27, 2024

mathause commented Nov 27, 2024 •

edited

Loading