You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Improve overview of new classes by adding a correspondence table with v0.x classes. Start introducing classification.
- Use v0.x instead of v0.9.
- Replace `method` by `technique` when referring to conformal inference methods.
- Other minor improvements and clarifications
Copy file name to clipboardexpand all lines: doc/v1_migration_guide.rst
+80-66
Original file line number
Diff line number
Diff line change
@@ -1,36 +1,52 @@
1
1
Migrating to MAPIE v1
2
2
===========================================
3
3
4
-
MAPIE v1 introduces several updates, enhancements, and structural changes that simplify the API by breaking down ``MapieRegressor`` functionality into dedicated classes for different conformal prediction methods. This guide outlines the key differences between MAPIE v0.9 and MAPIE v1 and provides instructions for adapting your code to the new structure.
4
+
MAPIE v1 introduces several updates, enhancements, and structural changes that simplify the API by breaking down ``MapieRegressor`` and ``MapieClassifier`` into dedicated classes for different conformal prediction techniques.
5
+
6
+
This guide outlines the differences between MAPIE v0.x and MAPIE v1 and provides instructions for migrating your code to the new API.
5
7
6
8
1. Overview of class restructuring
7
9
-----------------------------------
8
10
9
-
MAPIE v1 organizes the ``MapieRegressor`` functionality into specific regressor classes, each optimized for a particular type of conformal prediction:
- ``ConformalizedQuantileRegressor``: For quantile-based conformal prediction.
15
-
16
-
This modular approach makes it easier to select and configure a specific conformal regression technique. Each class includes parameters relevant to its own methodology, reducing redundancy and improving readability.
17
-
18
-
Migration summary of ``MapieRegressor`` to new classes
In MAPIE v0.9, ``MapieRegressor`` managed all conformal regression techniques under a single interface, which sometimes led to parameter redundancy and ambiguity. In MAPIE v1, each method-specific class includes only the parameters and techniques relevant to its technique.
- "Split conformal techniques" refer to ``SplitConformalRegressor`` and ``ConformalizedQuantileRegressor``
33
-
- "Cross-conformal techniques" refer to ``CrossConformalRegressor`` and ``JackknifeAfterBootstrapRegressor``
11
+
MAPIE v1 breaks down the ``MapieRegressor`` and ``MapieClassifier`` classes into 5 classes, each dedicated to a particular conformal prediction technique. ``MapieQuantileRegressor`` has also been revamped, and renamed ``ConformalizedQuantileRegressor``.
12
+
13
+
The rationale behind this is that ``MapieRegressor`` and ``MapieClassifier`` managed several conformal techniques under a single interface, which led to parameter redundancy and ambiguity. In MAPIE v1, each class includes only the relevant parameters specific to its technique.
14
+
15
+
The ``cv`` parameter is key to understand what new class to use in the v1 API:
- ``None``, integer, or any ``sklearn.model_selection.BaseCrossValidator``
30
+
- ``CrossConformalRegressor``
31
+
- Cross
32
+
* - ``MapieRegressor``
33
+
- ``subsample.Subsample``
34
+
- ``JackknifeAfterBootstrapRegressor``
35
+
- Cross
36
+
* - ``MapieQuantileRegressor``
37
+
- ``None``, ``"split"`` or ``"prefit"``
38
+
- ``ConformalizedQuantileRegressor``
39
+
- Split
40
+
* - ``MapieClassifier``
41
+
- ``"split"`` or ``"prefit"``
42
+
- ``SplitConformalClassifier``
43
+
- Split
44
+
* - ``MapieClassifier``
45
+
- ``None``, integer, or any ``sklearn.model_selection.BaseCrossValidator``
46
+
- ``CrossConformalClassifier``
47
+
- Cross
48
+
49
+
For more details regarding the difference between split and cross conformal types, see :doc:`split_cross_conformal`
34
50
35
51
2. Method changes
36
52
-----------------
@@ -39,13 +55,13 @@ In MAPIE v1, the conformal prediction workflow is more streamlined and modular,
39
55
40
56
Step 1: Data splitting
41
57
~~~~~~~~~~~~~~~~~~~~~~
42
-
In v0.9, data splitting is handled by MAPIE.
58
+
In v0.x, data splitting is handled by MAPIE.
43
59
44
60
In v1, the data splitting is left to the user for split conformal techniques. The user can split the data into training, conformalization, and test sets using scikit-learn's ``train_test_split`` or other methods.
45
61
46
62
Step 2 & 3: Model training and conformalization (ie: calibration)
In v0.9, the ``fit`` method handled both model training and calibration.
64
+
In v0.x, the ``fit`` method handled both model training and calibration.
49
65
50
66
In v1.0: MAPIE separates between training and calibration. We decided to name the *calibration* step *conformalization*, to avoid confusion with probability calibration.
51
67
@@ -68,91 +84,89 @@ For cross conformal techniques:
68
84
69
85
Step 4: Making predictions (``predict`` and ``predict_interval`` methods)
In MAPIE v0.9, both point predictions and prediction intervals were produced through the ``predict`` method.
87
+
In MAPIE v0.x, both point predictions and prediction intervals were produced through the ``predict`` method.
72
88
73
-
MAPIE v1 introduces a new method for prediction, ``.predict_interval()``, that behaves like v0.9 ``.predict(alpha=...)`` method. Namely, it predicts points and intervals.
89
+
MAPIE v1 introduces a new method for prediction, ``.predict_interval()``, that behaves like v0.x ``.predict(alpha=...)`` method. Namely, it predicts points and intervals.
74
90
The ``.predict()`` method now focuses solely on producing point predictions.
75
91
76
92
77
93
78
-
3. Key parameter changes
94
+
3. Parameters change
79
95
------------------------
80
96
81
-
``conformity_score``
82
-
~~~~~~~~~~~~~~~~~~~~
83
-
A parameter used to specify the scoring approach for evaluating model predictions.
84
-
85
-
- **v0.9**: Only allowed custom objects derived from ``BaseRegressionScore``.
86
-
- **v1**: Now accepts both strings (like ``"absolute"``) for predefined methods and custom ``BaseRegressionScore`` instances, simplifying usage.
87
-
88
97
``alpha``
89
98
~~~~~~~~~~~~~~~~~~~~
90
99
Indicates the desired coverage probability of the prediction intervals.
91
100
92
-
- **v0.9**: Specified as ``alpha`` during prediction, representing error rate.
101
+
- **v0.x**: Specified as ``alpha`` during prediction, representing error rate.
93
102
- **v1**: Replaced with ``confidence_level`` to denote the coverage rate directly. Set at model initialization, improving consistency and clarity. ``confidence_level`` is equivalent to ``1 - alpha``.
94
103
104
+
``cv``
105
+
~~~~~~~
106
+
See the first section of this guide. The ``cv`` parameter is now only declared at cross conformal techniques initialization.
107
+
108
+
``conformity_score``
109
+
~~~~~~~~~~~~~~~~~~~~
110
+
A parameter used to specify the scoring approach for evaluating model predictions.
111
+
112
+
- **v0.x**: Only allowed subclass instances of ``BaseRegressionScore``, like AbsoluteConformityScore()
113
+
- **v1**: Now also accepts strings, like ``"absolute"``.
114
+
95
115
``method``
96
116
~~~~~~~~~~
97
-
Specifies the approach for calculating prediction intervals, especially in advanced models like Cross Conformal and Jackknife After Bootstrap regressors.
117
+
Specifies the approach for calculating prediction intervals for cross conformal techniques.
98
118
99
-
- **v0.9**: Part of ``MapieRegressor``. Configured for the main prediction process.
119
+
- **v0.x**: Part of ``MapieRegressor``. Configured for the main prediction process.
100
120
- **v1**: Specific to ``CrossConformalRegressor`` and ``JackknifeAfterBootstrapRegressor``, indicating the interval calculation approach (``"base"``, ``"plus"``, or ``"minmax"``).
101
121
102
-
``cv``
103
-
~~~~~~~
104
-
The ``cv`` parameter manages the cross-validation configuration, accepting either an integer to indicate the number of data splits or a ``BaseCrossValidator`` object for custom data splitting.
105
-
106
-
- **v0.9**: The ``cv`` parameter was included in ``MapieRegressor``, where it handled cross-validation. The option ``cv="prefit"`` was available for models that were already pre-trained.
107
-
- **v1**: The ``cv`` parameter is now only present in ``CrossConformalRegressor``, with the ``prefit`` option removed.
108
-
109
122
``groups``
110
123
~~~~~~~~~~~
111
124
The ``groups`` parameter is used to specify group labels for cross-validation, ensuring that the same group is not present in both training and conformity sets.
112
125
113
-
- **v0.9**: Passed as a parameter to the ``fit`` method.
126
+
- **v0.x**: Passed as a parameter to the ``fit`` method.
114
127
- **v1**: The ``groups`` present is now only present in ``CrossConformalRegressor``. It is passed in the ``.conformalize()`` method instead of the ``.fit()`` method. In other classes (like ``SplitConformalRegressor``), groups can be directly handled by the user during data splitting.
115
128
116
129
``prefit``
117
130
~~~~~~~~~~
118
131
Controls whether the model has been pre-fitted before applying conformal prediction.
119
132
120
-
- **v0.9**: Indicated through ``cv="prefit"`` in ``MapieRegressor``.
121
-
- **v1**: ``prefit`` is now a separate boolean parameter, allowing explicit control over whether the model has been pre-fitted before applying conformal methods. It is set by default to ``True`` for ``SplitConformalRegressor``, as we believe this will become MAPIE nominal usage.
133
+
- **v0.x**: Indicated through ``cv="prefit"`` in ``MapieRegressor``.
134
+
- **v1**: ``prefit`` is now a separate boolean parameter, allowing explicit control over whether the model has been pre-fitted before conformalizing. It is set by default to ``True`` for ``SplitConformalRegressor``, as we believe this will become MAPIE nominal usage.
122
135
123
136
``fit_params`` (includes ``sample_weight``)
124
137
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
125
138
Dictionary of parameters specifically used during training, such as ``sample_weight`` in scikit-learn.
126
139
127
-
- **v0.9**: Passed additional parameters in a flexible but less explicit manner.
140
+
- **v0.x**: Passed additional parameters in a flexible but less explicit manner.
128
141
- **v1**: Now explicitly structured as a dedicated dictionary, ``fit_params``, ensuring parameters used during training are clearly defined and separated from other stages.
129
142
130
143
``predict_params``
131
144
~~~~~~~~~~~~~~~~~~
132
145
Defines additional parameters exclusively for prediction.
133
146
134
-
- **v0.9**: Passed additional parameters in a flexible but less explicit manner, sometimes mixed within training configurations.
147
+
- **v0.x**: Passed additional parameters in a flexible but less explicit manner, sometimes mixed within training configurations.
135
148
- **v1**: Now structured as a dedicated dictionary, ``predict_params``, to be used during calibration (``conformalize`` method) and prediction stages, ensuring no overlap with training parameters.
The aggregation method and technique for combining predictions in cross conformal methods.
152
+
How to aggregate predictions in cross conformal methods.
140
153
141
-
- **v0.9**: Previously, the ``agg_function`` parameter had two usage: to aggregate predictions when setting ``ensemble=True`` in the ``predict`` method, and to specify the aggregation technique in ``JackknifeAfterBootstrapRegressor``.
154
+
- **v0.x**: Previously, the ``agg_function`` parameter had two usage: to aggregate predictions when setting ``ensemble=True`` in the ``predict`` method, and to specify the aggregation used in ``JackknifeAfterBootstrapRegressor``.
142
155
- **v1**:
156
+
143
157
- The ``agg_function`` parameter has been split into two distinct parameters: ``aggregate_predictions`` and ``aggregation_method``. ``aggregate_predictions`` is specific to ``CrossConformalRegressor``, and it specifies how predictions from multiple conformal regressors are aggregated when making point predictions. ``aggregation_method`` is specific to ``JackknifeAfterBootstrapRegressor``, and it specifies the aggregation technique for combining predictions across different bootstrap samples during conformalization.
144
-
- Note that for both cross conformal methods, predictions points are now computed by default using mean aggregation. This is to avoid prediction points outside of prediction intervals in the default setting.
158
+
- Note that for both cross conformal techniques, predictions points are now computed by default using mean aggregation. This is to avoid prediction points outside of prediction intervals in the default setting.
145
159
146
160
``random_state``
147
161
~~~~~~~~~~~~~~~~~~
148
162
149
-
- **v0.9**: This parameter was used to control the randomness of the data splitting.
163
+
- **v0.x**: This parameter was used to control the randomness of the data splitting.
150
164
- **v1**: This parameter has been removed in cases where data splitting is now manual. Future evolutions may reintroduce it as a general purpose randomness control parameter.
151
165
152
166
``symmetry``
153
167
~~~~~~~~~~~~~~~~~~
154
168
155
-
- **v0.9**: This parameter of the `predict` method of the MapieQuantileRegressor was set to True by default
169
+
- **v0.x**: This parameter of the `predict` method of the MapieQuantileRegressor was set to True by default
156
170
- **v1**: This parameter is now named `symmetric_correction` and is set to False by default, because the resulting intervals are smaller. It is used in the `predict_interval` method of the ConformalizedQuantileRegressor.
157
171
158
172
``optimize_beta``
@@ -164,26 +178,26 @@ None defaults
164
178
~~~~~~~~~~~~~~~~~~~~
165
179
No more parameters with incorrect ``None`` defaults.
166
180
167
-
- **v0.9**: Eg: ``estimator`` had a ``None`` default value, even though the actual default value is ``LinearRegression()``. This was the case for other parameters as well.
181
+
- **v0.x**: Eg: ``estimator`` had a ``None`` default value, even though the actual default value is ``LinearRegression()``. This was the case for other parameters as well.
168
182
- **v1**: All parameters now have explicit defaults.
Below is a side-by-side example of code in MAPIE v0.9 and its equivalent in MAPIE v1 using the new modular classes and methods.
188
+
Below is a side-by-side example of code in MAPIE v0.x and its equivalent in MAPIE v1
175
189
176
190
Example 1: Split Conformal Prediction
177
191
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
178
192
179
193
Description
180
194
############
181
-
Split conformal prediction is a widely used method for generating prediction intervals, it splits the data into training, conformity, and test sets. The model is trained on the training set, calibrated on the conformity set, and then used to make predictions on the test set. In `MAPIE v1`, the `SplitConformalRegressor` replaces the older `MapieRegressor` with a more modular design and simplified API.
195
+
Split conformal prediction is a widely used technique for generating prediction intervals, it splits the data into training, conformity, and test sets. The model is trained on the training set, calibrated on the conformity set, and then used to make predictions on the test set. In `MAPIE v1`, the `SplitConformalRegressor` replaces the older `MapieRegressor` with a more modular design and simplified API.
182
196
183
-
MAPIE v0.9 Code
197
+
MAPIE v0.x Code
184
198
###############
185
199
186
-
Below is a MAPIE v0.9 code for split conformal prediction in case of pre-fitted model:
200
+
Below is a MAPIE v0.x code for split conformal prediction in case of pre-fitted model:
187
201
188
202
.. testcode::
189
203
@@ -250,10 +264,10 @@ Description
250
264
251
265
Cross-conformal prediction extends split conformal prediction by using multiple cross-validation folds to improve the efficiency of the prediction intervals. In MAPIE v1, `CrossConformalRegressor`` replaces the older `MapieRegressor`` for this purpose.
252
266
253
-
MAPIE v0.9 code
267
+
MAPIE v0.x code
254
268
###############
255
269
256
-
Below is a MAPIE v0.9 code for cross-conformal prediction:
270
+
Below is a MAPIE v0.x code for cross-conformal prediction:
0 commit comments