Skip to content

Commit 2c61419

Browse files
DOC: Improve migration guide: (#627)
- Improve overview of new classes by adding a correspondence table with v0.x classes. Start introducing classification. - Use v0.x instead of v0.9. - Replace `method` by `technique` when referring to conformal inference methods. - Other minor improvements and clarifications
1 parent af5298f commit 2c61419

File tree

1 file changed

+80
-66
lines changed

1 file changed

+80
-66
lines changed

doc/v1_migration_guide.rst

+80-66
Original file line numberDiff line numberDiff line change
@@ -1,36 +1,52 @@
11
Migrating to MAPIE v1
22
===========================================
33

4-
MAPIE v1 introduces several updates, enhancements, and structural changes that simplify the API by breaking down ``MapieRegressor`` functionality into dedicated classes for different conformal prediction methods. This guide outlines the key differences between MAPIE v0.9 and MAPIE v1 and provides instructions for adapting your code to the new structure.
4+
MAPIE v1 introduces several updates, enhancements, and structural changes that simplify the API by breaking down ``MapieRegressor`` and ``MapieClassifier`` into dedicated classes for different conformal prediction techniques.
5+
6+
This guide outlines the differences between MAPIE v0.x and MAPIE v1 and provides instructions for migrating your code to the new API.
57

68
1. Overview of class restructuring
79
-----------------------------------
810

9-
MAPIE v1 organizes the ``MapieRegressor`` functionality into specific regressor classes, each optimized for a particular type of conformal prediction:
10-
11-
- ``SplitConformalRegressor``: Handles split conformal prediction.
12-
- ``CrossConformalRegressor``: Implements cross-validation-based conformal prediction.
13-
- ``JackknifeAfterBootstrapRegressor``: Supports jackknife-after-bootstrap conformal prediction.
14-
- ``ConformalizedQuantileRegressor``: For quantile-based conformal prediction.
15-
16-
This modular approach makes it easier to select and configure a specific conformal regression technique. Each class includes parameters relevant to its own methodology, reducing redundancy and improving readability.
17-
18-
Migration summary of ``MapieRegressor`` to new classes
19-
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
20-
21-
In MAPIE v0.9, ``MapieRegressor`` managed all conformal regression techniques under a single interface, which sometimes led to parameter redundancy and ambiguity. In MAPIE v1, each method-specific class includes only the parameters and techniques relevant to its technique.
22-
23-
+--------------------+--------------------------------------------------------------------------+
24-
| MAPIE v0.9 Class | MAPIE v1 Classes |
25-
+====================+==========================================================================+
26-
| ``MapieRegressor`` | ``SplitConformalRegressor``, ``CrossConformalRegressor``, |
27-
| | |
28-
| | ``JackknifeAfterBootstrapRegressor``, ``ConformalizedQuantileRegressor`` |
29-
+--------------------+--------------------------------------------------------------------------+
30-
31-
In the following guide:
32-
- "Split conformal techniques" refer to ``SplitConformalRegressor`` and ``ConformalizedQuantileRegressor``
33-
- "Cross-conformal techniques" refer to ``CrossConformalRegressor`` and ``JackknifeAfterBootstrapRegressor``
11+
MAPIE v1 breaks down the ``MapieRegressor`` and ``MapieClassifier`` classes into 5 classes, each dedicated to a particular conformal prediction technique. ``MapieQuantileRegressor`` has also been revamped, and renamed ``ConformalizedQuantileRegressor``.
12+
13+
The rationale behind this is that ``MapieRegressor`` and ``MapieClassifier`` managed several conformal techniques under a single interface, which led to parameter redundancy and ambiguity. In MAPIE v1, each class includes only the relevant parameters specific to its technique.
14+
15+
The ``cv`` parameter is key to understand what new class to use in the v1 API:
16+
17+
.. list-table:: Mapie v0.x -> v1 classes correspondence
18+
:header-rows: 1
19+
20+
* - v0.x class
21+
- ``cv`` parameter value
22+
- Corresponding v1 class
23+
- Conformal prediction type
24+
* - ``MapieRegressor``
25+
- ``"split"`` or ``"prefit"``
26+
- ``SplitConformalRegressor``
27+
- Split
28+
* - ``MapieRegressor``
29+
- ``None``, integer, or any ``sklearn.model_selection.BaseCrossValidator``
30+
- ``CrossConformalRegressor``
31+
- Cross
32+
* - ``MapieRegressor``
33+
- ``subsample.Subsample``
34+
- ``JackknifeAfterBootstrapRegressor``
35+
- Cross
36+
* - ``MapieQuantileRegressor``
37+
- ``None``, ``"split"`` or ``"prefit"``
38+
- ``ConformalizedQuantileRegressor``
39+
- Split
40+
* - ``MapieClassifier``
41+
- ``"split"`` or ``"prefit"``
42+
- ``SplitConformalClassifier``
43+
- Split
44+
* - ``MapieClassifier``
45+
- ``None``, integer, or any ``sklearn.model_selection.BaseCrossValidator``
46+
- ``CrossConformalClassifier``
47+
- Cross
48+
49+
For more details regarding the difference between split and cross conformal types, see :doc:`split_cross_conformal`
3450

3551
2. Method changes
3652
-----------------
@@ -39,13 +55,13 @@ In MAPIE v1, the conformal prediction workflow is more streamlined and modular,
3955

4056
Step 1: Data splitting
4157
~~~~~~~~~~~~~~~~~~~~~~
42-
In v0.9, data splitting is handled by MAPIE.
58+
In v0.x, data splitting is handled by MAPIE.
4359

4460
In v1, the data splitting is left to the user for split conformal techniques. The user can split the data into training, conformalization, and test sets using scikit-learn's ``train_test_split`` or other methods.
4561

4662
Step 2 & 3: Model training and conformalization (ie: calibration)
4763
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
48-
In v0.9, the ``fit`` method handled both model training and calibration.
64+
In v0.x, the ``fit`` method handled both model training and calibration.
4965

5066
In v1.0: MAPIE separates between training and calibration. We decided to name the *calibration* step *conformalization*, to avoid confusion with probability calibration.
5167

@@ -68,91 +84,89 @@ For cross conformal techniques:
6884

6985
Step 4: Making predictions (``predict`` and ``predict_interval`` methods)
7086
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
71-
In MAPIE v0.9, both point predictions and prediction intervals were produced through the ``predict`` method.
87+
In MAPIE v0.x, both point predictions and prediction intervals were produced through the ``predict`` method.
7288

73-
MAPIE v1 introduces a new method for prediction, ``.predict_interval()``, that behaves like v0.9 ``.predict(alpha=...)`` method. Namely, it predicts points and intervals.
89+
MAPIE v1 introduces a new method for prediction, ``.predict_interval()``, that behaves like v0.x ``.predict(alpha=...)`` method. Namely, it predicts points and intervals.
7490
The ``.predict()`` method now focuses solely on producing point predictions.
7591

7692

7793

78-
3. Key parameter changes
94+
3. Parameters change
7995
------------------------
8096

81-
``conformity_score``
82-
~~~~~~~~~~~~~~~~~~~~
83-
A parameter used to specify the scoring approach for evaluating model predictions.
84-
85-
- **v0.9**: Only allowed custom objects derived from ``BaseRegressionScore``.
86-
- **v1**: Now accepts both strings (like ``"absolute"``) for predefined methods and custom ``BaseRegressionScore`` instances, simplifying usage.
87-
8897
``alpha``
8998
~~~~~~~~~~~~~~~~~~~~
9099
Indicates the desired coverage probability of the prediction intervals.
91100

92-
- **v0.9**: Specified as ``alpha`` during prediction, representing error rate.
101+
- **v0.x**: Specified as ``alpha`` during prediction, representing error rate.
93102
- **v1**: Replaced with ``confidence_level`` to denote the coverage rate directly. Set at model initialization, improving consistency and clarity. ``confidence_level`` is equivalent to ``1 - alpha``.
94103

104+
``cv``
105+
~~~~~~~
106+
See the first section of this guide. The ``cv`` parameter is now only declared at cross conformal techniques initialization.
107+
108+
``conformity_score``
109+
~~~~~~~~~~~~~~~~~~~~
110+
A parameter used to specify the scoring approach for evaluating model predictions.
111+
112+
- **v0.x**: Only allowed subclass instances of ``BaseRegressionScore``, like AbsoluteConformityScore()
113+
- **v1**: Now also accepts strings, like ``"absolute"``.
114+
95115
``method``
96116
~~~~~~~~~~
97-
Specifies the approach for calculating prediction intervals, especially in advanced models like Cross Conformal and Jackknife After Bootstrap regressors.
117+
Specifies the approach for calculating prediction intervals for cross conformal techniques.
98118

99-
- **v0.9**: Part of ``MapieRegressor``. Configured for the main prediction process.
119+
- **v0.x**: Part of ``MapieRegressor``. Configured for the main prediction process.
100120
- **v1**: Specific to ``CrossConformalRegressor`` and ``JackknifeAfterBootstrapRegressor``, indicating the interval calculation approach (``"base"``, ``"plus"``, or ``"minmax"``).
101121

102-
``cv``
103-
~~~~~~~
104-
The ``cv`` parameter manages the cross-validation configuration, accepting either an integer to indicate the number of data splits or a ``BaseCrossValidator`` object for custom data splitting.
105-
106-
- **v0.9**: The ``cv`` parameter was included in ``MapieRegressor``, where it handled cross-validation. The option ``cv="prefit"`` was available for models that were already pre-trained.
107-
- **v1**: The ``cv`` parameter is now only present in ``CrossConformalRegressor``, with the ``prefit`` option removed.
108-
109122
``groups``
110123
~~~~~~~~~~~
111124
The ``groups`` parameter is used to specify group labels for cross-validation, ensuring that the same group is not present in both training and conformity sets.
112125

113-
- **v0.9**: Passed as a parameter to the ``fit`` method.
126+
- **v0.x**: Passed as a parameter to the ``fit`` method.
114127
- **v1**: The ``groups`` present is now only present in ``CrossConformalRegressor``. It is passed in the ``.conformalize()`` method instead of the ``.fit()`` method. In other classes (like ``SplitConformalRegressor``), groups can be directly handled by the user during data splitting.
115128

116129
``prefit``
117130
~~~~~~~~~~
118131
Controls whether the model has been pre-fitted before applying conformal prediction.
119132

120-
- **v0.9**: Indicated through ``cv="prefit"`` in ``MapieRegressor``.
121-
- **v1**: ``prefit`` is now a separate boolean parameter, allowing explicit control over whether the model has been pre-fitted before applying conformal methods. It is set by default to ``True`` for ``SplitConformalRegressor``, as we believe this will become MAPIE nominal usage.
133+
- **v0.x**: Indicated through ``cv="prefit"`` in ``MapieRegressor``.
134+
- **v1**: ``prefit`` is now a separate boolean parameter, allowing explicit control over whether the model has been pre-fitted before conformalizing. It is set by default to ``True`` for ``SplitConformalRegressor``, as we believe this will become MAPIE nominal usage.
122135

123136
``fit_params`` (includes ``sample_weight``)
124137
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
125138
Dictionary of parameters specifically used during training, such as ``sample_weight`` in scikit-learn.
126139

127-
- **v0.9**: Passed additional parameters in a flexible but less explicit manner.
140+
- **v0.x**: Passed additional parameters in a flexible but less explicit manner.
128141
- **v1**: Now explicitly structured as a dedicated dictionary, ``fit_params``, ensuring parameters used during training are clearly defined and separated from other stages.
129142

130143
``predict_params``
131144
~~~~~~~~~~~~~~~~~~
132145
Defines additional parameters exclusively for prediction.
133146

134-
- **v0.9**: Passed additional parameters in a flexible but less explicit manner, sometimes mixed within training configurations.
147+
- **v0.x**: Passed additional parameters in a flexible but less explicit manner, sometimes mixed within training configurations.
135148
- **v1**: Now structured as a dedicated dictionary, ``predict_params``, to be used during calibration (``conformalize`` method) and prediction stages, ensuring no overlap with training parameters.
136149

137150
``agg_function`` and ``ensemble``
138151
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
139-
The aggregation method and technique for combining predictions in cross conformal methods.
152+
How to aggregate predictions in cross conformal methods.
140153

141-
- **v0.9**: Previously, the ``agg_function`` parameter had two usage: to aggregate predictions when setting ``ensemble=True`` in the ``predict`` method, and to specify the aggregation technique in ``JackknifeAfterBootstrapRegressor``.
154+
- **v0.x**: Previously, the ``agg_function`` parameter had two usage: to aggregate predictions when setting ``ensemble=True`` in the ``predict`` method, and to specify the aggregation used in ``JackknifeAfterBootstrapRegressor``.
142155
- **v1**:
156+
143157
- The ``agg_function`` parameter has been split into two distinct parameters: ``aggregate_predictions`` and ``aggregation_method``. ``aggregate_predictions`` is specific to ``CrossConformalRegressor``, and it specifies how predictions from multiple conformal regressors are aggregated when making point predictions. ``aggregation_method`` is specific to ``JackknifeAfterBootstrapRegressor``, and it specifies the aggregation technique for combining predictions across different bootstrap samples during conformalization.
144-
- Note that for both cross conformal methods, predictions points are now computed by default using mean aggregation. This is to avoid prediction points outside of prediction intervals in the default setting.
158+
- Note that for both cross conformal techniques, predictions points are now computed by default using mean aggregation. This is to avoid prediction points outside of prediction intervals in the default setting.
145159

146160
``random_state``
147161
~~~~~~~~~~~~~~~~~~
148162

149-
- **v0.9**: This parameter was used to control the randomness of the data splitting.
163+
- **v0.x**: This parameter was used to control the randomness of the data splitting.
150164
- **v1**: This parameter has been removed in cases where data splitting is now manual. Future evolutions may reintroduce it as a general purpose randomness control parameter.
151165

152166
``symmetry``
153167
~~~~~~~~~~~~~~~~~~
154168

155-
- **v0.9**: This parameter of the `predict` method of the MapieQuantileRegressor was set to True by default
169+
- **v0.x**: This parameter of the `predict` method of the MapieQuantileRegressor was set to True by default
156170
- **v1**: This parameter is now named `symmetric_correction` and is set to False by default, because the resulting intervals are smaller. It is used in the `predict_interval` method of the ConformalizedQuantileRegressor.
157171

158172
``optimize_beta``
@@ -164,26 +178,26 @@ None defaults
164178
~~~~~~~~~~~~~~~~~~~~
165179
No more parameters with incorrect ``None`` defaults.
166180

167-
- **v0.9**: Eg: ``estimator`` had a ``None`` default value, even though the actual default value is ``LinearRegression()``. This was the case for other parameters as well.
181+
- **v0.x**: Eg: ``estimator`` had a ``None`` default value, even though the actual default value is ``LinearRegression()``. This was the case for other parameters as well.
168182
- **v1**: All parameters now have explicit defaults.
169183

170184

171-
4. Migration example: MAPIE v0.9 to MAPIE v1
185+
4. Migration example: MAPIE v0.x to MAPIE v1
172186
----------------------------------------------------------------------------------------
173187

174-
Below is a side-by-side example of code in MAPIE v0.9 and its equivalent in MAPIE v1 using the new modular classes and methods.
188+
Below is a side-by-side example of code in MAPIE v0.x and its equivalent in MAPIE v1
175189

176190
Example 1: Split Conformal Prediction
177191
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
178192

179193
Description
180194
############
181-
Split conformal prediction is a widely used method for generating prediction intervals, it splits the data into training, conformity, and test sets. The model is trained on the training set, calibrated on the conformity set, and then used to make predictions on the test set. In `MAPIE v1`, the `SplitConformalRegressor` replaces the older `MapieRegressor` with a more modular design and simplified API.
195+
Split conformal prediction is a widely used technique for generating prediction intervals, it splits the data into training, conformity, and test sets. The model is trained on the training set, calibrated on the conformity set, and then used to make predictions on the test set. In `MAPIE v1`, the `SplitConformalRegressor` replaces the older `MapieRegressor` with a more modular design and simplified API.
182196

183-
MAPIE v0.9 Code
197+
MAPIE v0.x Code
184198
###############
185199

186-
Below is a MAPIE v0.9 code for split conformal prediction in case of pre-fitted model:
200+
Below is a MAPIE v0.x code for split conformal prediction in case of pre-fitted model:
187201

188202
.. testcode::
189203

@@ -250,10 +264,10 @@ Description
250264

251265
Cross-conformal prediction extends split conformal prediction by using multiple cross-validation folds to improve the efficiency of the prediction intervals. In MAPIE v1, `CrossConformalRegressor`` replaces the older `MapieRegressor`` for this purpose.
252266

253-
MAPIE v0.9 code
267+
MAPIE v0.x code
254268
###############
255269

256-
Below is a MAPIE v0.9 code for cross-conformal prediction:
270+
Below is a MAPIE v0.x code for cross-conformal prediction:
257271

258272
.. testcode::
259273

0 commit comments

Comments
 (0)