Validate inputs to Keras model #106

stsievert · 2020-10-12T16:23:36Z

Currently, the number of outputs of the Keras model is checked:

Lines 418 to 419 in 9f633f5

    
    if self.model_n_outputs_ != len(self.model_.outputs):
        raise RuntimeError(

It'd be nice if the number of inputs could be checked too (along with their shape and dtype). This should be possible according to the docs, which say a Keras Model has an .inputs attribute: https://keras.io/api/models/model/
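A rough sketch of what such a check might look like. The helper name check_model_inputs is hypothetical and not part of this library, and it assumes each entry of model.inputs exposes a .shape that converts to a tuple of ints/None with the batch dimension first. A SimpleNamespace stands in for a built Keras model so the snippet runs without TensorFlow:

```python
from types import SimpleNamespace

import numpy as np


def check_model_inputs(model, arrays):
    """Hypothetical helper: compare user arrays against ``model.inputs``."""
    specs = list(model.inputs)
    if len(arrays) != len(specs):
        raise ValueError(
            f"Model expects {len(specs)} input(s) but got {len(arrays)}."
        )
    for i, (x, spec) in enumerate(zip(arrays, specs)):
        # Drop the batch dimension; a None entry means "any size".
        want = tuple(spec.shape)[1:]
        got = tuple(x.shape)[1:]
        same_rank = len(want) == len(got)
        if not same_rank or any(
            w is not None and w != g for w, g in zip(want, got)
        ):
            raise ValueError(
                f"Input {i}: expected per-sample shape {want}, got {got}."
            )


# Stand-in for a built Keras model (assumption: one input of shape (None, 4)).
fake_model = SimpleNamespace(
    inputs=[SimpleNamespace(shape=(None, 4), dtype="float32")]
)
check_model_inputs(fake_model, [np.zeros((10, 4))])  # passes silently
```

With a real model, the same function would be called with self.model_ and the arrays produced by the feature transformer; whether tuple(spec.shape) behaves this way for every Keras/TensorFlow version is something I have not verified.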

stsievert · 2020-10-12T16:34:52Z

This would be an internal refactor; I don't think the user would notice any changes (at least not at first). The advantage for the user would be that this would more lightly wrap the Keras API; a lot of the code in #88's _validate_data could be removed. In #88, _validate_data does the following:

1. Runs Scikit-learn's check_X_y.
2. Casts to the appropriate dtype.
3. Checks the shape of X/y.
4. Checks the number of features in X.

I think (3) and (4) could be replaced by checks against the Keras model's inputs (after the user input has been run through the target/feature transformers). I think (2) could be improved by integrating more closely with the Keras model; a check like np.can_cast(X.dtype, self.model_.layers[0].dtype) might give better error messages.
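To make the dtype idea concrete, here is a minimal sketch built on np.can_cast. The helper name check_castable and the error wording are hypothetical; in the wrapper, the target dtype would come from something like self.model_.layers[0].dtype, which is assumed here to be a string such as "float32":

```python
import numpy as np


def check_castable(X, target_dtype, casting="safe"):
    """Hypothetical helper: cast X to the model's dtype or fail clearly."""
    target = np.dtype(target_dtype)
    if not np.can_cast(X.dtype, target, casting=casting):
        raise TypeError(
            f"Cannot cast input of dtype {X.dtype} to the model's dtype "
            f"{target} under casting rule {casting!r}."
        )
    # copy=False avoids a copy when X already has the target dtype.
    return X.astype(target, copy=False)


X = np.arange(6, dtype=np.int16).reshape(3, 2)
X32 = check_castable(X, "float32")  # int16 -> float32 is a safe cast
```

One design question: under the default "safe" rule, float64 data (NumPy's default) would be rejected for a float32 model, so casting="same_kind" may be the more practical choice.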

adriangb · 2020-11-29T08:39:12Z

I am just now seeing this last comment. This sounds interesting, I am +1 on anything that reduces code complexity.

That said, I think this would be a fundamental departure from how Scikit-Learn does these validations? _validate_data is meant to validate that data passed after the first fit/initialization matches what the estimator knows. I'm guessing sklearn does it like this mainly to avoid cryptic failures within estimators, but like you say we may be able to check the "source of truth" (i.e. the Keras model) to achieve the same effect. Got to think about it a bit or see it implemented.

adriangb mentioned this issue Oct 12, 2020

BUG/ENH: Data processing refactor #88

Merged

stsievert changed the title ~~Check number of inputs to Keras model~~ Validate inputs to Keras model Oct 14, 2020

adriangb linked a pull request Nov 29, 2020 that will close this issue

ENH: Check input dimensions against the initialized model_ #143

Open

adriangb mentioned this issue Jun 21, 2021

RFC: Composable input/output pipeline #234

Open
