Optimizer parameters validation improvement #542

himkwtn · 2024-08-07T19:24:11Z

Right now, parameter validation is done in the constructor, as follows:

Lines 138 to 163 in 3e8a445

    
           def __init__( 
        
               self, 
        
               threshold=0.1, 
        
               thresholds=None, 
        
               nu=1.0, 
        
               tol=1e-5, 
        
               thresholder="L0", 
        
               trimming_fraction=0.0, 
        
               trimming_step_size=1.0, 
        
               max_iter=30, 
        
               copy_X=True, 
        
               initial_guess=None, 
        
               normalize_columns=False, 
        
               verbose=False, 
        
               unbias=False, 
        
           ): 
        
               super(SR3, self).__init__( 
        
                   max_iter=max_iter, 
        
                   initial_guess=initial_guess, 
        
                   copy_X=copy_X, 
        
                   normalize_columns=normalize_columns, 
        
                   unbias=unbias, 
        
               ) 
        
               if threshold < 0: 
        
                   raise ValueError("threshold cannot be negative")

According to the scikit-learn documentation, validation should be done in the fit method: set_params assigns parameters directly on the estimator, so it bypasses any validation in the constructor.

Reproducing code example:

from pysindy.optimizers import SR3
opt = SR3(threshold=-1)
# raises "ValueError: threshold cannot be negative"

from pysindy.optimizers import SR3
opt = SR3()
opt.set_params(threshold=-1)
# no error
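
To illustrate why the second snippet raises nothing, here is a minimal sketch. ToyOptimizer is a hypothetical stand-in, not pysindy's SR3, and its set_params only mimics the scikit-learn convention of assigning attributes without re-running the constructor:

```python
class ToyOptimizer:
    """Hypothetical stand-in for an optimizer; not pysindy's SR3."""

    def __init__(self, threshold=0.1):
        # Validation lives in the constructor, as in the excerpt above.
        if threshold < 0:
            raise ValueError("threshold cannot be negative")
        self.threshold = threshold

    def set_params(self, **params):
        # Assumed to mirror the scikit-learn convention: plain attribute
        # assignment, so __init__ (and its checks) never runs again.
        for name, value in params.items():
            setattr(self, name, value)
        return self


opt = ToyOptimizer()
opt.set_params(threshold=-1)
bypassed = opt.threshold  # the invalid value is stored without complaint
```

The invalid threshold only surfaces later, if at all, wherever the fitting code happens to use it.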

Jacob-Stevens-Haas · 2024-08-08T17:12:45Z

This is a good point, although fairly low impact: I believe set_params() is only used in grid search, which is not done much with SINDy, and we could cheat by running the same validation in set_params() that we run in __init__(). That said, this is something we want, and the way to do it is:

1. List all classes that break the scikit-learn API in differentiation, feature library, and optimizer.
2. Extract all validation into a helper function, called at the end of __init__().
3. Modify the tests for bad argument combinations so that they require __init__() to pass.
4. Move the validation from __init__() to fit().
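
The end state of those steps might look roughly like the sketch below. The names LazyValidatedOptimizer and _validate_threshold are made up for illustration and are not part of pysindy; the class is a toy that stores parameters in __init__ and validates them only in fit():

```python
def _validate_threshold(threshold):
    """Hypothetical helper holding the check extracted from __init__."""
    if threshold < 0:
        raise ValueError("threshold cannot be negative")


class LazyValidatedOptimizer:
    """Toy estimator (not pysindy code) that validates at fit time."""

    def __init__(self, threshold=0.1):
        # Store only; no validation here, so set_params and __init__
        # behave the same way.
        self.threshold = threshold

    def set_params(self, **params):
        # Plain attribute assignment, mimicking scikit-learn's convention.
        for name, value in params.items():
            setattr(self, name, value)
        return self

    def fit(self, x, y):
        # Every path to a bad parameter ends up being checked here.
        _validate_threshold(self.threshold)
        # Actual fitting is omitted in this sketch.
        return self


# Construction and set_params both succeed with a bad value...
opt = LazyValidatedOptimizer().set_params(threshold=-1)

# ...and the error is raised once fit() is called.
try:
    opt.fit(x=[[0.0]], y=[[0.0]])
    fit_error = None
except ValueError as exc:
    fit_error = str(exc)
```

With this layout, a test for a bad argument would construct the object successfully and assert that fit() raises, which is what the third step asks for.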

himkwtn added the enhancement New feature or request label Aug 7, 2024
