Version 0.11.1
Changelog
Bug fixes
- Fix a bug in `SMOTENC` where the entries of the one-hot encoding should be divided by `sqrt(2)` and not `2`, taking into account that they are plugged into a Euclidean distance computation. #1014 by Guillaume Lemaitre.
- Raise an informative error message when all support vectors are tagged as noise in `SVMSMOTE`. #1016 by Guillaume Lemaitre.
- Fix a bug in `SMOTENC` where the median of the standard deviations of the continuous features was only computed on the minority class. This statistic is now computed for each class that is up-sampled. #1015 by Guillaume Lemaitre.
- Fix a bug in `SMOTENC` so that the case where the median of the standard deviations of the continuous features is zero is also handled in the multiclass case. #1015 by Guillaume Lemaitre.
- Fix a bug in `BorderlineSMOTE` version 2 where samples should be generated from the whole dataset and not only from the minority class. #1023 by Guillaume Lemaitre.
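
The two `SMOTENC` fixes above (per-class statistic, division by `sqrt(2)`) can be illustrated with a small standard-library sketch. It is not the library's implementation: the toy data, the helper names `median_of_feature_std` and `one_hot_distance`, and the use of the population standard deviation are assumptions made for illustration. It only shows why scaling the active one-hot entry by `median / sqrt(2)` makes a category mismatch contribute exactly one median to the Euclidean distance, whereas `median / 2` under-weights it.

```python
import math
import statistics

# Hypothetical toy data: continuous features only, grouped by class label.
continuous_by_class = {
    "minority_a": [[1.0, 10.0, 0.5], [3.0, 14.0, 0.7], [5.0, 18.0, 0.9]],
    "minority_b": [[2.0, 1.0, 4.0], [2.5, 2.0, 8.0], [3.0, 3.0, 12.0]],
}


def median_of_feature_std(rows):
    """Median of the per-feature (population) standard deviations."""
    columns = zip(*rows)
    return statistics.median(statistics.pstdev(col) for col in columns)


def one_hot_distance(value):
    """Euclidean distance between two one-hot rows encoding different
    categories, when the active entry is `value` instead of 1."""
    return math.dist([value, 0.0], [0.0, value])


# One statistic per up-sampled class, not only for a single minority class.
medians = {
    label: median_of_feature_std(rows)
    for label, rows in continuous_by_class.items()
}

for label, median in medians.items():
    with_half = one_hot_distance(median / 2)
    with_sqrt2 = one_hot_distance(median / math.sqrt(2))
    print(f"{label}: median={median:.4f} "
          f"distance(/2)={with_half:.4f} distance(/sqrt(2))={with_sqrt2:.4f}")
```

Because two different categories differ in two one-hot positions, the mismatch contributes `sqrt(2) * value` to the distance, so `value = median / sqrt(2)` is the scaling that yields a contribution of one median.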
Version 0.11.0
July 8, 2023
Changelog
Bug fixes
- Fix a bug in `classification_report_imbalanced` where the parameter `target_names` was not taken into account when `output_dict=True`. #989 by AYY7.
- `SMOTENC` now handles mixed data types such as `bool` and `pd.category` by delegating the conversion to the scikit-learn encoder. #1002 by Guillaume Lemaitre.
- Handle sparse matrices in `SMOTEN` and raise a warning since it requires a conversion to dense matrices. #1003 by Guillaume Lemaitre.
- Remove a spurious warning raised when the minority class gets over-sampled beyond the number of samples in the majority class. #1007 by Guillaume Lemaitre.
Compatibility
- Maintenance release for compatibility with scikit-learn >= 1.3.0. #999 by Guillaume Lemaitre.
Deprecation
- The fitted attribute `ohe_` in `SMOTENC` is deprecated and will be removed in version 0.13. Use `categorical_encoder_` instead. #1000 by Guillaume Lemaitre.
- The defaults of the parameters `sampling_strategy`, `bootstrap` and `replacement` will change in `BalancedRandomForestClassifier` to follow the implementation of the original paper. These changes will take effect in version 0.13. #1006 by Guillaume Lemaitre.
Enhancements
- `SMOTENC` now accepts a parameter `categorical_encoder` to specify a `OneHotEncoder` with custom parameters. #1000 by Guillaume Lemaitre.
- `SMOTEN` now accepts a parameter `categorical_encoder` to specify an `OrdinalEncoder` with custom parameters. A new fitted attribute `categorical_encoder_` is exposed to access the fitted encoder. #1001 by Guillaume Lemaitre.
- `RandomUnderSampler` and `RandomOverSampler` (when `shrinkage is not None`) now accept any data types and will not attempt any data conversion. #1004 by Guillaume Lemaitre.
- `SMOTENC` now supports passing an array-like of `str` as the `categorical_features` parameter. #1008 by Guillaume Lemaitre.
- `SMOTENC` now supports automatic categorical inference when `categorical_features` is set to `"auto"`. #1009 by Guillaume Lemaitre.