Flow
TEST06d7677483sklearn.impute._base.SimpleImputer

Visibility: public
Uploaded: 10-01-2024 by Continuous Integration
Dependencies: sklearn==0.24.0, numpy>=1.13.3, scipy>=0.19.1, joblib>=0.11, threadpoolctl>=2.0.0
Runs: 0
Tags: openml-python, python, scikit-learn, sklearn, sklearn_0.24.0

Imputation transformer for completing missing values.

Parameters

add_indicator (default: false)
    If True, a :class:`MissingIndicator` transform will stack onto the output of the imputer's transform. This allows a predictive estimator to account for missingness despite imputation. If a feature has no missing values at fit/train time, the feature won't appear on the missing indicator even if there are missing values at transform/test time.

copy (default: true)
    If True, a copy of X will be created. If False, imputation will be done in-place whenever possible. Note that, in the following cases, a new copy will always be made, even if `copy=False`:

    - If X is not an array of floating values;
    - If X is encoded as a CSR matrix;
    - If add_indicator=True.

fill_value (default: null)
    When strategy == "constant", fill_value is used to replace all occurrences of missing_values. If left to the default, fill_value will be 0 when imputing numerical data and "missing_value" for strings or object data types.

missing_values (default: NaN)
    The placeholder for the missing values. All occurrences of `missing_values` will be imputed. For pandas' dataframes with nullable integer dtypes with missing values, `missing_values` should be set to `np.nan`, since `pd.NA` will be converted to `np.nan`.

strategy (default: "median")
    The imputation strategy.

    - If "mean", then replace missing values using the mean along each column. Can only be used with numeric data.
    - If "median", then replace missing values using the median along each column. Can only be used with numeric data.
    - If "most_frequent", then replace missing values using the most frequent value along each column. Can be used with strings or numeric data. If there is more than one such value, only the smallest is returned.
    - If "constant", then replace missing values with fill_value. Can be used with strings or numeric data.

    .. versionadded:: 0.20
       strategy="constant" for fixed value imputation.

verbose (default: 0)
    Controls the verbosity of the imputer.
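The parameters above can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy are installed and uses the public import path `sklearn.impute.SimpleImputer`; the arrays `X_train` and `X_test` are made-up toy data, not part of this flow.

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Hypothetical toy data: column 0 has one missing value at fit time,
# column 1 has none. The test row is missing in both columns.
X_train = np.array([[1.0, 2.0],
                    [np.nan, 4.0],
                    [7.0, 6.0],
                    [3.0, 8.0]])
X_test = np.array([[np.nan, np.nan]])

# strategy="median": each missing entry is replaced by the median of its
# column, computed on the training data only.
median_imputer = SimpleImputer(missing_values=np.nan, strategy="median")
median_imputer.fit(X_train)
X_test_median = median_imputer.transform(X_test)

# strategy="constant": every missing entry is replaced by fill_value.
constant_imputer = SimpleImputer(strategy="constant", fill_value=-1.0)
X_train_constant = constant_imputer.fit_transform(X_train)

# add_indicator=True: binary indicator columns are appended, but only for
# features that had missing values at fit time (here: column 0).
indicator_imputer = SimpleImputer(strategy="median", add_indicator=True)
X_train_flagged = indicator_imputer.fit_transform(X_train)
X_test_flagged = indicator_imputer.transform(X_test)

print(X_test_median)
print(X_train_constant)
print(X_train_flagged.shape, X_test_flagged.shape)
```

In this sketch the test row is missing in column 1 as well, yet the transformed output gains only one indicator column, because column 1 had no missing values when the imputer was fitted.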
