Flow
TESTf1d7ce1e10sklearn.impute._base.SimpleImputer

Visibility: public
Uploaded: 10-01-2024 by Continuous Integration
Dependencies: sklearn==0.23.1, numpy>=1.13.3, scipy>=0.19.1, joblib>=0.11, threadpoolctl>=2.0.0
Runs: 0
Tags: openml-python, python, scikit-learn, sklearn, sklearn_0.23.1


Imputation transformer for completing missing values.
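
A minimal sketch of the transformer in use, assuming scikit-learn and NumPy are installed. The array ``X`` and the variable names are illustrative and are not part of this flow; the ``strategy`` value is the one listed for this flow.

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Illustrative data: two numeric columns, each with one missing entry.
X = np.array([
    [1.0, 2.0],
    [np.nan, 3.0],
    [7.0, np.nan],
    [4.0, 8.0],
])

# Replace NaN entries with the per-column median learned during fit.
imputer = SimpleImputer(missing_values=np.nan, strategy="median")
X_imputed = imputer.fit_transform(X)

# Column medians over the observed values: [4.0, 3.0]
print(imputer.statistics_)
print(X_imputed)
```

After fitting, ``statistics_`` holds the per-column fill values, so the same imputer can be applied to new data with ``transform``.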

Parameters

add_indicator
    If True, a :class:`MissingIndicator` transform will stack onto output of the imputer's transform. This allows a predictive estimator to account for missingness despite imputation. If a feature has no missing values at fit/train time, the feature won't appear on the missing indicator even if there are missing values at transform/test time.

    Default: false
copy
    If True, a copy of X will be created. If False, imputation will be done in-place whenever possible. Note that, in the following cases, a new copy will always be made, even if `copy=False`:

    - If X is not an array of floating values;
    - If X is encoded as a CSR matrix;
    - If add_indicator=True.

    Default: true
fill_value
    When strategy == "constant", fill_value is used to replace all occurrences of missing_values. If left to the default, fill_value will be 0 when imputing numerical data and "missing_value" for strings or object data types.

    Default: null
missing_values
    The placeholder for the missing values. All occurrences of `missing_values` will be imputed. For pandas' dataframes with nullable integer dtypes with missing values, `missing_values` should be set to `np.nan`, since `pd.NA` will be converted to `np.nan`.

    Default: NaN
strategy
    The imputation strategy.

    - If "mean", then replace missing values using the mean along each column. Can only be used with numeric data.
    - If "median", then replace missing values using the median along each column. Can only be used with numeric data.
    - If "most_frequent", then replace missing values using the most frequent value along each column. Can be used with strings or numeric data.
    - If "constant", then replace missing values with fill_value. Can be used with strings or numeric data.

    .. versionadded:: 0.20
       strategy="constant" for fixed value imputation.

    Default: "median"
verbose
    Controls the verbosity of the imputer.

    Default: 0
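
The interaction between the parameters above can be sketched as follows, assuming scikit-learn and NumPy are installed. The arrays ``X_train``, ``X_test`` and ``X_cat`` and all variable names are illustrative, not part of this flow. The first part shows that ``add_indicator=True`` only adds indicator columns for features that had missing values at fit time; the second shows the default ``fill_value`` for object data under ``strategy="constant"``.

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Only column 0 has a missing value at fit time.
X_train = np.array([
    [1.0, 10.0],
    [np.nan, 20.0],
    [3.0, 30.0],
])
# At transform time, both columns are missing.
X_test = np.array([[np.nan, np.nan]])

ind_imputer = SimpleImputer(strategy="median", add_indicator=True)
ind_imputer.fit(X_train)
out = ind_imputer.transform(X_test)

# Two imputed columns (medians 2.0 and 20.0) plus one indicator column
# for column 0; column 1 gets no indicator because it was complete at fit.
print(out)  # [[ 2. 20.  1.]]

# Constant imputation on object data, leaving fill_value at its default.
X_cat = np.array([["red"], [np.nan], ["blue"]], dtype=object)
cat_imputer = SimpleImputer(strategy="constant")
X_cat_filled = cat_imputer.fit_transform(X_cat)
print(X_cat_filled.ravel().tolist())  # ['red', 'missing_value', 'blue']
```

If missingness that first appears at test time also needs to be flagged, one option is to fit on data where every relevant feature has at least one missing value.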
