public interface QuantileDiscretizerBase extends Params, HasHandleInvalid, HasInputCol, HasOutputCol
QuantileDiscretizer
.Modifier and Type | Method and Description |
---|---|
int |
getNumBuckets() |
int[] |
getNumBucketsArray() |
double |
getRelativeError() |
Param<String> |
handleInvalid()
Param for how to handle invalid entries.
|
IntParam |
numBuckets()
Number of buckets (quantiles, or categories) into which data points are grouped.
|
IntArrayParam |
numBucketsArray()
Array of number of buckets (quantiles, or categories) into which data points are grouped.
|
DoubleParam |
relativeError()
Relative error (see documentation for
org.apache.spark.sql.DataFrameStatFunctions.approxQuantile for description)
Must be in the range [0, 1]. |
getHandleInvalid
getInputCol, inputCol
getOutputCol, outputCol
clear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn
toString, uid
int getNumBuckets()
int[] getNumBucketsArray()
double getRelativeError()
Param<String> handleInvalid()
handleInvalid
in interface HasHandleInvalid
IntParam numBuckets()
See also handleInvalid
, which can optionally create an additional bucket for NaN values.
default: 2
IntArrayParam numBucketsArray()
See also handleInvalid
, which can optionally create an additional bucket for NaN values.
DoubleParam relativeError()
org.apache.spark.sql.DataFrameStatFunctions.approxQuantile
for description)
Must be in the range [0, 1].
Note that in multiple columns case, relative error is applied to all columns.
default: 0.001