specifies which binning method to use.
n Bucket binning creates equal-length bins
and assigns the data to one of these bins.
You can choose the number of bins during
the binning. The default number of bins
(the binning level) is 16.
n Winsorized binning is similar to bucket
binning except that both tails are cut off to
obtain a smooth binning result. This
technique is often used to remove outliers
during the data preparation stage.
You must specify a value for the Winsor
rate option. Valid values are from 0.0 to 0.5
n Pseudo-quantile binning mimics the
results of the quantile binning method but
is more efficient by consuming less CPU
time and memory.
Select tables to display
In the results, you can specify whether to
include no tables, the default tables for the
task, or customized tables.
If you create customized tables, you can
choose from these options:
n Basic statistics displays the mean,
pseudo-median, standard deviation,
minimum, maximum, and number of bins
for each binning variable.
n Quantile statistics displays the estimated
quantiles and extremes table.
Output Data Sets
You can specify whether to save the results to an output table. If the table is created, it is
saved in the Work library by default. In the Additional variables to include in the output
data set role, specify any columns from the input data set that you want to include in the
output data set.
Chapter 9 9 / High-Performance Tasks