Gets the average of the given list of values.
the list of values
Double The average value
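A minimal sketch of the list-based average, using plain Scala collections. The object and method names (`AverageSketch`, `averageOf`) are hypothetical and are not taken from this API; the empty-list check is also an assumption.

```scala
object AverageSketch {
  // Hypothetical helper: arithmetic mean of a non-empty list of values.
  def averageOf(values: Seq[Double]): Double = {
    require(values.nonEmpty, "values must not be empty")
    values.sum / values.size
  }
}
```

For example, `AverageSketch.averageOf(Seq(1.0, 2.0, 3.0))` evaluates to `2.0`.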
Gets the average value of the given column.
Dataframe
Column
String The average value, as a string
Returns the name of the column in the src dataframe identified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns the names of the columns in the src dataframe identified by the given column IDs. Columns with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
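The "invalid IDs are ignored" behaviour can be sketched as follows. This assumes that a column ID is a zero-based position in the dataframe's column-name array (in Spark, `DataFrame.columns` returns an `Array[String]`); that interpretation, and the names `ColumnNameSketch` and `columnNamesByIds`, are assumptions rather than this API's definition.

```scala
object ColumnNameSketch {
  // Hypothetical helper: keeps only IDs that point at an existing column,
  // then maps each remaining ID to its column name.
  def columnNamesByIds(allNames: Array[String], ids: Seq[Int]): Array[String] =
    ids.filter(id => id >= 0 && id < allNames.length)
      .map(id => allNames(id))
      .toArray
}
```

With the column names `Array("a", "b", "c")` and the IDs `Seq(0, 5, 2, -1)`, the IDs 5 and -1 fall outside the array and are dropped.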
Gets the number of tuples in the given list of values.
Double The number of tuples
Gets the number of tuples in the given column.
Dataframe
Column
Double The number of tuples
Gets the maximum value in the given list of values.
the list of values
Double The maximum value
Gets the maximum value in the given column.
Dataframe
Column
String The maximum value, as a string
Gets the minimum value in the given list of values.
the list of values
Double The minimum value
Gets the minimum value in the given column.
Dataframe
Column
String The minimum value, as a string
Gets the standard deviation of the given list of values.
the list of values
Double The standard deviation value
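The standard deviation functions in this group can be illustrated with a small sketch. The text does not say whether the population or the sample formula is used; the version below assumes the population formula (dividing by n), and the names `StdSketch` and `stdOf` are hypothetical.

```scala
object StdSketch {
  // Hypothetical helper: population standard deviation (divides by n, not n - 1).
  def stdOf(values: Seq[Double]): Double = {
    require(values.nonEmpty, "values must not be empty")
    val mean = values.sum / values.size
    val variance = values.map(v => (v - mean) * (v - mean)).sum / values.size
    math.sqrt(variance)
  }
}
```

For the values 2, 4, 4, 4, 5, 5, 7, 9 the mean is 5, the squared deviations sum to 32, and the population standard deviation is 2.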
Gets the standard deviation of the given column.
Dataframe
Column
String The standard deviation value, as a string
Checks whether the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks whether the given column name is valid.
dataframe to get names of columns.
column name.
Boolean.
For an age-related column, the outlier information may be determined per age band (10s, 20s, and so on).
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Boxplot] Outlier management table (the Boxplot information for each interval)
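The age-band grouping (10s, 20s, and so on) can be sketched as a mapping from an age to its decade. The names `AgeBandSketch` and `ageBand` are hypothetical, and the half-open range `[lower, upper)` is an assumption about how the bands are delimited.

```scala
object AgeBandSketch {
  // Hypothetical helper: returns the decade band containing the age,
  // as (inclusive lower bound, exclusive upper bound).
  def ageBand(age: Int): (Int, Int) = {
    require(age >= 0, "age must not be negative")
    val lower = (age / 10) * 10
    (lower, lower + 10)
  }
}
```

An age of 27 falls into the band `(20, 30)`, so its outlier statistics would be computed together with the other ages in the 20s.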
Same as makeAgeStatTable(src: DataFrame, columnName: String, method: String), but method is of type AggregationMethod.
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Boxplot] Outlier management table (the Boxplot information for each interval)
For an age-related column, the outlier information may be determined per age band (10s, 20s, and so on).
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Zscore] Outlier management table (the Z-score information for each interval)
Same as makeAgeStatTable(src: DataFrame, columnName: String, method: String), but method is of type AggregationMethod.
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Zscore] Outlier management table (the Z-score information for each interval)
Makes the outlier management table based on the boxplot technique. The table holds outlier information for each numerical interval, as a map of [numerical interval, outlier management info].
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Boxplot] Outlier management table (the Boxplot information for each interval)
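The idea of a per-interval boxplot table can be sketched in plain Scala. Everything named here is hypothetical: `Bucket` and `Fences` stand in for the `Interval` and `Boxplot` types, whose actual fields are not described, and the conventional 1.5 x IQR fences and linear-interpolation quartiles are assumptions, not confirmed behaviour of this API.

```scala
object BoxplotSketch {
  final case class Bucket(lower: Double, upper: Double)                   // stand-in for Interval
  final case class Fences(q1: Double, q3: Double, lo: Double, hi: Double) // stand-in for Boxplot

  // Linear-interpolation quantile on a sorted, non-empty sequence.
  def quantile(sorted: IndexedSeq[Double], p: Double): Double = {
    val pos = p * (sorted.size - 1)
    val i = math.floor(pos).toInt
    val j = math.min(i + 1, sorted.size - 1)
    sorted(i) + (pos - i) * (sorted(j) - sorted(i))
  }

  // Groups values into fixed-width buckets and computes the fences per bucket.
  def boxplotTable(values: Seq[Double], width: Double): Map[Bucket, Fences] =
    values.groupBy(v => math.floor(v / width) * width).map { case (start, vs) =>
      val s = vs.sorted.toIndexedSeq
      val q1 = quantile(s, 0.25)
      val q3 = quantile(s, 0.75)
      val iqr = q3 - q1
      Bucket(start, start + width) -> Fences(q1, q3, q1 - 1.5 * iqr, q3 + 1.5 * iqr)
    }
}
```

A value in a given bucket would then be treated as an outlier when it lies below `lo` or above `hi` for that bucket, and replaced according to the chosen method.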
Makes the outlier management table based on the boxplot technique. The table holds outlier information for each numerical interval, as a map of [numerical interval, outlier management info]. By default, the interval is chosen so that the number of intervals is 10.
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Boxplot] Outlier management table (the Boxplot information for each interval)
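One way to read the default of 10 intervals is that the interval width is the value range divided by 10. The sketch below shows that reading only; the names `IntervalWidthSketch` and `defaultWidth` are hypothetical, and the actual rule used by this API may differ.

```scala
object IntervalWidthSketch {
  // Hypothetical helper: width that splits [min, max] into `buckets` equal intervals.
  def defaultWidth(values: Seq[Double], buckets: Int = 10): Double = {
    require(values.nonEmpty, "values must not be empty")
    require(buckets > 0, "buckets must be positive")
    (values.max - values.min) / buckets
  }
}
```

For values ranging from 0 to 100, this gives a width of 10 per interval.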
Same as makeOutlierMgmtTableBoxplot(src: DataFrame, columnName: String, method: String), but method is of type AggregationMethod.
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Boxplot] Outlier management table (the Boxplot information for each interval)
Makes the outlier management table based on the z-score technique. The table holds outlier information for each numerical interval, as a map of [numerical interval, outlier management info].
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Zscore] Outlier management table (the Z-score information for each interval)
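A per-interval z-score table can be sketched the same way in plain Scala. The names are hypothetical: `Bucket` and `ZStats` stand in for the `Interval` and `Zscore` types, whose real fields are not described. The population standard deviation and the convention of returning 0 when the deviation is zero are assumptions.

```scala
object ZscoreSketch {
  final case class Bucket(lower: Double, upper: Double) // stand-in for Interval
  final case class ZStats(mean: Double, std: Double)    // stand-in for Zscore

  // Groups values into fixed-width buckets and computes mean and std per bucket.
  def zscoreTable(values: Seq[Double], width: Double): Map[Bucket, ZStats] =
    values.groupBy(v => math.floor(v / width) * width).map { case (start, vs) =>
      val mean = vs.sum / vs.size
      val std = math.sqrt(vs.map(v => (v - mean) * (v - mean)).sum / vs.size)
      Bucket(start, start + width) -> ZStats(mean, std)
    }

  // z-score of x against the statistics of its bucket; 0 when std is 0.
  def zscore(x: Double, s: ZStats): Double =
    if (s.std == 0.0) 0.0 else (x - s.mean) / s.std
}
```

A value would then be flagged as an outlier when the absolute z-score exceeds a chosen threshold; the threshold itself is not specified in this reference.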
Makes the outlier management table based on the z-score technique. The table holds outlier information for each numerical interval, as a map of [numerical interval, outlier management info]. By default, the interval is chosen so that the number of intervals is 10.
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Zscore] Outlier management table (the Z-score information for each interval)
Same as makeOutlierMgmtTableZscore(src: DataFrame, columnName: String, method: String), but method is of type AggregationMethod.
Dataframe
Column
Method used to handle outliers, e.g. min, max, avg, std, or count
Map[Interval, Zscore] Outlier management table (the Z-score information for each interval)
This object provides some functions to manage outliers.