ksb.csle.didentification.privacy
Object that contains message ksb.csle.common.proto.StreamDidentProto.AggregationInfo AggregationInfo contains attributes as follows:
enum AggregationMethod { MIN = 0; AVG = 1; MAX = 2; STD = 3; COUNT = 4; MANUAL = 5; } message AggregationInfo { repeated int32 selectedColumnId = 1; required AggregationMethod method = 2 [default = AVG]; repeated FieldInfo fieldInfo = 3; optional PrivacyCheckInfo check = 4; }
Replaces the values of the column containing numerical data with statistic information using 'aggrType' method such as min, max, avg, std, and count
Replaces the values of the column containing numerical data with statistic information using 'aggrType' method such as min, max, avg, std, and count
Dataframe to anonymize
Column to be anonymized
Methods of aggregation module. ex., min, max, avg, std, and count
DataFrame Anonymized dataframe
Anonymizes the column specified in src dataframe using generic 'Type' method.
Anonymizes the column specified in src dataframe using generic 'Type' method. The 'Type' is decided by inherited object module.
Dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Replaces the values of the column with statistic information using 'aggrType' method such as min, max, avg, std, and count.
Replaces the values of the column with statistic information using 'aggrType' method such as min, max, avg, std, and count. If the type of column is string, this module call the hiding module internally.
Dataframe to anonymize
Column to be anonymized
DataFrame Anonymized dataframe
Returns column name from src dataframe specified by the column ID defined by protobuf.
Returns column name from src dataframe specified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns column names from src dataframe specified by column IDs.
Returns column names from src dataframe specified by column IDs. Note that the column with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
Checks the given column ID is valid.
Checks the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks the given column Name is valid.
Checks the given column Name is valid.
dataframe to get names of columns.
column Name.
Boolean.
Operates aggregation module for basic de-identification
Operates aggregation module for basic de-identification
Input dataframe
DataFrame Anonymized dataframe
:: ApplicationDeveloperApi ::
Operator that implements the aggregation module in the Aggregation algorithm. It replaces the values of the data with some statistic values such as min, max, avg, std, or count. If the type of record is string containing numerical value, this module extracts only numerical value and then applies this function on it.