ksb.csle.didentification.privacy
Object that contains message ksb.csle.common.proto.StreamDidentProto.DataRangeInfo DataRangeInfo contains attributes as follows:
enum AggregationMethod { MIN = 0; AVG = 1; MAX = 2; STD = 3; COUNT = 4; MANUAL = 5; } message DataRangeInfo { repeated int32 selectedColumnId = 1; required int32 rangeStep = 2 [default = 10]; required AggregationMethod method = 3 [default = AVG]; repeated FieldInfo fieldInfo = 4; optional PrivacyCheckInfo check = 5; }
Anonymizes the column specified in src dataframe using generic 'Type' method.
Anonymizes the column specified in src dataframe using generic 'Type' method. The 'Type' is decided by inherited object module.
Dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Represents the values of column in src dataframe with a form of interval [lower, upper]
Represents the values of column in src dataframe with a form of interval [lower, upper]
Dataframe to anonymize
Column to be anonymized
DataFrame Anonymized dataframe
Returns column name from src dataframe specified by the column ID defined by protobuf.
Returns column name from src dataframe specified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns column names from src dataframe specified by column IDs.
Returns column names from src dataframe specified by column IDs. Note that the column with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
Checks the given column ID is valid.
Checks the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks the given column Name is valid.
Checks the given column Name is valid.
dataframe to get names of columns.
column Name.
Boolean.
Operates data range module for basic de-identification
Operates data range module for basic de-identification
Input dataframe
DataFrame Anonymized dataframe
Represents the numerical type of column as a form of intervals [lower, upper] using 'rangeType' method.
Represents the numerical type of column as a form of intervals [lower, upper] using 'rangeType' method.
Dataframe to anonymize
Column to be anonymized
DataFrame Anonymized dataframe
Represents the values of column containing both numerical and string as a form of intervals [lower, upper] using 'rangeType' method.
Represents the values of column containing both numerical and string as a form of intervals [lower, upper] using 'rangeType' method.
Dataframe to anonymize
Column to be anonymized
Methods of data range module. currently, not supported
:: ApplicationDeveloperApi ::
Operator that implements the data range module in the Data Suppression algorithm. It represents the values of the data with intervals [lower, upper].