ksb.csle.didentification.privacy
Object that contains message ksb.csle.common.proto.StreamDidentProto.IdenPartialReductionInfo IdenPartialReductionInfo contains attributes as follows:
enum ReductionMethod { DELETE = 0; REPLACE = 1; } message GeneralizeColumnInfo { required int32 selectedColumnId = 1; required int32 numLevels = 2 [default = 5]; required int32 curLevel = 3 [default = 1]; } enum ColumnHandlePolicy { ONEBYONE = 0; ALL = 1; } message IdenPartialReductionInfo { required ReductionMethod method = 1 [default = DELETE]; repeated GeneralizeColumnInfo generalizedColumnInfo = 2; required ColumnHandlePolicy columnHandlePolicy = 3 [default = ONEBYONE]; repeated FieldInfo fieldInfo = 4; optional PrivacyCheckInfo check = 5; }
Replace the some parts of the value with "*".
Replace the some parts of the value with "*". The parts are decided by the generalization step. If the given generalized step and its maximum step are 3 and 10, respectively, the 30% of value are replaced.
Dataframe to anonymize
DataFrame Generalized dataframe
Anonymizes the column specified in src dataframe using generic 'Type' method.
Anonymizes the column specified in src dataframe using generic 'Type' method. The 'Type' is decided by inherited object module.
Dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Dummy function.
Dummy function. To be modified.
dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Returns column name from src dataframe specified by the column ID defined by protobuf.
Returns column name from src dataframe specified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns column names from src dataframe specified by column IDs.
Returns column names from src dataframe specified by column IDs. Note that the column with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
Checks the given column ID is valid.
Checks the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks the given column Name is valid.
Checks the given column Name is valid.
dataframe to get names of columns.
column Name.
Boolean.
Operates iden-partial reduction module for basic de-identification
Operates iden-partial reduction module for basic de-identification
Input dataframe
DataFrame Anonymized dataframe
:: ApplicationDeveloperApi ::
Operator that implements the identity partial reduction module in the Data Reduction algorithm, which replace the some parts of the value with "*".