ksb.csle.didentification.privacy
Object that contains message ksb.csle.common.proto.StreamDidentProto.QuasiReductionInfo QuasiReductionInfo contains attributes as follows:
enum ReductionMethod { DELETE = 0; REPLACE = 1; } message QuasiReductionInfo { required ReductionMethod method = 1 [default = DELETE]; repeated string safeHarborList = 2; repeated FieldInfo fieldInfo = 3; optional PrivacyCheckInfo check = 4; }
Anonymizes the src dataframe.
Anonymizes the src dataframe. Since this method is only applicable on the quasi-identifier columns, there is no need to specify the columns to be anonymized.
Dataframe to anonymize
DataFrame Anonymized dataframe which replaces original column with anonymized column
Anonymizes the column specified in src dataframe using generic 'Type' method.
Anonymizes the column specified in src dataframe using generic 'Type' method. The 'Type' is decided by inherited object module.
Dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Drops the quasi-identifier column which belongs to the list of safe habor.
Drops the quasi-identifier column which belongs to the list of safe habor. This list can be configured by the data handler, or by referring to the HIPAA act.
Dataframe to anonymize
Columnname to be anonymized
DataFrame Anonymized dataframe
Returns column name from src dataframe specified by the column ID defined by protobuf.
Returns column name from src dataframe specified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns column names from src dataframe specified by column IDs.
Returns column names from src dataframe specified by column IDs. Note that the column with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
Checks whether the given column belongs to the list of safe habor
Checks whether the given column belongs to the list of safe habor
the column name to check
Boolean returns whether belongs to the list of safe harbor or not.
Data handle can configure the list of safe habor.
Data handle can configure the list of safe habor. In the case, this function checks the given column belongs to the configured list.
the column name to check
Boolean returns whether belongs to the list of safe harbor or not.
Checks the given column ID is valid.
Checks the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks the given column Name is valid.
Checks the given column Name is valid.
dataframe to get names of columns.
column Name.
Boolean.
Operates quasi-reduction module for basic de-identification
Operates quasi-reduction module for basic de-identification
Input dataframe
DataFrame Anonymized dataframe
:: ApplicationDeveloperApi ::
Operator that implements the quasi-identifier reduction module in the Data Reduction algorithm.