ksb.csle.didentification.privacy
Object that contains message ksb.csle.common.proto.StreamDidentProto.BlankImputeInfo BlankImputeInfo contains attributes as follows:
enum ReplaceValueMethod { BLANK = 0; STAR = 1; UNDERBAR = 2; } message BlankImputeInfo { repeated int32 selectedColumnId = 1; repeated int32 position = 2; optional int32 numReplace = 3 [default = 1]; required ReplaceValueMethod method = 4 [default = STAR]; repeated FieldInfo fieldInfo = 5; optional PrivacyCheckInfo check = 6; }
Anonymizes the column specified in src dataframe using generic 'Type' method.
Anonymizes the column specified in src dataframe using generic 'Type' method. The 'Type' is decided by inherited object module.
Dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Replaces the specific index position of values in the columm with imputeType (blank ' ', star '*', or impute '_')
Replaces the specific index position of values in the columm with imputeType (blank ' ', star '*', or impute '_')
Dataframe to anonymize
Column to be anonymized
Position of value to be imputed
The number of replace
Blank, star, or impute
DataFrame Anonymized dataframe
Replaces the specified multiple positions of values in the columm with imputeType (blank ' ', star '*', or impute '_')
Replaces the specified multiple positions of values in the columm with imputeType (blank ' ', star '*', or impute '_')
Dataframe to anonymize
Column to be anonymized
DataFrame Anonymized dataframe
Returns column name from src dataframe specified by the column ID defined by protobuf.
Returns column name from src dataframe specified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns column names from src dataframe specified by column IDs.
Returns column names from src dataframe specified by column IDs. Note that the column with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
Checks the given column ID is valid.
Checks the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks the given column Name is valid.
Checks the given column Name is valid.
dataframe to get names of columns.
column Name.
Boolean.
Operates blank and impute module for basic de-identification
Operates blank and impute module for basic de-identification
Input dataframe
DataFrame Anonymized dataframe
:: ApplicationDeveloperApi ::
Operator that implements the 'blank and impute' module in the Data Reduction algorithm. It changes the values of the data with ' ', '*', or '_'.