ksb.csle.didentification.privacy
Object that contains message ksb.csle.common.proto.StreamDidentProto.RearrangementInfo RearrangementInfo contains attributes as follows:
enum RearrangeMethod { REARR_RANDOM = 0; REARR_MANUAL = 1; } message SwapList { required int32 fromRowId = 1; required int32 toRowId = 2; } enum ColumnHandlePolicy { ONEBYONE = 0; ALL = 1; } message RearrangementInfo { repeated int32 selectedColumnId = 1; required ColumnHandlePolicy columnHandlePolicy = 2 [default = ONEBYONE]; required RearrangeMethod method = 3 [default = REARR_RANDOM]; optional double ratio = 4 [default = 0.001]; repeated SwapList swapList = 5; repeated FieldInfo fieldInfo = 6; optional PrivacyCheckInfo check = 7; }
Rearranges some records of given columns each other.
Rearranges some records of given columns each other.
Dataframe to rearrange
the array of columns to be rearranged
DataFrame Rearranged dataframe
Anonymizes the column specified in src dataframe using generic 'Type' method.
Anonymizes the column specified in src dataframe using generic 'Type' method. The 'Type' is decided by inherited object module.
Dataframe to anonymize
Column to be anonymized
DataFrame The dataframe which replaces original column with anonymized column
Applies the rearrangement function on given specified column
Applies the rearrangement function on given specified column
Dataframe to anonymize
the column to apply the rearrangment module
DataFrame the rearranged dataframe
Applies the rearrangement function on given all specified columns simultaneously.
Applies the rearrangement function on given all specified columns simultaneously.
Dataframe to anonymize
DataFrame the rearranged dataframe
Suppose there are some columns in a dataset, and the data handler tries to to apply the rearrangement function on some of that columns.
Suppose there are some columns in a dataset, and the data handler tries to to apply the rearrangement function on some of that columns. This function applies the function on each specified column individually one-by-one.
Dataframe to anonymize
the array of columns to apply the rearrangment module
DataFrame the rearranged dataframe
Returns column name from src dataframe specified by the column ID defined by protobuf.
Returns column name from src dataframe specified by the column ID defined by protobuf.
dataframe to get names of columns.
column ID to anonymize.
String.
Returns column names from src dataframe specified by column IDs.
Returns column names from src dataframe specified by column IDs. Note that the column with invalid IDs are ignored.
dataframe to get names of columns.
Array[String].
Checks the given column ID is valid.
Checks the given column ID is valid.
dataframe to get names of columns.
Boolean.
Checks the given column Name is valid.
Checks the given column Name is valid.
dataframe to get names of columns.
column Name.
Boolean.
Operates rearrangement module for basic de-identification
Operates rearrangement module for basic de-identification
Input dataframe
DataFrame Anonymized dataframe
:: ApplicationDeveloperApi ::
Operator that implements the rearrangement module in the aggregation algorithm. The model swaps some records in the given column each other. - The user can configure the records to be rearranged manually. - Or, the records to be rearranged can be automatically done according to the given ratio. That is, if the ratio is 0.5, the half of total records are randomly rearranged each other.