ksb.csle.didentification.verification.loss
Converts all the contents of a specific column to the array of string.
Converts all the contents of a specific column to the array of string.
the dataframe
the column name
Array[String] the number of equivalence classes
Gets cardinality of the array of columns in 'src' dataframe
Gets cardinality of the array of columns in 'src' dataframe
the dataframe
Double the cardinality of the column
Gets cardinality of 'column' column in 'src' dataframe
Gets cardinality of 'column' column in 'src' dataframe
the dataframe
the column to get cardinality
Double the cardinality of the column
Gets the number of the equivalence class which is composed of referring to the columns of quasi-identifiers.
Gets the number of the equivalence class which is composed of referring to the columns of quasi-identifiers.
the dataframe
the array of quasi-identifier columns
Long the number of equivalence classes
Measures the information loss based on cardinality of a column.
Measures the information loss based on cardinality of a column.
the original dataframe
the anonymized dataframe
the suppressed dataframe
the array of column names of quasi-identifiers. The combination key is made by cross-tabulating these variables.
InformationLossBound the measured information loss
Measures the information loss of anonymized data compared to the original data.
Measures the information loss of anonymized data compared to the original data.
the dataframe
the anonymized dataframe
the array of column names
InformationLossBound the measured information loss bound
This class implements the loss measure method named as the cardinality loss which measures the loss based on cardinality of a column.