Returns the list of tuples to be suppressed under the default condition when the k value of k-anonymity and suppreshold threshold are given by 3 and 0.2, respectively.
Returns the list of tuples to be suppressed under the default condition when the k value of k-anonymity and suppreshold threshold are given by 3 and 0.2, respectively.
dataframe to generalize
array of column names to be generalization
List[Any] the list of tuples to be suppressed
Returns the list of tuples to be suppressed under the condition when the k value of k-anonymity is given by 'k'.
Returns the list of tuples to be suppressed under the condition when the k value of k-anonymity is given by 'k'. All the tuples which violate the condition of k-anonymity are suppressed.
dataframe to generalize
array of column names to be generalization
the k-value of k-anonymity
List[Any] the list of tuples to be suppressed
Returns the list of tuples to be suppressed under the condition when the k value of k-anonymity and the suppreshold threshold are given by 'k' and 'suppressRatio'.
Returns the list of tuples to be suppressed under the condition when the k value of k-anonymity and the suppreshold threshold are given by 'k' and 'suppressRatio'. All the tuples which violate the condition of k-anonymity are suppressed.
dataframe to generalize
array of column names to be generalization
the k-value of k-anonymity
the acceptable suppress ratio
List[Any] the list of tuples to be suppressed
Returns the list of tuples to be suppressed under the condition when the re-identification risk is given by 'risk'.
Returns the list of tuples to be suppressed under the condition when the re-identification risk is given by 'risk'. All the tuples which has the probability higher than given re-identification risk are suppressed.
dataframe to generalize
array of column names to be generalization
the re-identification risk
List[Any] the list of tuples to be suppressed
In general, satisfying k-anonymity in global domain generalization is not easy because generalization is equally applied on all tuples of a column (that is, column-based generalization).
In general, satisfying k-anonymity in global domain generalization is not easy because generalization is equally applied on all tuples of a column (that is, column-based generalization). Compared to this, local domain generalization only applies on tuples on an equivalence class (equivalence-based). Accordingly, in global-domain generalization, a large number of tuples may violate the privacy policy, so some specific algorithms support suppressing techniques which changes the tuples does not satisfying privacy policy into '*'. This function reads the 'src' dataframe, configures the tuple lists to be suppressed, and then returns the new dataframe including suppressed tuples.
dataframe to generalize
array of column names to be generalization
DataFrame the new suppressed dataframe
Only changes the quasi-identifier columns of given row into '*', and returns this changed row
Only changes the quasi-identifier columns of given row into '*', and returns this changed row
the row to be suppressed
Row new suppressed row
(Since version 0.1) This function can not support different type of columns
Returns the suppressed row of given original row
Returns the suppressed row of given original row
the row to be suppressed
Row new suppressed row
(Since version 0.1) This function can not support different type of columns