This case class contains the list of hierarchical information of a column as a map [the number of generalization (depth), generalized data w.r.t.
This trait provides some functions to get specific columns or check the given column names or IDs of dataframe are valid or not.
This case class manages the information about the equivalence class.
This case class manages the information about the equivalence class. It contains its sequence of quasi-identifiers, the number of records, and its lattice entry to get this equivalence class.
The sequence of quasi-identifiers in this EC
The number of records in this EC
The lattice entry to get this EC
The hierarchical information is changed by how many times the algorithm scrubs (or generalizes) the original data.
The hierarchical information is changed by how many times the algorithm scrubs (or generalizes) the original data. This case class describes this information of a column with respect to the number (or depth) of scrubbing the data. [depth, original data => generalized data w.r.t. depth].
the number of scrubbing the data
the information containing the hierarchical info
This case class provides some information about the loss occurred by the de-identification.
This case class provides some information about the loss occurred by the de-identification.
Pure information loss, but not normalized
The lower bound of pure information loss
Relative loss to original data
This case class contains the interval information [lower, upper].
This case class contains the interval information [lower, upper].
the lower bound of this interval
the upper bound of this interval
The lattice value indicates how to scrub the specific single column.
The lattice value indicates how to scrub the specific single column. Generally, there are many quasi-identifiers in the data set, so it may be required to manage those quasi-identifiers. Lattice entry manages them.
the list of lattice value
the measured information loss when this lattice entry applies
the measured de-identification risk when this lattice entry applies
This case class indicates how many times to scrub the column
This case class indicates how many times to scrub the column
the index of column
the name of column
how many times to scrub the column. i.e., currentDepth = 1 means that it scrub the column one time
the possible number of scrub
continuously scrub the column or not. For example, if it is sufficient to satisfy the privacy policy by only scrubing one time, scrubbing the column two times is useless because it surely satisfy the privacy policy
This trait provides the function to translate the given specific methods to related strings.
This case class contains the outlier information [lower, upper].
This case class contains the outlier information [lower, upper]. The given value is beyond [lower, upper], it is considered as outlier, which may be replaced by the 'replace' value.
the lower bound of this interval
the upper bound of this interval
This trait provides the statistics related functions such as min, max, avg, std, and # of records.
This case class contains the list of hierarchical information of a column as a map [the number of generalization (depth), generalized data w.r.t. to depth].
the information containing the hierarchical info w.r.t. the given depth