Package

ksb.csle.didentification

interfaces

Permalink

package interfaces

Visibility
  1. Public
  2. All

Type Members

  1. case class ColumnHierarchy(hierarhcyDepth: Map[Int, HierarchyEntries]) extends Product with Serializable

    Permalink

    This case class contains the list of hierarchical information of a column as a map [the number of generalization (depth), generalized data w.r.t.

    This case class contains the list of hierarchical information of a column as a map [the number of generalization (depth), generalized data w.r.t. to depth].

    hierarhcyDepth

    the information containing the hierarchical info w.r.t. the given depth

  2. trait DataFrameCheck extends AnyRef

    Permalink

    This trait provides some functions to get specific columns or check the given column names or IDs of dataframe are valid or not.

  3. trait DeAnonymizer extends AnyRef

    Permalink
  4. case class EquivalenceClass(eqIdentifiers: Seq[Any], count: Long, latticeEntry: List[LatticeValue]) extends Product with Serializable

    Permalink

    This case class manages the information about the equivalence class.

    This case class manages the information about the equivalence class. It contains its sequence of quasi-identifiers, the number of records, and its lattice entry to get this equivalence class.

    eqIdentifiers

    The sequence of quasi-identifiers in this EC

    count

    The number of records in this EC

    latticeEntry

    The lattice entry to get this EC

  5. case class HierarchyEntries(depth: Int, hierarchyEntry: Map[String, String]) extends Product with Serializable

    Permalink

    The hierarchical information is changed by how many times the algorithm scrubs (or generalizes) the original data.

    The hierarchical information is changed by how many times the algorithm scrubs (or generalizes) the original data. This case class describes this information of a column with respect to the number (or depth) of scrubbing the data. [depth, original data => generalized data w.r.t. depth].

    depth

    the number of scrubbing the data

    hierarchyEntry

    the information containing the hierarchical info

  6. case class InformationLossBound(loss: Double, lowerBound: Double, relativeLoss: Double, relativelowerBound: Double) extends Product with Serializable

    Permalink

    This case class provides some information about the loss occurred by the de-identification.

    This case class provides some information about the loss occurred by the de-identification.

    loss

    Pure information loss, but not normalized

    lowerBound

    The lower bound of pure information loss

    relativeLoss

    Relative loss to original data

  7. case class Interval(lower: Double, upper: Double) extends Product with Serializable

    Permalink

    This case class contains the interval information [lower, upper].

    This case class contains the interval information [lower, upper].

    lower

    the lower bound of this interval

    upper

    the upper bound of this interval

  8. case class LatticeEntry(entry: List[LatticeValue], anonymity: Double, loss: Double, risk: Double) extends Product with Serializable

    Permalink

    The lattice value indicates how to scrub the specific single column.

    The lattice value indicates how to scrub the specific single column. Generally, there are many quasi-identifiers in the data set, so it may be required to manage those quasi-identifiers. Lattice entry manages them.

    entry

    the list of lattice value

    loss

    the measured information loss when this lattice entry applies

    risk

    the measured de-identification risk when this lattice entry applies

  9. case class LatticeValue(columnIndex: Int, columnName: String, currentDepth: Int, maxDepth: Int, doStop: Boolean) extends Product with Serializable

    Permalink

    This case class indicates how many times to scrub the column

    This case class indicates how many times to scrub the column

    columnIndex

    the index of column

    columnName

    the name of column

    currentDepth

    how many times to scrub the column. i.e., currentDepth = 1 means that it scrub the column one time

    maxDepth

    the possible number of scrub

    doStop

    continuously scrub the column or not. For example, if it is sufficient to satisfy the privacy policy by only scrubing one time, scrubbing the column two times is useless because it surely satisfy the privacy policy

  10. trait MethodString extends AnyRef

    Permalink

    This trait provides the function to translate the given specific methods to related strings.

  11. case class OutlierInfo(lower: Double, upper: Double, replace: String) extends Product with Serializable

    Permalink

    This case class contains the outlier information [lower, upper].

    This case class contains the outlier information [lower, upper]. The given value is beyond [lower, upper], it is considered as outlier, which may be replaced by the 'replace' value.

    lower

    the lower bound of this interval

    upper

    the upper bound of this interval

  12. trait Statistics extends DataFrameCheck

    Permalink

    This trait provides the statistics related functions such as min, max, avg, std, and # of records.

Ungrouped