Class/Object

ksb.csle.component.operator.cleaning

EMClusteringOperator

Related Docs: object EMClusteringOperator | package cleaning

Permalink

class EMClusteringOperator extends BaseDataOperator[StreamOperatorInfo, DataFrame] with BaseDistanceCalculator

:: ApplicationDeveloperApi ::

Operator that performs EM(Expectation–Maximization) Clustering. It performs two steps iteratively to partition into k clusters. Expectation step calculates the expected value of the log likelihood function and maximization step finds the parameter maximizing the expected log-likelihood.

Linear Supertypes
BaseDistanceCalculator, BaseDataOperator[StreamOperatorInfo, DataFrame], BaseGenericOperator[StreamOperatorInfo, DataFrame], BaseGenericMutantOperator[StreamOperatorInfo, DataFrame, DataFrame], BaseDoer, Logging, Serializable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. EMClusteringOperator
  2. BaseDistanceCalculator
  3. BaseDataOperator
  4. BaseGenericOperator
  5. BaseGenericMutantOperator
  6. BaseDoer
  7. Logging
  8. Serializable
  9. Serializable
  10. AnyRef
  11. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new EMClusteringOperator(o: StreamOperatorInfo)

    Permalink

    o

    Object that contains message ksb.csle.common.proto.StreamOperatorProto.EMClusteringInfo EMClusteringInfo contains attributes as follows:

    • k_value: Number of clusters to form (required)
    • maxIter: Maximal number of iterations to be performed for one run (required)

    EMClusteringInfo

    message EMClusteringInfo {
    required int32 k_value = 3 [default = 2];
    required int32 maxIter = 4;
    }

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def ManhattanDistanceMetric(x: Row, y: Row): Double

    Permalink
    Definition Classes
    BaseDistanceCalculator
  5. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  9. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  11. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  12. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  13. val logger: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  14. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  15. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  16. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  17. def operate(df: DataFrame): DataFrame

    Permalink

    Operates EMClustering.

    Operates EMClustering.

    df

    Input dataframe

    returns

    DataFrame Output dataframe

    Definition Classes
    EMClusteringOperator → BaseGenericOperator → BaseGenericMutantOperator
  18. val p: EMClusteringInfo

    Permalink
  19. def stop: Unit

    Permalink
    Definition Classes
    BaseGenericOperator → BaseGenericMutantOperator
  20. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  21. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  22. def validate(df: DataFrame): Unit

    Permalink

    Validates EMClustering info and dataframe schema info using following params.

    Validates EMClustering info and dataframe schema info using following params.

    Annotations
    @throws( classOf[KsbException] )
  23. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  24. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from BaseDistanceCalculator

Inherited from BaseDataOperator[StreamOperatorInfo, DataFrame]

Inherited from BaseGenericOperator[StreamOperatorInfo, DataFrame]

Inherited from BaseGenericMutantOperator[StreamOperatorInfo, DataFrame, DataFrame]

Inherited from BaseDoer

Inherited from Logging

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped