Missing Value Impute Operator


Operator 설명

Operator 파라미터 설명

filed value 설명
subParam.key numeric 특정 숫자로 채우는 방법 선택
subParam.value 0 missing value를 0의 값으로 채움

작동 순서

  1. 입력 DataFrame 및 Protobuffer 입력 파라미터 유효성 검사를 진행한다.
  2. Spark ml 및 DataFrame API를 이용하여 MVI 알고리즘 동작한다.
  3. 출력 DataFrame을 생성한다.

Operator 모듈 테스트 결과

입력 데이터

-0.2947324415-0.1463210702 -0.3093645485-0.3841973244-0.3511705686-0.1647157191
-0.2683946488-0.1998327759-0.1655518395-0.2826086957-0.3883779264-0.3319397993-0.1730769231
-0.2880434783-0.2566889632-0.3850334448-0.3704013378-0.1693143813
-0.3131270903-0.2215719064-0.1580267559-0.2408026756-0.3683110368-0.2274247492
-0.3152173913-0.1421404682-0.1484113712-0.1605351171-0.3386287625-0.3465719064
-0.2683946488-0.1998327759-0.1655518395-0.2826086957-0.3319397993-0.1730769231
-0.2880434783-0.2186454849-0.2491638796-0.3850334448-0.1693143813
-0.2215719064-0.1580267559-0.2408026756-0.3683110368-0.3837792642
-0.3152173913-0.1421404682-0.1484113712-0.1605351171-0.3386287625-0.3465719064-0.1772575251
-0.3110367893-0.1906354515-0.2495819398-0.3595317726-0.3298494983-0.1496655518

결과 데이터

-0.2947324415-0.14632107020 -0.3093645485-0.3841973244-0.3511705686-0.1647157191
-0.2683946488-0.1998327759-0.1655518395-0.2826086957-0.3883779264-0.3319397993-0.1730769231
-0.28804347830-0.25668896320-0.3850334448-0.3704013378-0.1693143813
-0.3131270903-0.2215719064-0.1580267559-0.2408026756-0.36831103680-0.2274247492
-0.3152173913-0.1421404682-0.1484113712-0.1605351171-0.3386287625-0.34657190640
-0.2683946488-0.1998327759-0.1655518395-0.28260869570-0.3319397993-0.1730769231
-0.2880434783-0.21864548490-0.2491638796-0.38503344480-0.1693143813
0-0.2215719064-0.1580267559-0.2408026756-0.3683110368-0.38377926420
-0.3152173913-0.1421404682-0.1484113712-0.1605351171-0.3386287625-0.3465719064-0.1772575251
-0.3110367893-0.1906354515-0.24958193980-0.3595317726-0.3298494983-0.1496655518
val mviInfo = MissingValueImputationInfo.newBuilder()
  .setScope(MissingValueImputationInfo.Scope.SCOPE_ALL)
  .addSelectedColumnId(1)
  .setMethod(MissingValueImputationInfo.Method.SPECIFIC_VALUE)
  .setHow(MissingValueImputationInfo.How.HOW_ANY)
  .addSubParam(
    SubParameter.newBuilder
      .setKey("numeric")
      .setValue("0")
      .build
      )
  .build