How to use
StandardScaler
in
org.apache.spark.ml.feature

Best Java code snippets using org.apache.spark.ml.feature.StandardScaler (Showing top 6 results out of 315)

 @Test
 public void standardScaler() {
  // The tests are to check Java compatibility.
  List<VectorIndexerSuite.FeatureData> points = Arrays.asList(
   new VectorIndexerSuite.FeatureData(Vectors.dense(0.0, -2.0)),
   new VectorIndexerSuite.FeatureData(Vectors.dense(1.0, 3.0)),
   new VectorIndexerSuite.FeatureData(Vectors.dense(1.0, 4.0))
  );
  Dataset<Row> dataFrame = spark.createDataFrame(jsc.parallelize(points, 2),
   VectorIndexerSuite.FeatureData.class);
  StandardScaler scaler = new StandardScaler()
   .setInputCol("features")
   .setOutputCol("scaledFeatures")
   .setWithStd(true)
   .setWithMean(false);

  // Compute summary statistics by fitting the StandardScaler
  StandardScalerModel scalerModel = scaler.fit(dataFrame);

  // Normalize each feature to have unit standard deviation.
  Dataset<Row> scaledData = scalerModel.transform(dataFrame);
  scaledData.count();
 }
}

 @Test
 public void pipeline() {
  StandardScaler scaler = new StandardScaler()
   .setInputCol("features")
   .setOutputCol("scaledFeatures");
  LogisticRegression lr = new LogisticRegression()
   .setFeaturesCol("scaledFeatures");
  Pipeline pipeline = new Pipeline()
   .setStages(new PipelineStage[]{scaler, lr});
  PipelineModel model = pipeline.fit(dataset);
  model.transform(dataset).createOrReplaceTempView("prediction");
  Dataset<Row> predictions = spark.sql("SELECT label, probability, prediction FROM prediction");
  predictions.collectAsList();
 }
}

 @Test
 public void pipeline() {
  StandardScaler scaler = new StandardScaler()
   .setInputCol("features")
   .setOutputCol("scaledFeatures");
  LogisticRegression lr = new LogisticRegression()
   .setFeaturesCol("scaledFeatures");
  Pipeline pipeline = new Pipeline()
   .setStages(new PipelineStage[]{scaler, lr});
  PipelineModel model = pipeline.fit(dataset);
  model.transform(dataset).createOrReplaceTempView("prediction");
  Dataset<Row> predictions = spark.sql("SELECT label, probability, prediction FROM prediction");
  predictions.collectAsList();
 }
}

 @Test
 public void standardScaler() {
  // The tests are to check Java compatibility.
  List<VectorIndexerSuite.FeatureData> points = Arrays.asList(
   new VectorIndexerSuite.FeatureData(Vectors.dense(0.0, -2.0)),
   new VectorIndexerSuite.FeatureData(Vectors.dense(1.0, 3.0)),
   new VectorIndexerSuite.FeatureData(Vectors.dense(1.0, 4.0))
  );
  Dataset<Row> dataFrame = spark.createDataFrame(jsc.parallelize(points, 2),
   VectorIndexerSuite.FeatureData.class);
  StandardScaler scaler = new StandardScaler()
   .setInputCol("features")
   .setOutputCol("scaledFeatures")
   .setWithStd(true)
   .setWithMean(false);

  // Compute summary statistics by fitting the StandardScaler
  StandardScalerModel scalerModel = scaler.fit(dataFrame);

  // Normalize each feature to have unit standard deviation.
  Dataset<Row> scaledData = scalerModel.transform(dataFrame);
  scaledData.count();
 }
}

 @Test
 public void pipeline() {
  StandardScaler scaler = new StandardScaler()
   .setInputCol("features")
   .setOutputCol("scaledFeatures");
  LogisticRegression lr = new LogisticRegression()
   .setFeaturesCol("scaledFeatures");
  Pipeline pipeline = new Pipeline()
   .setStages(new PipelineStage[]{scaler, lr});
  PipelineModel model = pipeline.fit(dataset);
  model.transform(dataset).createOrReplaceTempView("prediction");
  Dataset<Row> predictions = spark.sql("SELECT label, probability, prediction FROM prediction");
  predictions.collectAsList();
 }
}

 @Test
 public void standardScaler() {
  // The tests are to check Java compatibility.
  List<VectorIndexerSuite.FeatureData> points = Arrays.asList(
   new VectorIndexerSuite.FeatureData(Vectors.dense(0.0, -2.0)),
   new VectorIndexerSuite.FeatureData(Vectors.dense(1.0, 3.0)),
   new VectorIndexerSuite.FeatureData(Vectors.dense(1.0, 4.0))
  );
  Dataset<Row> dataFrame = spark.createDataFrame(jsc.parallelize(points, 2),
   VectorIndexerSuite.FeatureData.class);
  StandardScaler scaler = new StandardScaler()
   .setInputCol("features")
   .setOutputCol("scaledFeatures")
   .setWithStd(true)
   .setWithMean(false);

  // Compute summary statistics by fitting the StandardScaler
  StandardScalerModel scalerModel = scaler.fit(dataFrame);

  // Normalize each feature to have unit standard deviation.
  Dataset<Row> scaledData = scalerModel.transform(dataFrame);
  scaledData.count();
 }
}

Most used methods

Popular in Java

Start an intent from android
setContentView (Activity)
getSharedPreferences (Context)
orElseThrow (Optional)
Return the contained value, if present, otherwise throw an exception to be created by the provided s
BigInteger (java.math)
An immutable arbitrary-precision signed integer.FAST CRYPTOGRAPHY This implementation is efficient f
ConnectException (java.net)
A ConnectException is thrown if a connection cannot be established to a remote host on a specific po
Stream (java.util.stream)
A sequence of elements supporting sequential and parallel aggregate operations. The following exampl
XPath (javax.xml.xpath)
XPath provides access to the XPath evaluation environment and expressions. Evaluation of XPath Expr
Point (java.awt)
A point representing a location in (x,y) coordinate space, specified in integer precision.
Notification (javax.management)
CodeWhisperer alternatives

How to useStandardScaler in org.apache.spark.ml.feature

Best Java code snippets using org.apache.spark.ml.feature.StandardScaler (Showing top 6 results out of 315)

How to use
StandardScaler
in
org.apache.spark.ml.feature