How to use the groupBy method in org.apache.flink.api.java.operators.PartitionOperator

Best Java code snippets using org.apache.flink.api.java.operators.PartitionOperator.groupBy (Showing top 7 results out of 315)

UnsortedGrouping<Tuple3<Integer, Long, String>> partitionedDS = ds.partitionByHash(0).groupBy(1);

.groupBy(1)
.reduceGroup(new IdentityGroupReducerCombinable<Tuple2<Long,Long>>())
.output(new DiscardingOutputFormat<Tuple2<Long, Long>>());

@Test
public void testPartitionCustomOperatorPreservesFields() {
  try {
    ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
    
    DataSet<Tuple2<Long, Long>> data = env.fromCollection(Collections.singleton(new Tuple2<>(0L, 0L)));
    
    // Custom-partition on field 1; the groupBy(1) below uses the same key field.
    data.partitionCustom(new Partitioner<Long>() {
        @Override
        public int partition(Long key, int numPartitions) { return key.intValue(); }
      }, 1)
      .groupBy(1)
      .reduceGroup(new IdentityGroupReducerCombinable<Tuple2<Long, Long>>())
      .output(new DiscardingOutputFormat<Tuple2<Long, Long>>());
    
    Plan p = env.createProgramPlan();
    OptimizedPlan op = compileNoStats(p);
    
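    // Walk the optimized plan backwards: sink <- reducer <- partitioner.
    SinkPlanNode sink = op.getDataSinks().iterator().next();
    SingleInputPlanNode reducer = (SingleInputPlanNode) sink.getInput().getSource();
    SingleInputPlanNode partitioner = (SingleInputPlanNode) reducer.getInput().getSource();
    // The reducer is expected to forward its input rather than repartition it,
    // while the partitioner itself ships data with the custom strategy.
    assertEquals(ShipStrategyType.FORWARD, reducer.getInput().getShipStrategy());
    assertEquals(ShipStrategyType.PARTITION_CUSTOM, partitioner.getInput().getShipStrategy());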
  }
  catch (Exception e) {
    e.printStackTrace();
    fail(e.getMessage());
  }
}

@Test
public void testRangePartitionByKeyField2() throws Exception {
  /*
   * Test range partition by key field
   */
  final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
  DataSet<Tuple3<Integer, Long, String>> ds = CollectionDataSets.get3TupleDataSet(env);
  AggregateOperator<Tuple3<Integer, Long, String>> sum = ds
    .map(new PrefixMapper())
    .partitionByRange(1, 2)
    .groupBy(1, 2)
    .sum(0);
  List<Tuple3<Integer, Long, String>> result = sum.collect();
  String expected = "(1,1,Hi)\n" +
    "(5,2,Hello)\n" +
    "(4,3,Hello)\n" +
    "(5,3,I am )\n" +
    "(6,3,Luke )\n" +
    "(34,4,Comme)\n" +
    "(65,5,Comme)\n" +
    "(111,6,Comme)";
  compareResultAsText(result, expected);
}

@Test
public void testHashPartitionByKeyField2() throws Exception {
  /*
   * Test hash partition by key field
   */
  final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
  DataSet<Tuple3<Integer, Long, String>> ds = CollectionDataSets.get3TupleDataSet(env);
  AggregateOperator<Tuple3<Integer, Long, String>> sum = ds
    .map(new PrefixMapper())
    .partitionByHash(1, 2)
    .groupBy(1, 2)
    .sum(0);
  List<Tuple3<Integer, Long, String>> result = sum.collect();
  String expected = "(1,1,Hi)\n" +
    "(5,2,Hello)\n" +
    "(4,3,Hello)\n" +
    "(5,3,I am )\n" +
    "(6,3,Luke )\n" +
    "(34,4,Comme)\n" +
    "(65,5,Comme)\n" +
    "(111,6,Comme)";
  compareResultAsText(result, expected);
}
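
The snippets above all partition a data set on some key and then call groupBy on the same key. The following is a minimal plain-Java sketch of why that combination is safe. It does not use Flink at all: the class PartitionThenGroupSketch, the Rec type, and the helper methods are hypothetical names made up for illustration, and the hash function is an assumption that need not match what Flink does internally. It only shows that once every record with a given key sits in a single partition, grouping inside each partition gives the same sums as grouping the whole input.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PartitionThenGroupSketch {

    /** Hypothetical (key, value) record, standing in for a two-field tuple. */
    static final class Rec {
        final long key;
        final int value;

        Rec(long key, int value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Assigns each record to a partition based on a hash of its key (illustrative hash only). */
    static List<List<Rec>> partitionByHash(List<Rec> input, int numPartitions) {
        List<List<Rec>> partitions = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            partitions.add(new ArrayList<>());
        }
        for (Rec r : input) {
            int target = Math.floorMod(Long.hashCode(r.key), numPartitions);
            partitions.get(target).add(r);
        }
        return partitions;
    }

    /** Groups records by key and sums their values. */
    static Map<Long, Integer> groupAndSum(List<Rec> records) {
        Map<Long, Integer> sums = new TreeMap<>();
        for (Rec r : records) {
            sums.merge(r.key, r.value, Integer::sum);
        }
        return sums;
    }

    /** Partitions first, then groups inside each partition and collects the per-partition results. */
    static Map<Long, Integer> partitionThenGroup(List<Rec> input, int numPartitions) {
        Map<Long, Integer> combined = new TreeMap<>();
        for (List<Rec> partition : partitionByHash(input, numPartitions)) {
            for (Map.Entry<Long, Integer> e : groupAndSum(partition).entrySet()) {
                // A key showing up in two partitions would mean the partitioning was not by key.
                if (combined.put(e.getKey(), e.getValue()) != null) {
                    throw new IllegalStateException("key " + e.getKey() + " found in two partitions");
                }
            }
        }
        return combined;
    }

    public static void main(String[] args) {
        List<Rec> input = Arrays.asList(
            new Rec(1L, 1), new Rec(2L, 2), new Rec(2L, 3),
            new Rec(3L, 4), new Rec(3L, 5), new Rec(3L, 6));
        System.out.println(partitionThenGroup(input, 4)); // prints {1=1, 2=5, 3=15}
        System.out.println(groupAndSum(input));           // prints {1=1, 2=5, 3=15}
    }
}
```

Both calls print the same map because no key is split across partitions, so no record has to move again between the partitioning step and the grouping step.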
