/**
 * Convenience overload of
 * {@link #setInput(MapReduceContext, String, DatasetStatePersistor, ConsumerConfiguration)}
 * that uses {@link ConsumerConfiguration#DEFAULT} as the consumer configuration.
 *
 * @param mapreduceContext the MapReduce context used to set up the input
 * @param partitionedFileSetName name of the PartitionedFileSet dataset to consume as input
 * @param statePersistor persistor for the partition-consumer state
 * @return the {@link BatchPartitionCommitter} produced by the four-argument overload
 *         (presumably used to commit/roll back consumed partitions — confirm against
 *         that overload's contract, which is not visible here)
 */
public static BatchPartitionCommitter setInput(MapReduceContext mapreduceContext, String partitionedFileSetName, DatasetStatePersistor statePersistor) {
    return setInput(mapreduceContext, partitionedFileSetName, statePersistor, ConsumerConfiguration.DEFAULT);
}
/**
 * Configures the MapReduce job before it runs: registers partitions of the
 * "lines" PartitionedFileSet as input (tracking consumer state in a
 * KeyValueTable), adds the "outputLines" and "counts" datasets as outputs,
 * and wires up the mapper/reducer classes.
 *
 * @throws Exception if job setup fails
 */
@Override
public void initialize() throws Exception {
    MapReduceContext context = getContext();

    // Consume partitions of the "lines" dataset; the consumer's state is
    // persisted in the "consumingState" table under the row/column "state.key".
    batchPartitionCommitter = PartitionBatchInput.setInput(
        context, "lines", new KVTableStatePersistor("consumingState", "state.key"));

    // Direct output for "outputLines" into a partition keyed by the job's
    // logical start time.
    PartitionKey outputKey = PartitionKey.builder()
        .addLongField("time", context.getLogicalStartTime())
        .build();
    Map<String, String> fileSetArgs = new HashMap<>();
    PartitionedFileSetArguments.setOutputPartitionKey(fileSetArgs, outputKey);
    context.addOutput(Output.ofDataset("outputLines", fileSetArgs));

    // Second output dataset; no extra arguments needed.
    context.addOutput(Output.ofDataset("counts"));

    // Hadoop-level job wiring: single reducer so all results land in one task.
    Job hadoopJob = context.getHadoopJob();
    hadoopJob.setMapperClass(Tokenizer.class);
    hadoopJob.setReducerClass(Counter.class);
    hadoopJob.setNumReduceTasks(1);
}