@Override
public RecordWriter<Object, Object> getRecordWriter(TaskAttemptContext context)
    throws IOException, InterruptedException {
  return dummyFileOutputFormat.getRecordWriter(context);
}
/**
 * @return A RecordWriter object for the given TaskAttemptContext (configured for a particular file name).
 * @throws IOException
 */
RecordWriter<K, V> getBaseRecordWriter(TaskAttemptContext job) throws IOException, InterruptedException {
  // Get a new FileOutputFormat object when creating a RecordWriter, so that different RecordWriters can have
  // different underlying objects such as WriteSupport. This is a fix for CDAP-12823.
  return createFileOutputFormat().getRecordWriter(job);
}
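The snippet relies on a createFileOutputFormat() factory that is not shown here. A minimal sketch of one plausible shape, assuming the delegate format's class name is carried in the job configuration under an illustrative key (the key name, the getConf() helper, and the reflection approach are assumptions, not taken from the original code):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.ReflectionUtils;

// Hypothetical factory: builds a brand-new FileOutputFormat on every call so that each
// RecordWriter gets its own underlying objects (e.g. WriteSupport), per the comment above.
@SuppressWarnings({"unchecked", "rawtypes"})
private FileOutputFormat<K, V> createFileOutputFormat() {
  Configuration conf = getConf(); // assumes the enclosing class holds or derives a Configuration
  // "delegate.output.format.class" is an illustrative key, not one defined by Hadoop or CDAP.
  Class<? extends FileOutputFormat> clazz =
      conf.getClass("delegate.output.format.class", null, FileOutputFormat.class);
  return (FileOutputFormat<K, V>) ReflectionUtils.newInstance(clazz, conf);
}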
/** Gets the RecordWriter from the wrapped FileOutputFormat. */
@Override
public RecordWriter<K, V> getRecordWriter(TaskAttemptContext context) throws IOException, InterruptedException {
  Configuration conf = context.getConfiguration();
  return getDelegate(conf).getRecordWriter(context);
}
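getDelegate(Configuration) is likewise not part of the snippet. A hedged sketch of one way it could work, assuming the wrapped format's class name is read from the configuration and the instance is cached (contrast with getBaseRecordWriter above, which deliberately creates a fresh format per writer); the key and field names are illustrative:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.ReflectionUtils;

private FileOutputFormat<K, V> delegate; // cached wrapped format (assumption)

@SuppressWarnings({"unchecked", "rawtypes"})
private FileOutputFormat<K, V> getDelegate(Configuration conf) {
  if (delegate == null) {
    // "wrapped.output.format.class" is an illustrative key, not a real Hadoop property.
    Class<? extends FileOutputFormat> clazz =
        conf.getClass("wrapped.output.format.class", null, FileOutputFormat.class);
    delegate = (FileOutputFormat<K, V>) ReflectionUtils.newInstance(clazz, conf);
  }
  return delegate;
}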
private void doOpen(String uId) throws Exception {
  this.hash = uId.hashCode();
  Job job = writeOperation.sink.newJob();
  FileOutputFormat.setOutputPath(job, new Path(path));
  // Each Writer is responsible for writing one bundle of elements and is represented by one
  // unique Hadoop task based on uId/hash. All tasks share the same job ID. Since Dataflow
  // handles retrying of failed bundles, each task has one attempt only.
  JobID jobId = job.getJobID();
  TaskID taskId = new TaskID(jobId, TaskType.REDUCE, hash);
  context = new TaskAttemptContextImpl(job.getConfiguration(), new TaskAttemptID(taskId, 0));
  FileOutputFormat<K, V> outputFormat = formatClass.newInstance();
  recordWriter = outputFormat.getRecordWriter(context);
  outputCommitter = (FileOutputCommitter) outputFormat.getOutputCommitter(context);
}
@Override
public void open(String uId) throws Exception {
  this.hash = uId.hashCode();
  Job job = ((ConfigurableHDFSFileSink<K, V>) getWriteOperation().getSink()).jobInstance();
  FileOutputFormat.setOutputPath(job, new Path(path));
  // Each Writer is responsible for writing one bundle of elements and is represented by one
  // unique Hadoop task based on uId/hash. All tasks share the same job ID. Since Dataflow
  // handles retrying of failed bundles, each task has one attempt only.
  JobID jobId = job.getJobID();
  TaskID taskId = new TaskID(jobId, TaskType.REDUCE, hash);
  configure(job);
  context = new TaskAttemptContextImpl(job.getConfiguration(), new TaskAttemptID(taskId, 0));
  FileOutputFormat<K, V> outputFormat = formatClass.newInstance();
  recordWriter = outputFormat.getRecordWriter(context);
  outputCommitter = (FileOutputCommitter) outputFormat.getOutputCommitter(context);
}
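open(String) only builds the per-bundle TaskAttemptContext, RecordWriter, and OutputCommitter. A hedged sketch of how the matching write and close steps might use those fields (method names and the return value are assumptions, not taken from the snippet):

// Assumed to live in the same Writer class as open() above, reusing its
// recordWriter, outputCommitter, context, and hash fields.
public void write(K key, V value) throws Exception {
  recordWriter.write(key, value);
}

public String close() throws Exception {
  // Flush and close this bundle's underlying writer.
  recordWriter.close(context);
  // Promote the task attempt's output so the sink's job-level commit can find it later.
  if (outputCommitter.needsTaskCommit(context)) {
    outputCommitter.commitTask(context);
  }
  return Integer.toString(hash);
}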