public static int runMapReduce(Configuration conf, Path input, Path output)
    throws IOException, ClassNotFoundException, InterruptedException {
  // Prepare Job for submission.
  Job job = HadoopUtil.prepareJob(input, output, SequenceFileInputFormat.class,
      StreamingKMeansMapper.class, IntWritable.class, CentroidWritable.class,
      StreamingKMeansReducer.class, IntWritable.class, CentroidWritable.class,
      SequenceFileOutputFormat.class, conf);
  job.setJobName(HadoopUtil.getCustomJobName(StreamingKMeansDriver.class.getSimpleName(), job,
      StreamingKMeansMapper.class, StreamingKMeansReducer.class));

  // There is only one reducer so that the intermediate centroids get collected on one
  // machine and are clustered in memory to get the right number of clusters.
  job.setNumReduceTasks(1);

  // Set the JAR (so that the required libraries are available) and run.
  job.setJarByClass(StreamingKMeansDriver.class);

  // Run job!
  long start = System.currentTimeMillis();
  if (!job.waitForCompletion(true)) {
    return -1;
  }
  long end = System.currentTimeMillis();

  log.info("StreamingKMeans clustering complete. Results are in {}. Took {} ms",
      output.toString(), end - start);
  return 0;
}
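A minimal sketch of how a caller might invoke `runMapReduce`, assuming the clustering options (number of clusters, distance measure, and so on) have already been set on the `Configuration`; the class name and paths below are hypothetical placeholders, not part of the Mahout driver itself.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;

public class RunStreamingKMeans {
  public static void main(String[] args) throws Exception {
    // Assumed: clustering options are already set on this Configuration.
    // The real driver populates them from command-line options first.
    Configuration conf = new Configuration();

    Path input = new Path("streamingkmeans/input");   // hypothetical input path
    Path output = new Path("streamingkmeans/output"); // hypothetical output path

    // Blocks until the job finishes; returns 0 on success, -1 on failure.
    int exitCode = StreamingKMeansDriver.runMapReduce(conf, input, output);
    System.exit(exitCode == 0 ? 0 : 1);
  }
}
```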
/**
 * Creates a map-only Job with the given input/output formats and mapper key/value classes.
 * If {@code jobname} is null, a name is generated from the driver and mapper classes.
 */
protected Job prepareJob(Path inputPath, Path outputPath,
                         Class<? extends InputFormat> inputFormat,
                         Class<? extends Mapper> mapper,
                         Class<? extends Writable> mapperKey,
                         Class<? extends Writable> mapperValue,
                         Class<? extends OutputFormat> outputFormat,
                         String jobname) throws IOException {
  Job job = HadoopUtil.prepareJob(inputPath, outputPath, inputFormat, mapper, mapperKey,
      mapperValue, outputFormat, getConf());
  String name = jobname != null
      ? jobname
      : HadoopUtil.getCustomJobName(getClass().getSimpleName(), job, mapper, Reducer.class);
  job.setJobName(name);
  return job;
}
/**
 * Creates a full map-reduce Job with the given mapper, reducer, and their key/value classes.
 */
protected Job prepareJob(Path inputPath, Path outputPath,
                         Class<? extends InputFormat> inputFormat,
                         Class<? extends Mapper> mapper,
                         Class<? extends Writable> mapperKey,
                         Class<? extends Writable> mapperValue,
                         Class<? extends Reducer> reducer,
                         Class<? extends Writable> reducerKey,
                         Class<? extends Writable> reducerValue,
                         Class<? extends OutputFormat> outputFormat) throws IOException {
  Job job = HadoopUtil.prepareJob(inputPath, outputPath, inputFormat, mapper, mapperKey,
      mapperValue, reducer, reducerKey, reducerValue, outputFormat, getConf());
  // Include the actual reducer class in the generated job name.
  job.setJobName(HadoopUtil.getCustomJobName(getClass().getSimpleName(), job, mapper, reducer));
  return job;
}
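A minimal sketch of how a subclass of Mahout's AbstractJob might use the map-reduce `prepareJob` overload above. `ExampleDriver`, its nested mapper/reducer, and the paths are hypothetical; a real driver would derive input and output paths from AbstractJob's command-line option helpers rather than hard-coding them.

```java
import java.io.IOException;

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat;
import org.apache.hadoop.util.ToolRunner;
import org.apache.mahout.common.AbstractJob;

public class ExampleDriver extends AbstractJob {

  // Trivial identity mapper over (Text, IntWritable) sequence-file records.
  public static class ExampleMapper extends Mapper<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void map(Text key, IntWritable value, Context ctx)
        throws IOException, InterruptedException {
      ctx.write(key, value);
    }
  }

  // Sums the values seen for each key.
  public static class ExampleReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context ctx)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable v : values) {
        sum += v.get();
      }
      ctx.write(key, new IntWritable(sum));
    }
  }

  @Override
  public int run(String[] args) throws Exception {
    // A real driver would call addInputOption()/addOutputOption() and
    // parseArguments(args); fixed paths keep this sketch short.
    Path input = new Path("example/input");   // hypothetical path
    Path output = new Path("example/output"); // hypothetical path

    Job job = prepareJob(input, output,
        SequenceFileInputFormat.class,
        ExampleMapper.class, Text.class, IntWritable.class,
        ExampleReducer.class, Text.class, IntWritable.class,
        SequenceFileOutputFormat.class);
    return job.waitForCompletion(true) ? 0 : -1;
  }

  public static void main(String[] args) throws Exception {
    System.exit(ToolRunner.run(new ExampleDriver(), args));
  }
}
```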