How to use
getInputPaths
method
in
org.apache.hadoop.hive.ql.exec.Utilities

Best Java code snippets using org.apache.hadoop.hive.ql.exec.Utilities.getInputPaths (Showing top 13 results out of 315)

/**
 * On Tez we're not creating dummy files when getting/setting input paths.
 * We let Tez handle the situation. We're also setting the paths in the AM
 * so we don't want to depend on scratch dir and context.
 */
public static List<Path> getInputPathsTez(JobConf job, MapWork work) throws Exception {
 String scratchDir = job.get(DagUtils.TEZ_TMP_DIR_KEY);
 List<Path> paths = getInputPaths(job, work, new Path(scratchDir), null, true);
 return paths;
}

/**
 * On Tez we're not creating dummy files when getting/setting input paths.
 * We let Tez handle the situation. We're also setting the paths in the AM
 * so we don't want to depend on scratch dir and context.
 */
public static List<Path> getInputPathsTez(JobConf job, MapWork work) throws Exception {
 String scratchDir = job.get(DagUtils.TEZ_TMP_DIR_KEY);
 List<Path> paths = getInputPaths(job, work, new Path(scratchDir), null, true);
 return paths;
}

List<Path> inputPaths = Utilities.getInputPaths(cloned, (MapWork) work,
  scratchDir, context, false);
Utilities.setInputPaths(cloned, inputPaths);

List<Path> inputPaths = Utilities.getInputPaths(jobConf, mapWork, scratchDir, mock(Context.class), false);
assertEquals(inputPaths.size(), numOfPartitions);
for (int i=0; i<numOfPartitions; i++) {

MapWork mapWork = (MapWork) work;
cloned.setBoolean("mapred.task.is.map", true);
List<Path> inputPaths = Utilities.getInputPaths(cloned, mapWork,
  scratchDir, context, false);
Utilities.setInputPaths(cloned, inputPaths);

Path scratchDir = new Path(HiveConf.getVar(jobConf, HiveConf.ConfVars.LOCALSCRATCHDIR));
List<Path> inputPaths1 = Utilities.getInputPaths(jobConf, mapWork1, scratchDir,
    mock(Context.class), false);
inputPaths.addAll(inputPaths1);
assertFalse(nonExistentPath1.getFileSystem(conf).exists(nonExistentPath1));
List<Path> inputPaths2 = Utilities.getInputPaths(jobConf, mapWork2, scratchDir,
    mock(Context.class), false);
inputPaths.addAll(inputPaths2);

List<Path> inputPaths = Utilities.getInputPaths(jobConf, mapWork,
    new Path(HiveConf.getVar(jobConf, HiveConf.ConfVars.LOCALSCRATCHDIR)), mock(Context.class), false);
assertEquals(inputPaths.size(), numPartitions);

MapRedTask selectTask = (MapRedTask)plan.getRootTasks().get(0);
List<Path> inputPaths = Utilities.getInputPaths(newJob, selectTask.getWork().getMapWork(), emptyScratchDir, ctx, false);
Utilities.setInputPaths(newJob, inputPaths);

List<Path> inputPaths = Utilities.getInputPaths(job, mWork, emptyScratchDir, ctx, false);
Utilities.setInputPaths(job, inputPaths);

List<Path> inputPaths = Utilities.getInputPaths(job, mWork, emptyScratchDir, ctx, false);
Utilities.setInputPaths(job, inputPaths);

/**
 * On Tez we're not creating dummy files when getting/setting input paths.
 * We let Tez handle the situation. We're also setting the paths in the AM
 * so we don't want to depend on scratch dir and context.
 */
public static List<Path> getInputPathsTez(JobConf job, MapWork work) throws Exception {
 String scratchDir = job.get(DagUtils.TEZ_TMP_DIR_KEY);
 // we usually don't want to create dummy files for tez, however the metadata only
 // optimization relies on it.
 List<Path> paths = getInputPaths(job, work, new Path(scratchDir), null,
   !work.isUseOneNullRowInputFormat());
 return paths;
}

List<Path> inputPaths = Utilities.getInputPaths(cloned, (MapWork) work,
  scratchDir, context, false);
Utilities.setInputPaths(cloned, inputPaths);

List<Path> inputPaths = Utilities.getInputPaths(job, mWork, emptyScratchDir, ctx, false);
Utilities.setInputPaths(job, inputPaths);

Javadoc

Computes a list of all input paths needed to compute the given MapWork. All aliases are considered and a merged list of input paths is returned. If any input path points to an empty table or partition a dummy file in the scratch dir is instead created and added to the list. This is needed to avoid special casing the operator pipeline for these cases.

Popular methods of Utilities

getMapWork
getSessionSpecifiedClassLoader
get session specified class loader and get current class loader if fall
copyTableJobPropertiesToConf
Copies the storage handler properties configured for a table descriptor to a runtime job configurati
getMapRedWork
getResourceFiles
getDbTableName
getTaskId
Gets the task id if we are running as a Hadoop job. Gets a random number otherwise.
setColumnNameList
addToClassPath
Add new elements to the classpath.
deserializeExpression
getColumnNames
getColumnTypes

Popular in Java

Start an intent from android
setContentView (Activity)
onRequestPermissionsResult (Fragment)
orElseThrow (Optional)
Return the contained value, if present, otherwise throw an exception to be created by the provided s
InputStreamReader (java.io)
A class for turning a byte stream into a character stream. Data read from the source input stream is
TimeZone (java.util)
TimeZone represents a time zone offset, and also figures out daylight savings. Typically, you get a
XPath (javax.xml.xpath)
XPath provides access to the XPath evaluation environment and expressions. Evaluation of XPath Expr
JButton (javax.swing)
JTable (javax.swing)
Join (org.hibernate.mapping)
Top Sublime Text plugins

How to use getInputPathsmethodin org.apache.hadoop.hive.ql.exec.Utilities

Best Java code snippets using org.apache.hadoop.hive.ql.exec.Utilities.getInputPaths (Showing top 13 results out of 315)

How to use
getInputPaths
method
in
org.apache.hadoop.hive.ql.exec.Utilities