@Override
public List<InputSplit> getSplits(JobContext context) throws IOException {
    JobConf conf = HadoopCfgUtils.asJobConf(CompatHandler.jobContext(context).getConfiguration());
    // NOTE: this method expects a ShardInputSplit to be returned (which implements both the old and the new API).
    return Arrays.asList((InputSplit[]) getSplits(conf, conf.getNumMapTasks()));
}
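// For context, the (InputSplit[]) cast above relies on the returned splits satisfying
// both Hadoop APIs at once: extending the new-API abstract class while implementing
// the old-API interface. A minimal, hypothetical sketch of that pattern follows
// (illustrative only, not the actual ShardInputSplit source):

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

public class DualApiSplit extends org.apache.hadoop.mapreduce.InputSplit
        implements org.apache.hadoop.mapred.InputSplit {

    private String targetNode = ""; // hypothetical field, e.g. the node hosting a shard

    @Override
    public long getLength() throws IOException {
        // narrows the new API's 'throws IOException, InterruptedException' clause,
        // so this single method satisfies both declarations
        return 1L;
    }

    @Override
    public String[] getLocations() throws IOException {
        return new String[] { targetNode };
    }

    // the old-API InputSplit extends Writable, so the split must serialize itself
    @Override
    public void write(DataOutput out) throws IOException {
        out.writeUTF(targetNode);
    }

    @Override
    public void readFields(DataInput in) throws IOException {
        targetNode = in.readUTF();
    }
}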
@Override
public FileSplit[] getSplits(JobConf job, int numSplits) throws IOException {
    // first, merge input table properties (since there's no access to them ...)
    Settings settings = HadoopSettingsManager.loadFrom(job);
    //settings.merge(IOUtils.propsFromString(settings.getProperty(HiveConstants.INPUT_TBL_PROPERTIES)));

    Log log = LogFactory.getLog(getClass());
    // move on to initialization
    InitializationUtils.setValueReaderIfNotSet(settings, HiveValueReader.class, log);
    if (settings.getOutputAsJson() == false) {
        // Only set the fields if we aren't asking for raw JSON
        settings.setProperty(InternalConfigurationOptions.INTERNAL_ES_TARGET_FIELDS,
                StringUtils.concatenate(HiveUtils.columnToAlias(settings), ","));
    }
    HiveUtils.init(settings, log);

    // decorate original splits as FileSplit
    InputSplit[] shardSplits = super.getSplits(job, numSplits);
    FileSplit[] wrappers = new FileSplit[shardSplits.length];
    Path path = new Path(job.get(HiveConstants.TABLE_LOCATION));
    for (int i = 0; i < wrappers.length; i++) {
        wrappers[i] = new EsHiveSplit(shardSplits[i], path);
    }
    return wrappers;
}
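// For context, a hypothetical sketch of the kind of FileSplit decorator used in the
// wrapping loop above: Hive expects path-based FileSplits, so each shard split is
// wrapped in one that reports the table location while delegating the real work.
// DelegatingHiveSplit is an invented name for illustration; the shipped EsHiveSplit
// is not reproduced here.

import java.io.IOException;

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapred.FileSplit;
import org.apache.hadoop.mapred.InputSplit;

public class DelegatingHiveSplit extends FileSplit {

    private final InputSplit delegate;

    public DelegatingHiveSplit(InputSplit delegate, Path path) {
        // Hive only cares about the path; start/length are meaningless for a shard-backed split
        super(path, 0, 0, (String[]) null);
        this.delegate = delegate;
    }

    @Override
    public long getLength() {
        // FileSplit.getLength() declares no checked exceptions, so wrap the delegate's IOException
        try {
            return delegate.getLength();
        } catch (IOException ex) {
            throw new RuntimeException(ex);
        }
    }

    @Override
    public String[] getLocations() throws IOException {
        return delegate.getLocations();
    }

    // NOTE: a real implementation must also override write()/readFields() so the
    // wrapped delegate survives Writable (de)serialization; omitted in this sketch.
}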