private void writeObject(ObjectOutputStream out) throws IOException { ArrayList<String> udfImportList = Lists.newArrayList(Splitter.on(",").split(properties.getProperty(SPARK_UDF_IMPORT_LIST))); out.writeObject(udfImportList); //2 threads call SparkEngineConf#writeObject //In main thread: SparkLauncher#initialize->SparkUtil#newJobConf // ->ObjectSerializer#serialize-> SparkEngineConf#writeObject //In dag-scheduler-event-loop thread: DAGScheduler.submitMissingTasks->JavaSerializationStream.writeObject // //In main thread,UDFContext#getUDFContext is not empty, we store UDFContext#udfConfs and UDFContext#clientSysProps //into properties and serialize them. //In dag-scheduler-event-loop thread, UDFContext#getUDFContext is empty, we get value of UDFContext#udfConfs and UDFContext#clientSysProps //from properties and serialize them. if (!UDFContext.getUDFContext().isUDFConfEmpty()) { //In SparkUtil#newJobConf(), sparkEngineConf is serialized in job configuration and will call //SparkEngineConf#writeObject(at this time UDFContext#udfConfs and UDFContext#clientSysProps is not null) //later spark will call JavaSerializationStream.writeObject to serialize all objects when submit spark //jobs(at that time, UDFContext#udfConfs and UDFContext#clientSysProps is null so we need to save their //value in SparkEngineConf#properties after these two variables are correctly initialized in //SparkUtil#newJobConf, More detailed see PIG-4920 String udfConfsStr = UDFContext.getUDFContext().serialize(); String clientSysPropsStr = ObjectSerializer.serialize(UDFContext.getUDFContext().getClientSystemProps()); this.properties.setProperty(SPARK_UDFCONTEXT_UDFCONFS, udfConfsStr); this.properties.setProperty(SPARK_UDFCONTEXT_CLIENTSYSPROPS, clientSysPropsStr); out.writeObject(udfConfsStr); out.writeObject(clientSysPropsStr); } else { out.writeObject(this.properties.getProperty(SPARK_UDFCONTEXT_UDFCONFS)); out.writeObject(this.properties.getProperty(SPARK_UDFCONTEXT_CLIENTSYSPROPS)); } }
private void init(PhysicalPlan pp, POStore poStore) throws IOException { poStore.setStoreImpl(new FetchPOStoreImpl(pigContext)); poStore.setUp(); TaskAttemptID taskAttemptID = HadoopShims.getNewTaskAttemptID(); //Fetch mode needs to explicitly set the task id which is otherwise done by Hadoop conf.setInt(MRConfiguration.JOB_APPLICATION_ATTEMPT_ID, taskAttemptID.getId()); if (!PlanHelper.getPhysicalOperators(pp, POStream.class).isEmpty()) { MapRedUtil.setupStreamingDirsConfSingle(poStore, pigContext, conf); } String currentTime = Long.toString(System.currentTimeMillis()); conf.set("pig.script.submitted.timestamp", currentTime); conf.set("pig.job.submitted.timestamp", currentTime); PhysicalOperator.setReporter(new FetchProgressableReporter()); SchemaTupleBackend.initialize(conf, pigContext); UDFContext udfContext = UDFContext.getUDFContext(); udfContext.addJobConf(conf); udfContext.setClientSystemProps(pigContext.getProperties()); udfContext.serialize(conf); PigMapReduce.sJobConfInternal.set(conf); Utils.setDefaultTimeZone(conf); boolean aggregateWarning = "true".equalsIgnoreCase(conf.get("aggregate.warning")); PigStatusReporter pigStatusReporter = PigStatusReporter.getInstance(); pigStatusReporter.setContext(new FetchTaskContext(new FetchContext())); PigHadoopLogger pigHadoopLogger = PigHadoopLogger.getInstance(); pigHadoopLogger.setReporter(pigStatusReporter); pigHadoopLogger.setAggregate(aggregateWarning); PhysicalOperator.setPigLogger(pigHadoopLogger); }
// Snapshot the current UDFContext into the job configuration so it travels
// with the job and can be restored on the backend after deserialization.
UDFContext.getUDFContext().serialize(conf);
Job cjob = new Job(new JobConf(conf), new ArrayList<Job>());
// Remember this job's store locations and its temporary output path —
// presumably so results can be relocated/cleaned up once the job finishes
// (TODO confirm against the code that consumes jobStoreMap).
jobStoreMap.put(cjob,new Pair<List<POStore>, Path>(storeLocations, tmpLocation));
// Snapshot the client-side UDFContext into the job configuration for the backend.
UDFContext.getUDFContext().serialize(conf);
// Ship the package import list in the conf — presumably used for resolving
// UDF class names on the backend (TODO confirm where "udf.import.list" is read).
conf.set("udf.import.list", ObjectSerializer.serialize(PigContext.getPackageImportList()));
// Snapshot the client-side UDFContext into the Spark job configuration so it
// is available to tasks after the conf is deserialized on the backend.
UDFContext.getUDFContext().serialize(jobConf);