@Override
public void prepareRun(BatchSourceContext context) throws DatasetManagementException {
  super.prepareRun(context);
  Schema schema = tableConfig.getSchema();
  if (schema != null && schema.getFields() != null) {
    // Record field-level lineage: a single READ operation covering every field in the schema.
    FieldOperation operation = new FieldReadOperation("Read", "Read from Table dataset",
                                                      EndPoint.of(context.getNamespace(), tableConfig.getName()),
                                                      schema.getFields().stream()
                                                        .map(Schema.Field::getName)
                                                        .collect(Collectors.toList()));
    context.record(Collections.singletonList(operation));
  }
}
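For context, a minimal sketch of the operation this method records for a concrete schema; the "default" namespace, the "purchases" table name, and the field names are hypothetical:

// Hypothetical schema: a "purchases" Table with two fields.
Schema schema = Schema.recordOf("purchase",
                                Schema.Field.of("id", Schema.of(Schema.Type.LONG)),
                                Schema.Field.of("item", Schema.of(Schema.Type.STRING)));

// Equivalent to what prepareRun would record for this schema:
// a READ from default.purchases producing the output fields ["id", "item"].
FieldOperation operation = new FieldReadOperation("Read", "Read from Table dataset",
                                                  EndPoint.of("default", "purchases"),
                                                  schema.getFields().stream()
                                                    .map(Schema.Field::getName)
                                                    .collect(Collectors.toList()));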
case READ:
  FieldReadOperation read = (FieldReadOperation) fieldOperation;
  newOperation = new ReadOperation(newOperationName, read.getDescription(), read.getSource(),
                                   read.getOutputFields());
  currentOperationOutputs.addAll(read.getOutputFields());
  break;
case TRANSFORM:
FieldReadOperation read = (FieldReadOperation) pipelineOperation;
updateInvalidOutputs(Collections.emptyList(), unusedOutputs, redundantOutputs);
validInputsSoFar.addAll(read.getOutputFields());
for (String field : read.getOutputFields()) {
  // A read output is unused until some downstream operation consumes it;
  // remember which operation produced the field so it can be reported later.
  List<String> origins = unusedOutputs.computeIfAbsent(field, k -> new ArrayList<>());
  origins.add(pipelineOperation.getName());
}
@Override
public void prepareRun(BatchSourceContext context) throws Exception {
  InputFormatProvider inputFormatProvider = context.newPluginInstance(FORMAT_PLUGIN_ID);
  DatasetProperties datasetProperties = createProperties(inputFormatProvider);

  // The dataset must still be created at runtime if macros were provided at configure time.
  if (!context.datasetExists(config.getName())) {
    context.createDataset(config.getName(), PartitionedFileSet.class.getName(), datasetProperties);
  }

  PartitionedFileSet partitionedFileSet = context.getDataset(config.getName());
  SnapshotFileSet snapshotFileSet = new SnapshotFileSet(partitionedFileSet);

  // Explicit file properties, when set, replace the default dataset arguments entirely.
  Map<String, String> arguments = new HashMap<>(datasetProperties.getProperties());
  if (config.getFileProperties() != null) {
    arguments = GSON.fromJson(config.getFileProperties(), MAP_TYPE);
  }

  Schema schema = config.getSchema();
  if (schema.getFields() != null) {
    String formatName = getInputFormatName();
    FieldOperation operation = new FieldReadOperation("Read",
      String.format("Read from SnapshotFile source in %s format.", formatName),
      EndPoint.of(context.getNamespace(), config.getName()),
      schema.getFields().stream().map(Schema.Field::getName).collect(Collectors.toList()));
    context.record(Collections.singletonList(operation));
  }

  context.setInput(Input.ofDataset(config.getName(), snapshotFileSet.getInputArguments(arguments)));
}
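A sketch of the fileProperties override above, assuming MAP_TYPE is a Gson TypeToken for Map<String, String>; the property key and value here are hypothetical:

import com.google.gson.Gson;
import com.google.gson.reflect.TypeToken;

// Hypothetical fileProperties JSON; when present it replaces the defaults
// derived from datasetProperties rather than being merged with them.
String fileProperties = "{\"mapreduce.input.fileinputformat.split.minsize\": \"1048576\"}";
Map<String, String> arguments = new Gson().fromJson(
  fileProperties, new TypeToken<Map<String, String>>() { }.getType());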
@Override
public void prepareRun(BatchSourceContext context) throws DatasetManagementException, InstantiationException {
  config.validate();
  InputFormatProvider inputFormatProvider = context.newPluginInstance(FORMAT_PLUGIN_ID);
  DatasetProperties datasetProperties = createProperties(inputFormatProvider);

  // If macros were provided, the dataset still needs to be created at runtime.
  if (!context.datasetExists(config.getName())) {
    String tpfsName = config.getName();
    context.createDataset(tpfsName, TimePartitionedFileSet.class.getName(), datasetProperties);
  }

  Schema schema = config.getSchema();
  if (schema.getFields() != null) {
    String formatName = getInputFormatName();
    FieldOperation operation = new FieldReadOperation("Read",
      String.format("Read from TimePartitionedFileSet in %s format.", formatName),
      EndPoint.of(context.getNamespace(), config.getName()),
      schema.getFields().stream().map(Schema.Field::getName).collect(Collectors.toList()));
    context.record(Collections.singletonList(operation));
  }

  // Read the time window ending `delay` before the logical start time and
  // spanning `duration` back from there.
  long duration = TimeParser.parseDuration(config.getDuration());
  long delay = Strings.isNullOrEmpty(config.getDelay()) ? 0 : TimeParser.parseDuration(config.getDelay());
  long endTime = context.getLogicalStartTime() - delay;
  long startTime = endTime - duration;

  Map<String, String> sourceArgs = Maps.newHashMap(datasetProperties.getProperties());
  TimePartitionedFileSetArguments.setInputStartTime(sourceArgs, startTime);
  TimePartitionedFileSetArguments.setInputEndTime(sourceArgs, endTime);
  context.setInput(Input.ofDataset(config.getName(), sourceArgs));
}
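To make the window arithmetic concrete, a hedged sketch assuming TimeParser.parseDuration returns milliseconds (consistent with its use against getLogicalStartTime() above); the duration, delay, and start-time values are hypothetical:

import java.util.concurrent.TimeUnit;

// Hypothetical run: duration = "1h", delay = "10m", logical start = 12:00 (epoch millis).
long logicalStartTime = 1_700_000_000_000L;
long duration = TimeUnit.HOURS.toMillis(1);    // parseDuration("1h"), assumed milliseconds
long delay = TimeUnit.MINUTES.toMillis(10);    // parseDuration("10m"), assumed milliseconds
long endTime = logicalStartTime - delay;       // 11:50
long startTime = endTime - duration;           // 10:50
// The source reads partitions whose creation time falls in [startTime, endTime),
// i.e. the hour of data ending ten minutes before the logical start time.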