PCollection<String> data = pipeline.apply("ReadFromGDELTFile", TextIO.read().from(options.getInput()));
@Test
public void testInitialSplitGzipModeTxt() throws Exception {
  PipelineOptions options = TestPipeline.testingPipelineOptions();
  long desiredBundleSize = 1000;
  File largeTxt = writeToFile(LARGE, tempFolder, "large.txt", UNCOMPRESSED);
  // Sanity check: file is at least 2 bundles long.
  assertThat(largeTxt.length(), greaterThan(2 * desiredBundleSize));
  FileBasedSource<String> source =
      TextIO.read().from(largeTxt.getPath()).withCompression(GZIP).getSource();
  List<? extends FileBasedSource<String>> splits = source.split(desiredBundleSize, options);
  // Exactly 1 split, even though splittable text file, since using GZIP mode.
  assertThat(splits, hasSize(equalTo(1)));
  SourceTestUtils.assertSourcesEqualReferenceSource(source, splits, options);
}
"ReadFromSource", TextIO.read() .from(options.getInputFilePattern()) .watchForNewFiles(DEFAULT_POLL_INTERVAL, Growth.never()))
.apply("ReadFromHDFS", TextIO.read().from(options.getInput().toString()));
p.apply(TextIO.read().from(options.getInputFile()))
    .apply(ParDo.of(new ExtractHashtags()))
    .apply(Window.into(windowFn))
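// ExtractHashtags and windowFn are referenced above but not defined in the
// snippet; the following is a plausible sketch only. It assumes hashtags are
// whitespace-delimited tokens starting with '#', and uses one-minute fixed
// windows purely as an example windowFn.
static class ExtractHashtags extends DoFn<String, String> {
  @ProcessElement
  public void processElement(@Element String line, OutputReceiver<String> out) {
    for (String token : line.split("\\s+")) {
      if (token.startsWith("#")) {
        out.output(token);
      }
    }
  }
}

WindowFn<Object, IntervalWindow> windowFn = FixedWindows.of(Duration.standardMinutes(1));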
.apply("Read from source", TextIO.read().from(options.getInputFilePattern())) .apply( TransformTextViaJavascript.newBuilder()
p.apply("ReadMyFile", TextIO.read().from(inputFile.getPath())) .apply(sample) .apply(Flatten.iterables())
p.apply(TextIO.read().from("gs://apache-beam-samples/shakespeare/*"))
if (null != options.getSitesFilepath()) {
  requests =
      p.apply("ReadSites", TextIO.read().from(options.getSitesFilepath()))
          .apply(new SitesToShards.SitesToStreamVariantsShardsTransform(prototype));
} else {
.apply(TextIO.read().from(options.getInput()))
p.apply(TextIO.read().from(inputFilePath))
p.apply("ReadLines", TextIO.read().from(options.getInput())) .apply("ParseVariantIds", ParDo.of(new DoFn<String, String>() { @ProcessElement
p.apply("ReadLines", TextIO.read().from(options.getInput())) .apply("ParseVariantIds", ParDo.of(new DoFn<String, String>() { @ProcessElement
pipeline
    .apply(TextIO.read().from(options.getInputFile()))
p.apply("ReadMyFile", TextIO.read().from(options.getInputFile())) .apply("TransformParsingsToBigtable", ParDo.of(MUTATION_TRANSFORM)) .apply("WriteToBigtable", CloudBigtableIO.writeToTable(config));
.apply("ReadFromGDELTFile", TextIO.read().from(options.getInput())) .apply("TakeASample", Sample.<String>any(10)); read.apply(ParDo.of(new DoFn<String, Void>() {
  return pipeline
      .apply(read)
      .apply(MapElements.into(TypeDescriptor.of(String.class)).via(KV::getValue));
} else {
  return pipeline.apply(TextIO.read().from(path));