com.datastax.spark.connector.japi.PairRDDJavaFunctions java code examples

/**
 * Applies a function to each item, and groups consecutive items having the same value together.
 * Contrary to `groupBy`, items from the same group must be already next to each other in the
 * original collection. Works locally on each partition, so items from different partitions will
 * never be placed in the same group.
 */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    ClassTag<U> uClassTag
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, uClassTag);
}

/**
 * Groups items with the same key, assuming the items with the same key are next to each other in the
 * collection. It does not perform shuffle, therefore it is much faster than using much more
 * universal Spark RDD `groupByKey`. For this method to be useful with Cassandra tables, the key must
 * represent a prefix of the primary key, containing at least the partition key of the Cassandra
 * table.
 */
public JavaPairRDD<K, Collection<V>> spanByKey() {
  return new PairRDDJavaFunctions<>(rdd()).spanByKey(kClassTag());
}

/**
 * A static factory method to create a {@link PairRDDJavaFunctions} based on an existing {@link
 * JavaPairRDD} instance.
 */
public static <K, V> PairRDDJavaFunctions<K, V> javaFunctions(JavaPairRDD<K, V> rdd) {
  return new PairRDDJavaFunctions<>(rdd.rdd());
}

/**
 * Groups items with the same key, assuming the items with the same key are next to each other in the
 * collection. It does not perform shuffle, therefore it is much faster than using much more
 * universal Spark RDD `groupByKey`. For this method to be useful with Cassandra tables, the key must
 * represent a prefix of the primary key, containing at least the partition key of the Cassandra
 * table.
 */
public JavaPairRDD<K, Collection<V>> spanByKey() {
  return new PairRDDJavaFunctions<>(rdd()).spanByKey(kClassTag());
}

/**
 * A static factory method to create a {@link PairRDDJavaFunctions} based on an existing {@link
 * JavaPairRDD} instance.
 */
public static <K, V> PairRDDJavaFunctions<K, V> javaFunctions(JavaPairRDD<K, V> rdd) {
  return new PairRDDJavaFunctions<>(rdd.rdd());
}

/**
 * Applies a function to each item, and groups consecutive items having the same value together.
 * Contrary to `groupBy`, items from the same group must be already next to each other in the
 * original collection. Works locally on each partition, so items from different partitions will
 * never be placed in the same group.
 */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    ClassTag<U> uClassTag
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, uClassTag);
}

/**
 * Groups items with the same key, assuming the items with the same key are next to each other in the
 * collection. It does not perform shuffle, therefore it is much faster than using much more
 * universal Spark RDD `groupByKey`. For this method to be useful with Cassandra tables, the key must
 * represent a prefix of the primary key, containing at least the partition key of the Cassandra
 * table.
 */
public JavaPairRDD<K, Collection<V>> spanByKey() {
  return new PairRDDJavaFunctions<>(rdd()).spanByKey(kClassTag());
}

/**
 * A static factory method to create a {@link PairRDDJavaFunctions} based on an existing {@link
 * JavaPairRDD} instance.
 */
public static <K, V> PairRDDJavaFunctions<K, V> javaFunctions(JavaPairRDD<K, V> rdd) {
  return new PairRDDJavaFunctions<>(rdd.rdd());
}

/**
 * Applies a function to each item, and groups consecutive items having the same value together.
 * Contrary to `groupBy`, items from the same group must be already next to each other in the
 * original collection. Works locally on each partition, so items from different partitions will
 * never be placed in the same group.
 */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    ClassTag<U> uClassTag
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, uClassTag);
}

/**
 * Groups items with the same key, assuming the items with the same key are next to each other in the
 * collection. It does not perform shuffle, therefore it is much faster than using much more
 * universal Spark RDD `groupByKey`. For this method to be useful with Cassandra tables, the key must
 * represent a prefix of the primary key, containing at least the partition key of the Cassandra
 * table.
 */
public JavaPairRDD<K, Collection<V>> spanByKey() {
  return new PairRDDJavaFunctions<>(rdd()).spanByKey(kClassTag());
}

/**
 * A static factory method to create a {@link PairRDDJavaFunctions} based on an existing {@link
 * JavaPairRDD} instance.
 */
public static <K, V> PairRDDJavaFunctions<K, V> javaFunctions(JavaPairRDD<K, V> rdd) {
  return new PairRDDJavaFunctions<>(rdd.rdd());
}

/**
 * Applies a function to each item, and groups consecutive items having the same value together.
 * Contrary to `groupBy`, items from the same group must be already next to each other in the
 * original collection. Works locally on each partition, so items from different partitions will
 * never be placed in the same group.
 */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    ClassTag<U> uClassTag
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, uClassTag);
}

/**
 * Groups items with the same key, assuming the items with the same key are next to each other in the
 * collection. It does not perform shuffle, therefore it is much faster than using much more
 * universal Spark RDD `groupByKey`. For this method to be useful with Cassandra tables, the key must
 * represent a prefix of the primary key, containing at least the partition key of the Cassandra
 * table.
 */
public JavaPairRDD<K, Collection<V>> spanByKey() {
  return new PairRDDJavaFunctions<>(rdd()).spanByKey(kClassTag());
}

/**
 * A static factory method to create a {@link PairRDDJavaFunctions} based on an existing {@link
 * JavaPairRDD} instance.
 */
public static <K, V> PairRDDJavaFunctions<K, V> javaFunctions(JavaPairRDD<K, V> rdd) {
  return new PairRDDJavaFunctions<>(rdd.rdd());
}

/**
 * Applies a function to each item, and groups consecutive items having the same value together.
 * Contrary to `groupBy`, items from the same group must be already next to each other in the
 * original collection. Works locally on each partition, so items from different partitions will
 * never be placed in the same group.
 */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    ClassTag<U> uClassTag
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, uClassTag);
}

/** @see {@link #spanBy(Function, ClassTag)} */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    Class<U> uClass
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, getClassTag(uClass));
}

/** @see {@link #spanBy(Function, ClassTag)} */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    Class<U> uClass
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, getClassTag(uClass));
}

/** @see {@link #spanBy(Function, ClassTag)} */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    Class<U> uClass
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, getClassTag(uClass));
}

/** @see {@link #spanBy(Function, ClassTag)} */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    Class<U> uClass
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, getClassTag(uClass));
}

/** @see {@link #spanBy(Function, ClassTag)} */
public <U> JavaPairRDD<U, Iterable<Tuple2<K, V>>> spanBy(
    Function<Tuple2<K, V>, U> function,
    Class<U> uClass
) {
  return new PairRDDJavaFunctions<>(rdd()).spanBy(function, getClassTag(uClass));
}

Most used methods

<init>
spanBy
spanByKey
Groups items with the same key, assuming the items with the same key are next to each other in the c

Popular in Java

Running tasks concurrently on multiple threads
setScale (BigDecimal)
onRequestPermissionsResult (Fragment)
setContentView (Activity)
Path (java.nio.file)
DecimalFormat (java.text)
A concrete subclass of NumberFormat that formats decimal numbers. It has a variety of features desig
BitSet (java.util)
The BitSet class implements abit array [http://en.wikipedia.org/wiki/Bit_array]. Each element is eit
NoSuchElementException (java.util)
Thrown when trying to retrieve an element past the end of an Enumeration or Iterator.
TimerTask (java.util)
The TimerTask class represents a task to run at a specified time. The task may be run once or repeat
Callable (java.util.concurrent)
A task that returns a result and may throw an exception. Implementors define a single method with no
From CI to AI: The AI layer in your organization

How to usePairRDDJavaFunctions in com.datastax.spark.connector.japi

Best Java code snippets using com.datastax.spark.connector.japi.PairRDDJavaFunctions (Showing top 20 results out of 315)

How to use
PairRDDJavaFunctions
in
com.datastax.spark.connector.japi