How to use
getGlobalMetaData
method
in
parquet.hadoop.ParquetFileWriter

Best Java code snippets using parquet.hadoop.ParquetFileWriter.getGlobalMetaData (Showing top 3 results out of 315)

/**
 * Will merge the metadata of all the footers together
 * @param footers the list files footers to merge
 * @return the global meta data for all the footers
 */
static GlobalMetaData getGlobalMetaData(List<Footer> footers) {
 return getGlobalMetaData(footers, true);
}

/**
 * @param jobContext the current job context
 * @return the merged metadata from the footers
 * @throws IOException
 */
public GlobalMetaData getGlobalMetaData(JobContext jobContext) throws IOException {
 return ParquetFileWriter.getGlobalMetaData(getFooters(jobContext));
}

/**
 * @param configuration the configuration to connect to the file system
 * @param footers the footers of the files to read
 * @return the splits for the footers
 * @throws IOException
 * @deprecated split planning using file footers will be removed
 */
@Deprecated
public List<ParquetInputSplit> getSplits(Configuration configuration, List<Footer> footers) throws IOException {
 boolean strictTypeChecking = configuration.getBoolean(STRICT_TYPE_CHECKING, true);
 final long maxSplitSize = configuration.getLong("mapred.max.split.size", Long.MAX_VALUE);
 final long minSplitSize = Math.max(getFormatMinSplitSize(), configuration.getLong("mapred.min.split.size", 0L));
 if (maxSplitSize < 0 || minSplitSize < 0) {
  throw new ParquetDecodingException("maxSplitSize or minSplitSize should not be negative: maxSplitSize = " + maxSplitSize + "; minSplitSize = " + minSplitSize);
 }
 GlobalMetaData globalMetaData = ParquetFileWriter.getGlobalMetaData(footers, strictTypeChecking);
 ReadContext readContext = getReadSupport(configuration).init(new InitContext(
   configuration,
   globalMetaData.getKeyValueMetaData(),
   globalMetaData.getSchema()));
 return new ClientSideMetadataSplitStrategy().getSplits(
   configuration, footers, maxSplitSize, minSplitSize, readContext);
}

Javadoc

Will merge the metadata of all the footers together

Popular methods of ParquetFileWriter

<init>
end
ends a file once all blocks have been written. closes the file.
endBlock
ends a block once all column chunks have been written
endColumn
end a column (once all rep, def and data have been written)
getPos
mergeFooters
mergeInto
will return the result of merging toMerge into mergedSchema
serializeFooter
start
start the file
startBlock
start a block
startColumn
start a column inside a block
writeDataPages
writes a number of pages at once

Popular in Java

Creating JSON documents from java classes using gson
startActivity (Activity)
onCreateOptionsMenu (Activity)
notifyDataSetChanged (ArrayAdapter)
ConnectException (java.net)
A ConnectException is thrown if a connection cannot be established to a remote host on a specific po
InetAddress (java.net)
An Internet Protocol (IP) address. This can be either an IPv4 address or an IPv6 address, and in pra
SocketException (java.net)
This SocketException may be thrown during socket creation or setting options, and is the superclass
ResultSet (java.sql)
An interface for an object which represents a database table entry, returned as the result of the qu
SSLHandshakeException (javax.net.ssl)
The exception that is thrown when a handshake could not be completed successfully.
Rectangle (java.awt)
A Rectangle specifies an area in a coordinate space that is enclosed by the Rectangle object's top-
Github Copilot alternatives

How to use getGlobalMetaDatamethodin parquet.hadoop.ParquetFileWriter

Best Java code snippets using parquet.hadoop.ParquetFileWriter.getGlobalMetaData (Showing top 3 results out of 315)

How to use
getGlobalMetaData
method
in
parquet.hadoop.ParquetFileWriter