How to use
init
method
in
org.apache.tika.parser.pdf.PDFParserConfig

Best Java code snippets using org.apache.tika.parser.pdf.PDFParserConfig.init (Showing top 6 results out of 315)

/**
 * Loads properties from InputStream and then tries to close InputStream.
 * If there is an IOException, this silently swallows the exception
 * and goes back to the default.
 *
 * @param is
 */
public PDFParserConfig(InputStream is) {
  init(is);
}

public PDFParserConfig() {
  init(this.getClass().getResourceAsStream("PDFParser.properties"));
}

/**
 * Loads properties from InputStream and then tries to close InputStream.
 * If there is an IOException, this silently swallows the exception
 * and goes back to the default.
 *
 * @param is
 */
public PDFParserConfig(InputStream is) {
  init(is);
}

/**
 * Loads properties from InputStream and then tries to close InputStream.
 * If there is an IOException, this silently swallows the exception
 * and goes back to the default.
 *
 * @param is
 */
public PDFParserConfig(InputStream is) {
  init(is);
}

public PDFParserConfig() {
  init(this.getClass().getResourceAsStream("PDFParser.properties"));
}

public PDFParserConfig() {
  init(this.getClass().getResourceAsStream("PDFParser.properties"));
}

Popular methods of PDFParserConfig

setExtractInlineImages
If true, extract inline embedded OBXImages.Beware: some PDF documents of modest size (~4MB) can cont
<init>
Loads properties from InputStream and then tries to close InputStream. If there is an IOException, t
setExtractUniqueInlineImagesOnly
Multiple pages within a PDF file might refer to the same underlying image. If #extractUniqueInlineIm
setOcrStrategy
Which strategy to use for OCR
setSuppressDuplicateOverlappingText
If true, the parser should try to remove duplicated text over the same region. This is needed for so
configure
Configures the given pdf2XHTML.
setEnableAutoSpace
If true (the default), the parser should estimate where spaces should be inserted between words. For
setExtractAcroFormContent
If true (the default), extract content from AcroForms at the end of the document. If an XFA is found
setExtractAnnotationText
If true (the default), text in annotations will be extracted.
setSortByPosition
If true, sort text tokens by their x/y position before extracting text. This may be necessary for so
getAccessChecker
getAverageCharTolerance

Popular in Java

Updating database using SQL prepared statement
putExtra (Intent)
getResourceAsStream (ClassLoader)
getOriginalFilename (MultipartFile)
Return the original filename in the client's filesystem.This may contain path information depending
FileReader (java.io)
A specialized Reader that reads from a file in the file system. All read requests made by calling me
Timer (java.util)
Timers schedule one-shot or recurring TimerTask for execution. Prefer java.util.concurrent.Scheduled
Executor (java.util.concurrent)
An object that executes submitted Runnable tasks. This interface provides a way of decoupling task s
Executors (java.util.concurrent)
Factory and utility methods for Executor, ExecutorService, ScheduledExecutorService, ThreadFactory,
TimeUnit (java.util.concurrent)
A TimeUnit represents time durations at a given unit of granularity and provides utility methods to
DataSource (javax.sql)
An interface for the creation of Connection objects which represent a connection to a database. This
Top plugins for WebStorm

How to use initmethodin org.apache.tika.parser.pdf.PDFParserConfig

Best Java code snippets using org.apache.tika.parser.pdf.PDFParserConfig.init (Showing top 6 results out of 315)

How to use
init
method
in
org.apache.tika.parser.pdf.PDFParserConfig