How to use AbstractCharStreamFilter in com.norconex.importer.handler.filter

Best Java code snippets using com.norconex.importer.handler.filter.AbstractCharStreamFilter (Showing top 6 results out of 315)

@Override
protected final boolean isDocumentMatched(
    String reference, InputStream input,
    ImporterMetadata metadata, boolean parsed)
    throws ImporterHandlerException {
  String inputCharset = detectCharsetIfBlank(
      sourceCharset, reference, input, metadata, parsed);
  try {
    InputStreamReader is = new InputStreamReader(input, inputCharset);
    return isTextDocumentMatching(reference, is, metadata, parsed);
  } catch (UnsupportedEncodingException e) {
    throw new ImporterHandlerException(e);
  }
}

@Override
protected final void loadFilterFromXML(
    XMLConfiguration xml) throws IOException {
  setSourceCharset(xml.getString("[@sourceCharset]", getSourceCharset()));
  loadCharStreamFilterFromXML(xml);
}

@Override
protected final void saveFilterToXML(EnhancedXMLStreamWriter writer)
    throws XMLStreamException {
  writer.writeAttributeString("sourceCharset", getSourceCharset());
  saveCharStreamFilterToXML(writer);
}

@Override
public String toString() {
  return new ToStringBuilder(this, ToStringStyle.SHORT_PREFIX_STYLE)
    .appendSuper(super.toString())
    .append("maxReadSize", maxReadSize)
    .toString();
}

@Override
public int hashCode() {
  return new HashCodeBuilder()
    .appendSuper(super.hashCode())
    .append(maxReadSize)
    .toHashCode();
}

@Override
public boolean equals(Object obj) {
  if (this == obj) {
    return true;
  }
  if (obj == null) {
    return false;
  }
  if (!(obj instanceof AbstractStringFilter)) {
    return false;
  }
  AbstractStringFilter other = (AbstractStringFilter) obj;
  return new EqualsBuilder()
    .appendSuper(super.equals(obj))
    .append(maxReadSize, other.maxReadSize)
    .isEquals();
}

Javadoc

Base class for filters dealing with the body of text documents only. Subclasses can safely be used as either pre-parse or post-parse handlers restricted to text documents only (see AbstractImporterHandler).

Since 2.5.0, when used as a pre-parse handler, this class attempts to detect the content character encoding unless one was specified with setSourceCharset(String). Because document parsing converts content to UTF-8, UTF-8 is always assumed when the class is used as a post-parse handler.
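The charset rules above can be illustrated with a small, self-contained sketch. It does not use the Norconex classes: CharsetFallbackSketch, resolveCharset and readAll are hypothetical names, and real charset detection is replaced by a fixed UTF-8 fallback, which is an assumption made only to keep the example runnable.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration, not part of the Norconex Importer API.
final class CharsetFallbackSketch {

    // Picks the charset used to decode the document body.
    static Charset resolveCharset(String sourceCharset, boolean parsed) {
        if (parsed) {
            // Post-parse content is assumed to be UTF-8 already.
            return StandardCharsets.UTF_8;
        }
        if (sourceCharset == null || sourceCharset.trim().isEmpty()) {
            // Assumption: stand-in for real charset detection.
            return StandardCharsets.UTF_8;
        }
        return Charset.forName(sourceCharset);
    }

    // Decodes the whole stream into a String using the resolved charset.
    static String readAll(InputStream input, String sourceCharset, boolean parsed) {
        StringBuilder text = new StringBuilder();
        try (Reader reader = new InputStreamReader(
                input, resolveCharset(sourceCharset, parsed))) {
            char[] buffer = new char[1024];
            int count;
            while ((count = reader.read(buffer)) != -1) {
                text.append(buffer, 0, count);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return text.toString();
    }
}
```

Passing an explicit charset name with parsed set to false decodes with that charset, while parsed set to true ignores the configured name and decodes as UTF-8.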

Subclasses inherit this IXMLConfigurable configuration:

 
<!-- The parent tag has these attributes:
     sourceCharset="(character encoding)"
     onMatch="[include|exclude]"
-->
<restrictTo
    caseSensitive="[false|true]"
    field="(name of header/metadata field to match)">
  (regular expression of value to match)
</restrictTo>
<!-- Multiple "restrictTo" tags are allowed (only one needs to match). -->
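To make the "only one needs to match" rule concrete, here is a minimal sketch of how such restrictions might be evaluated. It is not the library's implementation: RestrictToSketch, Restriction and isApplicable are hypothetical names, and two behaviours are assumptions of this sketch, namely that the regular expression must match the whole field value and that a handler with no restrictions applies to every document.

```java
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical illustration, not part of the Norconex Importer API.
final class RestrictToSketch {

    // One "restrictTo" entry: a metadata field plus a regular expression.
    static final class Restriction {
        final String field;
        final Pattern pattern;

        Restriction(String field, String regex, boolean caseSensitive) {
            this.field = field;
            this.pattern = Pattern.compile(
                    regex, caseSensitive ? 0 : Pattern.CASE_INSENSITIVE);
        }

        // True if any value of the field matches the pattern entirely
        // (whole-value matching is an assumption of this sketch).
        boolean matches(Map<String, List<String>> metadata) {
            List<String> values = metadata.get(field);
            if (values == null) {
                return false;
            }
            for (String value : values) {
                if (pattern.matcher(value).matches()) {
                    return true;
                }
            }
            return false;
        }
    }

    // A handler applies when at least one restriction matches.
    // Assumption: with no restrictions, it applies to every document.
    static boolean isApplicable(
            List<Restriction> restrictions,
            Map<String, List<String>> metadata) {
        if (restrictions.isEmpty()) {
            return true;
        }
        for (Restriction restriction : restrictions) {
            if (restriction.matches(metadata)) {
                return true;
            }
        }
        return false;
    }
}
```

With a metadata map whose Content-Type field holds "text/HTML", a case-insensitive restriction on "text/html" matches and a case-sensitive one does not. A list containing both still applies, because one match is enough.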

Most used methods

detectCharsetIfBlank
equals
getSourceCharset
Gets the assumed source character encoding.
hashCode
isTextDocumentMatching
loadCharStreamFilterFromXML
Loads configuration settings specific to the implementing class.
saveCharStreamFilterToXML
Saves configuration settings specific to the implementing class. The parent tag along with the "clas
setSourceCharset
Sets the assumed source character encoding.
toString

