- <init>
Constructor that allows specification of the target number of words in the
output.
- setWordsToKeep
Sets the number of words (per class if there is a class attribute assigned) to
attempt to keep.
- setIDFTransform
Sets whether the word frequencies in a document should be transformed into:
fij*log(num of Docs/
- setNormalizeDocLength
Sets whether the word frequencies for a document (instance) should be
normalized or not.
- setOutputWordCounts
Sets whether output instances contain 0 or 1 indicating word presence, or word
counts.
- setStemmer
Sets the stemming algorithm to use; null means no stemming at all (i.e., the
NullStemmer is used).
- setTFTransform
Sets whether the word frequencies should be transformed into log(1+fij), where
fij is the frequenc
- bufferInput
- flushInput
- getAttributeNamePrefix
Get the attribute name prefix.
- getDoNotOperateOnPerClassBasis
Get the DoNotOperateOnPerClassBasis value.
- getIDFTransform
Gets whether the word frequencies in a document should be transformed into:
fij*log(num of Docs/
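
The effect of the setOutputWordCounts and setTFTransform options listed above can be sketched in plain Java without the library. This is only an illustration under stated assumptions: the class WordVectorSketch and its helpers wordCounts, presence, and tfTransform are hypothetical names, not part of the filter's API, and the whitespace tokenizer is a stand-in for whatever tokenizer the filter actually uses. The IDF transform is left out because its formula is truncated in the listing.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical, library-free sketch; none of these names belong to the real filter.
public class WordVectorSketch {

    // Count how often each whitespace-separated token occurs in one document.
    // (Assumption: simple whitespace splitting, no stemming, no stop words.)
    static Map<String, Double> wordCounts(String document) {
        Map<String, Double> counts = new LinkedHashMap<>();
        for (String token : document.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1.0, Double::sum);
            }
        }
        return counts;
    }

    // Word presence: 1 if the word occurs at all, 0 otherwise
    // (the alternative to outputting raw word counts).
    static double presence(double fij) {
        return fij > 0 ? 1.0 : 0.0;
    }

    // TF transform as described in the listing: log(1 + fij),
    // where fij is the frequency of word i in document j.
    static double tfTransform(double fij) {
        return Math.log(1.0 + fij);
    }

    public static void main(String[] args) {
        Map<String, Double> counts = wordCounts("the cat saw the other cat");
        for (Map.Entry<String, Double> e : counts.entrySet()) {
            double fij = e.getValue();
            System.out.printf("%s count=%.0f presence=%.0f tf=%.4f%n",
                    e.getKey(), fij, presence(fij), tfTransform(fij));
        }
    }
}
```

Running main prints one line per distinct word with its raw count, its 0/1 presence value, and its log-scaled frequency, which makes it easy to compare the three output styles side by side.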