- delete
- countRecords
Count all the records in a directory using a
org.apache.mahout.common.iterator.sequencefile.Sequence
- getFileStatus
- listStatus
- buildDirList
Builds a comma-separated list of input splits
- cacheFiles
- findInCacheByPartOfFilename
Finds a file in the DistributedCache
- getCachedFiles
Retrieves paths to cached files.
- getCustomJobName
- getSingleCachedFile
Return the first cached file in the list, else null if thre are no cached files.
- openStream
- prepareJob
Create a map-only Hadoop Job out of the passed in parameters. Does not set the
Job name.