public final class MinFulltextWordsFilter extends java.lang.Object implements BoilerpipeFilter
Keeps only those content blocks that contain at least k full-text words (measured by HeuristicFilterBase.getNumFullTextWords(TextBlock)). k is 30 by default.

| Modifier and Type | Field and Description |
|---|---|
| static MinFulltextWordsFilter | DEFAULT_INSTANCE |
| Constructor and Description |
|---|
| MinFulltextWordsFilter(int minWords) |
| Modifier and Type | Method and Description |
|---|---|
| static MinFulltextWordsFilter | getDefaultInstance() |
| protected static int | getNumFullTextWords(TextBlock tb) |
| protected static int | getNumFullTextWords(TextBlock tb, float minTextDensity) |
| boolean | process(TextDocument doc) Processes the given document doc. |
public static final MinFulltextWordsFilter DEFAULT_INSTANCE
public static MinFulltextWordsFilter getDefaultInstance()
public boolean process(TextDocument doc) throws BoilerpipeProcessingException
Description copied from interface: BoilerpipeFilter
Processes the given document doc.
Specified by: process in interface BoilerpipeFilter
Parameters: doc - The TextDocument that is to be processed.
Returns: true if changes have been made to the TextDocument.
Throws: BoilerpipeProcessingException
protected static int getNumFullTextWords(TextBlock tb)
protected static int getNumFullTextWords(TextBlock tb, float minTextDensity)
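The page documents a threshold of k full-text words (30 by default) and a process method that returns true when the document was changed. The sketch below illustrates that contract with stand-in types and does not reproduce the library's code. The class MinWordsSketch, its Block type, the density cutoff of 9.0, and the rule that a block below the cutoff counts as zero full-text words are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-ins for TextBlock and the filter; not boilerpipe's implementation.
final class MinWordsSketch {

    /** Simplified block: a word count, a text density, and a content flag. */
    static final class Block {
        final int numWords;
        final float textDensity;
        boolean isContent;

        Block(int numWords, float textDensity, boolean isContent) {
            this.numWords = numWords;
            this.textDensity = textDensity;
            this.isContent = isContent;
        }
    }

    // Assumed rule: only blocks at or above the density cutoff contribute full-text words.
    static int numFullTextWords(Block b, float minTextDensity) {
        return b.textDensity >= minTextDensity ? b.numWords : 0;
    }

    // Marks content blocks with fewer than minWords full-text words as non-content.
    // Returns true if any block was changed, matching the documented return contract.
    static boolean process(List<Block> blocks, int minWords, float minTextDensity) {
        boolean changed = false;
        for (Block b : blocks) {
            if (b.isContent && numFullTextWords(b, minTextDensity) < minWords) {
                b.isContent = false;
                changed = true;
            }
        }
        return changed;
    }

    public static void main(String[] args) {
        List<Block> blocks = new ArrayList<>();
        blocks.add(new Block(120, 12.0f, true)); // long, dense paragraph
        blocks.add(new Block(8, 3.0f, true));    // short, sparse snippet
        boolean changed = process(blocks, 30, 9.0f);
        // prints "true true false"
        System.out.println(changed + " " + blocks.get(0).isContent + " " + blocks.get(1).isContent);
    }
}
```

Running the filter a second time over the same list returns false, since no remaining content block falls below the threshold.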