Package | Description |
---|---|
org.apache.lucene.analysis |
Text analysis.
|
org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
org.apache.lucene.analysis.miscellaneous |
Miscellaneous Tokenstreams.
|
org.apache.lucene.search.uhighlight |
The UnifiedHighlighter -- a flexible highlighter that can get offsets from postings, term vectors, or analysis.
|
Modifier and Type | Class and Description |
---|---|
class |
StopFilter
Removes stop words from a token stream.
|
Modifier and Type | Class and Description |
---|---|
class |
TypeTokenFilter
Removes tokens whose types appear in a set of blocked types from a token stream.
|
Modifier and Type | Class and Description |
---|---|
class |
CodepointCountFilter
Removes words that are too long or too short from the stream.
|
class |
DateRecognizerFilter
Filters all tokens that cannot be parsed to a date, using the provided
DateFormat . |
class |
KeepWordFilter
A TokenFilter that only keeps tokens with text contained in the
required words.
|
class |
LengthFilter
Removes words that are too long or too short from the stream.
|
Modifier and Type | Method and Description |
---|---|
private static FilteringTokenFilter |
MemoryIndexOffsetStrategy.newKeepWordFilter(TokenStream tokenStream,
CharacterRunAutomaton charRunAutomaton) |