Package | Description |
---|---|
org.apache.lucene.analysis |
Text analysis.
|
org.apache.lucene.analysis.ar |
Analyzer for Arabic.
|
org.apache.lucene.analysis.bg |
Analyzer for Bulgarian.
|
org.apache.lucene.analysis.bn |
Analyzer for Bengali Language.
|
org.apache.lucene.analysis.br |
Analyzer for Brazilian Portuguese.
|
org.apache.lucene.analysis.ca |
Analyzer for Catalan.
|
org.apache.lucene.analysis.charfilter |
Normalization of text before the tokenizer.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
|
org.apache.lucene.analysis.ckb |
Analyzer for Sorani Kurdish.
|
org.apache.lucene.analysis.cn.smart |
Analyzer for Simplified Chinese, which indexes words.
|
org.apache.lucene.analysis.commongrams |
Construct n-grams for frequently occurring terms and phrases.
|
org.apache.lucene.analysis.compound |
A filter that decomposes compound words you find in many Germanic
languages into the word parts.
|
org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
org.apache.lucene.analysis.cz |
Analyzer for Czech.
|
org.apache.lucene.analysis.da |
Analyzer for Danish.
|
org.apache.lucene.analysis.de |
Analyzer for German.
|
org.apache.lucene.analysis.el |
Analyzer for Greek.
|
org.apache.lucene.analysis.en |
Analyzer for English.
|
org.apache.lucene.analysis.es |
Analyzer for Spanish.
|
org.apache.lucene.analysis.eu |
Analyzer for Basque.
|
org.apache.lucene.analysis.fa |
Analyzer for Persian.
|
org.apache.lucene.analysis.fi |
Analyzer for Finnish.
|
org.apache.lucene.analysis.fr |
Analyzer for French.
|
org.apache.lucene.analysis.ga |
Analyzer for Irish.
|
org.apache.lucene.analysis.gl |
Analyzer for Galician.
|
org.apache.lucene.analysis.hi |
Analyzer for Hindi.
|
org.apache.lucene.analysis.hu |
Analyzer for Hungarian.
|
org.apache.lucene.analysis.hy |
Analyzer for Armenian.
|
org.apache.lucene.analysis.id |
Analyzer for Indonesian.
|
org.apache.lucene.analysis.it |
Analyzer for Italian.
|
org.apache.lucene.analysis.lt |
Analyzer for Lithuanian.
|
org.apache.lucene.analysis.lv |
Analyzer for Latvian.
|
org.apache.lucene.analysis.miscellaneous |
Miscellaneous Tokenstreams.
|
org.apache.lucene.analysis.nl |
Analyzer for Dutch.
|
org.apache.lucene.analysis.no |
Analyzer for Norwegian.
|
org.apache.lucene.analysis.pt |
Analyzer for Portuguese.
|
org.apache.lucene.analysis.ro |
Analyzer for Romanian.
|
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.snowball |
TokenFilter and Analyzer implementations that use Snowball
stemmers. |
org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizer
StandardTokenizer
implements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in
Unicode Standard Annex #29. |
org.apache.lucene.analysis.sv |
Analyzer for Swedish.
|
org.apache.lucene.analysis.th |
Analyzer for Thai.
|
org.apache.lucene.analysis.tr |
Analyzer for Turkish.
|
org.apache.lucene.analysis.util |
Utility functions for text analysis.
|
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
CharArraySet.EMPTY_SET
An empty
CharArraySet . |
private CharArraySet |
CharArrayMap.keySet |
protected CharArraySet |
StopwordAnalyzerBase.stopwords
An immutable stopword set
|
private CharArraySet |
StopFilter.stopWords |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CharArraySet.copy(java.util.Set<?> set)
Returns a copy of the given set as a
CharArraySet . |
static CharArraySet |
WordlistLoader.getSnowballWordSet(java.io.Reader reader)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
WordlistLoader.getSnowballWordSet(java.io.Reader reader,
CharArraySet result)
Reads stopwords from a stopword list in Snowball format.
|
CharArraySet |
StopwordAnalyzerBase.getStopwordSet()
Returns the analyzer's stopword set or an empty set if the analyzer has no
stopwords
|
static CharArraySet |
WordlistLoader.getWordSet(java.io.Reader reader)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(java.io.Reader reader,
CharArraySet result)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(java.io.Reader reader,
java.lang.String comment)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(java.io.Reader reader,
java.lang.String comment,
CharArraySet result)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
CharArraySet |
CharArrayMap.keySet()
Returns an
CharArraySet view on the map's keys. |
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(boolean ignoreCase,
java.lang.Class<? extends Analyzer> aClass,
java.lang.String resource,
java.lang.String comment)
Creates a CharArraySet from a file resource associated with a class.
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(java.nio.file.Path stopwords)
Creates a CharArraySet from a path.
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(java.io.Reader stopwords)
Creates a CharArraySet from a file.
|
static CharArraySet |
StopFilter.makeStopSet(java.util.List<?> stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
StopFilter.makeStopSet(java.util.List<?> stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword list.
|
static CharArraySet |
StopFilter.makeStopSet(java.lang.String... stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
StopFilter.makeStopSet(java.lang.String[] stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword array.
|
static CharArraySet |
CharArraySet.unmodifiableSet(CharArraySet set)
Returns an unmodifiable
CharArraySet . |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
WordlistLoader.getSnowballWordSet(java.io.Reader reader,
CharArraySet result)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
WordlistLoader.getWordSet(java.io.Reader reader,
CharArraySet result)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(java.io.Reader reader,
java.lang.String comment,
CharArraySet result)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
CharArraySet.unmodifiableSet(CharArraySet set)
Returns an unmodifiable
CharArraySet . |
Constructor and Description |
---|
StopFilter(TokenStream in,
CharArraySet stopWords)
Constructs a filter which removes words from the input TokenStream that are
named in the Set.
|
StopwordAnalyzerBase(CharArraySet stopwords)
Creates a new instance initialized with the given stopword set
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
ArabicAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
ArabicAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ArabicAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
ArabicAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
ArabicAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop word.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
BulgarianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
BulgarianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BulgarianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
BulgarianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BulgarianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
BengaliAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
BengaliAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BengaliAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
BengaliAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
BengaliAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
BrazilianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
BrazilianAnalyzer.excltable
Contains words that should be indexed but not stemmed.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BrazilianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
BrazilianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
BrazilianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stemming exclusion words
|
Modifier and Type | Field and Description |
---|---|
private static CharArraySet |
CatalanAnalyzer.DEFAULT_ARTICLES |
(package private) static CharArraySet |
CatalanAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
CatalanAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CatalanAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
CatalanAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
CatalanAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
private CharArraySet |
HTMLStripCharFilter.escapedTags |
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
CJKAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CJKAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
CJKAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
SoraniAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
SoraniAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
SoraniAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
SoraniAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SoraniAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
SmartChineseAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
SmartChineseAnalyzer.stopWords |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
SmartChineseAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
(package private) static CharArraySet |
SmartChineseAnalyzer.DefaultSetHolder.loadDefaultStopWordSet() |
Constructor and Description |
---|
SmartChineseAnalyzer(CharArraySet stopWords)
Create a new SmartChineseAnalyzer, using the provided
Set of stopwords. |
Modifier and Type | Field and Description |
---|---|
private CharArraySet |
CommonGramsFilterFactory.commonWords |
private CharArraySet |
CommonGramsFilter.commonWords |
Modifier and Type | Method and Description |
---|---|
CharArraySet |
CommonGramsFilterFactory.getCommonWords() |
Constructor and Description |
---|
CommonGramsFilter(TokenStream input,
CharArraySet commonWords)
Construct a token stream filtering the given input using a Set of common
words to create bigrams.
|
Modifier and Type | Field and Description |
---|---|
protected CharArraySet |
CompoundWordTokenFilterBase.dictionary |
private CharArraySet |
HyphenationCompoundWordTokenFilterFactory.dictionary |
private CharArraySet |
DictionaryCompoundWordTokenFilterFactory.dictionary |
Constructor and Description |
---|
CompoundWordTokenFilterBase(TokenStream input,
CharArraySet dictionary) |
CompoundWordTokenFilterBase(TokenStream input,
CharArraySet dictionary,
boolean onlyLongestMatch) |
CompoundWordTokenFilterBase(TokenStream input,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch) |
DictionaryCompoundWordTokenFilter(TokenStream input,
CharArraySet dictionary)
Creates a new
DictionaryCompoundWordTokenFilter |
DictionaryCompoundWordTokenFilter(TokenStream input,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new
DictionaryCompoundWordTokenFilter |
HyphenationCompoundWordTokenFilter(TokenStream input,
HyphenationTree hyphenator,
CharArraySet dictionary)
Creates a new
HyphenationCompoundWordTokenFilter instance. |
HyphenationCompoundWordTokenFilter(TokenStream input,
HyphenationTree hyphenator,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new
HyphenationCompoundWordTokenFilter instance. |
Modifier and Type | Field and Description |
---|---|
private CharArraySet |
StopFilterFactory.stopWords |
Modifier and Type | Method and Description |
---|---|
CharArraySet |
StopFilterFactory.getStopWords() |
Constructor and Description |
---|
StopAnalyzer(CharArraySet stopWords)
Builds an analyzer with the stop words from the given set.
|
StopFilter(TokenStream in,
CharArraySet stopWords)
Constructs a filter which removes words from the input TokenStream that are
named in the Set.
|
Modifier and Type | Field and Description |
---|---|
private static CharArraySet |
CzechAnalyzer.DefaultSetHolder.DEFAULT_SET |
private CharArraySet |
CzechAnalyzer.stemExclusionTable |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CzechAnalyzer.getDefaultStopSet()
Returns a set of default Czech-stopwords
|
Constructor and Description |
---|
CzechAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
CzechAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionTable)
Builds an analyzer with the given stop words and a set of work to be
excluded from the
CzechStemFilter . |
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
DanishAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
DanishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
DanishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
DanishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
DanishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
private static CharArraySet |
GermanAnalyzer.DefaultSetHolder.DEFAULT_SET |
private CharArraySet |
GermanAnalyzer.exclusionSet
Contains words that should be indexed but not stemmed.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
GermanAnalyzer.getDefaultStopSet()
Returns a set of default German-stopwords
|
Constructor and Description |
---|
GermanAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
GermanAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
private static CharArraySet |
GreekAnalyzer.DefaultSetHolder.DEFAULT_SET |
private static CharArraySet |
GreekStemmer.exc12a |
private static CharArraySet |
GreekStemmer.exc12b |
private static CharArraySet |
GreekStemmer.exc13 |
private static CharArraySet |
GreekStemmer.exc14 |
private static CharArraySet |
GreekStemmer.exc15a |
private static CharArraySet |
GreekStemmer.exc15b |
private static CharArraySet |
GreekStemmer.exc16 |
private static CharArraySet |
GreekStemmer.exc17 |
private static CharArraySet |
GreekStemmer.exc18 |
private static CharArraySet |
GreekStemmer.exc19 |
private static CharArraySet |
GreekStemmer.exc4 |
private static CharArraySet |
GreekStemmer.exc6 |
private static CharArraySet |
GreekStemmer.exc7 |
private static CharArraySet |
GreekStemmer.exc8a |
private static CharArraySet |
GreekStemmer.exc8b |
private static CharArraySet |
GreekStemmer.exc9 |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
GreekAnalyzer.getDefaultStopSet()
Returns a set of default Greek-stopwords
|
Constructor and Description |
---|
GreekAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
EnglishAnalyzer.ENGLISH_STOP_WORDS_SET
An unmodifiable set containing some common English words that are not usually useful
for searching.
|
private CharArraySet |
EnglishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
EnglishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
EnglishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
EnglishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
SpanishAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
SpanishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
SpanishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
SpanishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SpanishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
BasqueAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
BasqueAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BasqueAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
BasqueAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BasqueAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
PersianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
PersianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
PersianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
FinnishAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
FinnishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
FinnishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
FinnishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
FinnishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
FrenchAnalyzer.DEFAULT_ARTICLES
Default set of articles for ElisionFilter
|
(package private) static CharArraySet |
FrenchAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
FrenchAnalyzer.excltable
Contains words that should be indexed but not stemmed.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
FrenchAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
FrenchAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
FrenchAnalyzer(CharArraySet stopwords,
CharArraySet stemExclutionSet)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
private static CharArraySet |
IrishAnalyzer.DEFAULT_ARTICLES |
(package private) static CharArraySet |
IrishAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private static CharArraySet |
IrishAnalyzer.HYPHENATIONS
When StandardTokenizer splits t‑athair into {t, athair}, we don't
want to cause a position increment, otherwise there will be problems
with phrase queries versus tAthair (which would not have a gap).
|
private CharArraySet |
IrishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
IrishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
IrishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
IrishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
GalicianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
GalicianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
GalicianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
GalicianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
GalicianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
HindiAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
HindiAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
HindiAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
HindiAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
HindiAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
HungarianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
HungarianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
HungarianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
HungarianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
HungarianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
ArmenianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
ArmenianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ArmenianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
ArmenianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ArmenianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
IndonesianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
IndonesianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
IndonesianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
IndonesianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
IndonesianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop word.
|
Modifier and Type | Field and Description |
---|---|
private static CharArraySet |
ItalianAnalyzer.DEFAULT_ARTICLES |
(package private) static CharArraySet |
ItalianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
ItalianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ItalianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
ItalianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ItalianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
LithuanianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
LithuanianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
LithuanianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
LithuanianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
LithuanianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
LatvianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
LatvianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
LatvianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
LatvianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
LatvianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) CharArraySet |
CapitalizationFilterFactory.keep |
private CharArraySet |
CapitalizationFilter.keep |
private CharArraySet |
SetKeywordMarkerFilter.keywordSet |
private CharArraySet |
RemoveDuplicatesTokenFilter.previous |
private CharArraySet |
ProtectedTermFilterFactory.protectedTerms |
private CharArraySet |
ProtectedTermFilter.protectedTerms |
private CharArraySet |
KeywordMarkerFilterFactory.protectedWords |
private CharArraySet |
WordDelimiterGraphFilterFactory.protectedWords |
private CharArraySet |
WordDelimiterFilterFactory.protectedWords
Deprecated.
|
(package private) CharArraySet |
WordDelimiterFilter.protWords
Deprecated.
If not null is the set of tokens to protect from being delimited
|
(package private) CharArraySet |
WordDelimiterGraphFilter.protWords
If not null is the set of tokens to protect from being delimited
|
private CharArraySet |
FingerprintFilter.uniqueTerms |
private CharArraySet |
KeepWordFilter.words |
private CharArraySet |
KeepWordFilterFactory.words |
Modifier and Type | Method and Description |
---|---|
CharArraySet |
ProtectedTermFilterFactory.getProtectedTerms() |
CharArraySet |
KeepWordFilterFactory.getWords() |
Constructor and Description |
---|
CapitalizationFilter(TokenStream in,
boolean onlyFirstWord,
CharArraySet keep,
boolean forceFirstLetter,
java.util.Collection<char[]> okPrefix,
int minWordLength,
int maxWordCount,
int maxTokenLength)
Creates a CapitalizationFilter with the specified parameters.
|
KeepWordFilter(TokenStream in,
CharArraySet words)
Create a new
KeepWordFilter . |
ProtectedTermFilter(CharArraySet protectedTerms,
TokenStream input,
java.util.function.Function<TokenStream,TokenStream> inputFactory)
Creates a new ProtectedTermFilter
|
SetKeywordMarkerFilter(TokenStream in,
CharArraySet keywordSet)
Create a new KeywordSetMarkerFilter, that marks the current token as a
keyword if the tokens term buffer is contained in the given set via the
KeywordAttribute . |
WordDelimiterFilter(TokenStream in,
byte[] charTypeTable,
int configurationFlags,
CharArraySet protWords)
Deprecated.
Creates a new WordDelimiterFilter
|
WordDelimiterFilter(TokenStream in,
int configurationFlags,
CharArraySet protWords)
Deprecated.
Creates a new WordDelimiterFilter using
WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE
as its charTypeTable |
WordDelimiterGraphFilter(TokenStream in,
boolean adjustInternalOffsets,
byte[] charTypeTable,
int configurationFlags,
CharArraySet protWords)
Creates a new WordDelimiterGraphFilter
|
WordDelimiterGraphFilter(TokenStream in,
int configurationFlags,
CharArraySet protWords)
Creates a new WordDelimiterGraphFilter using
WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE
as its charTypeTable |
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
DutchAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
DutchAnalyzer.excltable
Contains words that should be indexed but not stemmed.
|
private CharArraySet |
DutchAnalyzer.stoptable
Contains the stopwords used with the StopFilter.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
DutchAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
DutchAnalyzer(CharArraySet stopwords) |
DutchAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionTable) |
DutchAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionTable,
CharArrayMap<java.lang.String> stemOverrideDict) |
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
NorwegianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
NorwegianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
NorwegianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
NorwegianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
NorwegianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
PortugueseAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
protected CharArraySet |
RSLPStemmerBase.RuleWithSetExceptions.exceptions |
private CharArraySet |
PortugueseAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
PortugueseAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
PortugueseAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
PortugueseAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
RomanianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
RomanianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
RomanianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
RomanianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
RomanianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
RussianAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
RussianAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
RussianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
RussianAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words
|
RussianAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words
|
Modifier and Type | Field and Description |
---|---|
private CharArraySet |
SnowballPorterFilterFactory.protectedWords |
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
ClassicAnalyzer.STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not
useful for searching.
|
static CharArraySet |
UAX29URLEmailAnalyzer.STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not
useful for searching.
|
Constructor and Description |
---|
ClassicAnalyzer(CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
StandardAnalyzer(CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
UAX29URLEmailAnalyzer(CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
SwedishAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
SwedishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
SwedishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
SwedishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SwedishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
ThaiAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ThaiAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
ThaiAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
(package private) static CharArraySet |
TurkishAnalyzer.DefaultSetHolder.DEFAULT_STOP_SET |
private CharArraySet |
TurkishAnalyzer.stemExclusionSet |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
TurkishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
TurkishAnalyzer(CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
TurkishAnalyzer(CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
private CharArraySet |
ElisionFilterFactory.articles |
private CharArraySet |
ElisionFilter.articles |
Modifier and Type | Method and Description |
---|---|
protected CharArraySet |
AbstractAnalysisFactory.getSnowballWordSet(ResourceLoader loader,
java.lang.String wordFiles,
boolean ignoreCase)
same as
AbstractAnalysisFactory.getWordSet(ResourceLoader, String, boolean) ,
except the input is in snowball format. |
protected CharArraySet |
AbstractAnalysisFactory.getWordSet(ResourceLoader loader,
java.lang.String wordFiles,
boolean ignoreCase)
Returns as
CharArraySet from wordFiles, which
can be a comma-separated list of filenames |
Constructor and Description |
---|
ElisionFilter(TokenStream input,
CharArraySet articles)
Constructs an elision filter with a Set of stop words
|