Package | Description |
---|---|
org.apache.lucene.analysis | Text analysis. |
org.apache.lucene.analysis.ar | Analyzer for Arabic. |
org.apache.lucene.analysis.bg | Analyzer for Bulgarian. |
org.apache.lucene.analysis.bn | Analyzer for Bengali. |
org.apache.lucene.analysis.br | Analyzer for Brazilian Portuguese. |
org.apache.lucene.analysis.ca | Analyzer for Catalan. |
org.apache.lucene.analysis.cjk | Analyzer for Chinese, Japanese, and Korean, which indexes bigrams. |
org.apache.lucene.analysis.ckb | Analyzer for Sorani Kurdish. |
org.apache.lucene.analysis.cn.smart | Analyzer for Simplified Chinese, which indexes words. |
org.apache.lucene.analysis.core | Basic, general-purpose analysis components. |
org.apache.lucene.analysis.custom | A general-purpose Analyzer that can be created with a builder-style API. |
org.apache.lucene.analysis.cz | Analyzer for Czech. |
org.apache.lucene.analysis.da | Analyzer for Danish. |
org.apache.lucene.analysis.de | Analyzer for German. |
org.apache.lucene.analysis.el | Analyzer for Greek. |
org.apache.lucene.analysis.en | Analyzer for English. |
org.apache.lucene.analysis.es | Analyzer for Spanish. |
org.apache.lucene.analysis.eu | Analyzer for Basque. |
org.apache.lucene.analysis.fa | Analyzer for Persian. |
org.apache.lucene.analysis.fi | Analyzer for Finnish. |
org.apache.lucene.analysis.fr | Analyzer for French. |
org.apache.lucene.analysis.ga | Analyzer for Irish. |
org.apache.lucene.analysis.gl | Analyzer for Galician. |
org.apache.lucene.analysis.hi | Analyzer for Hindi. |
org.apache.lucene.analysis.hu | Analyzer for Hungarian. |
org.apache.lucene.analysis.hy | Analyzer for Armenian. |
org.apache.lucene.analysis.id | Analyzer for Indonesian. |
org.apache.lucene.analysis.it | Analyzer for Italian. |
org.apache.lucene.analysis.lt | Analyzer for Lithuanian. |
org.apache.lucene.analysis.lv | Analyzer for Latvian. |
org.apache.lucene.analysis.miscellaneous | Miscellaneous TokenStreams. |
org.apache.lucene.analysis.nl | Analyzer for Dutch. |
org.apache.lucene.analysis.no | Analyzer for Norwegian. |
org.apache.lucene.analysis.pt | Analyzer for Portuguese. |
org.apache.lucene.analysis.query | Automatically filter high-frequency stopwords. |
org.apache.lucene.analysis.ro | Analyzer for Romanian. |
org.apache.lucene.analysis.ru | Analyzer for Russian. |
org.apache.lucene.analysis.shingle | Word n-gram filters. |
org.apache.lucene.analysis.standard | Fast, general-purpose grammar-based tokenizer. StandardTokenizer implements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. |
org.apache.lucene.analysis.sv | Analyzer for Swedish. |
org.apache.lucene.analysis.th | Analyzer for Thai. |
org.apache.lucene.analysis.tr | Analyzer for Turkish. |
org.apache.lucene.collation | Unicode collation support. |

Modifier and Type | Method and Description |
---|---|
protected abstract Analyzer.TokenStreamComponents | Analyzer.createComponents(java.lang.String fieldName) Creates a new Analyzer.TokenStreamComponents instance for this analyzer. |
protected Analyzer.TokenStreamComponents | AnalyzerWrapper.createComponents(java.lang.String fieldName) |
Analyzer.TokenStreamComponents | DelegatingAnalyzerWrapper.DelegatingReuseStrategy.getReusableComponents(Analyzer analyzer, java.lang.String fieldName) |
abstract Analyzer.TokenStreamComponents | Analyzer.ReuseStrategy.getReusableComponents(Analyzer analyzer, java.lang.String fieldName) Gets the reusable TokenStreamComponents for the field with the given name. |
protected Analyzer.TokenStreamComponents | DelegatingAnalyzerWrapper.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |
protected Analyzer.TokenStreamComponents | AnalyzerWrapper.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) Wraps / alters the given TokenStreamComponents, taken from the wrapped Analyzer, to form new components. |

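
The relationship between createComponents and wrapComponents listed above can be illustrated with a small stand-alone model. This is only a sketch: `WrapSketch`, `Components`, `SketchAnalyzer`, `WhitespaceSketch`, and `LowercasingWrapper` are hypothetical stand-ins written for this illustration, not Lucene classes, and tokens are modelled as plain strings rather than a TokenStream.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.Function;

/** Stand-in model only; none of these types are Lucene classes. */
final class WrapSketch {

    /** Hypothetical stand-in for Analyzer.TokenStreamComponents: a text-to-tokens function. */
    static final class Components {
        final Function<String, List<String>> tokenize;

        Components(Function<String, List<String>> tokenize) {
            this.tokenize = tokenize;
        }
    }

    /** Hypothetical stand-in for an analyzer with a per-field factory method. */
    abstract static class SketchAnalyzer {
        protected abstract Components createComponents(String fieldName);

        final List<String> analyze(String fieldName, String text) {
            return createComponents(fieldName).tokenize.apply(text);
        }
    }

    /** Creates components that split the text on runs of whitespace. */
    static final class WhitespaceSketch extends SketchAnalyzer {
        @Override
        protected Components createComponents(String fieldName) {
            return new Components(text -> {
                List<String> tokens = new ArrayList<>();
                for (String t : text.trim().split("\\s+")) {
                    if (!t.isEmpty()) {
                        tokens.add(t);
                    }
                }
                return tokens;
            });
        }
    }

    /** Wrapper: takes the delegate's components and returns altered ones. */
    static final class LowercasingWrapper extends SketchAnalyzer {
        private final SketchAnalyzer delegate;

        LowercasingWrapper(SketchAnalyzer delegate) {
            this.delegate = delegate;
        }

        @Override
        protected Components createComponents(String fieldName) {
            return wrapComponents(fieldName, delegate.createComponents(fieldName));
        }

        /** Forms new components by lowercasing every token of the wrapped ones. */
        protected Components wrapComponents(String fieldName, Components components) {
            return new Components(text -> {
                List<String> out = new ArrayList<>();
                for (String t : components.tokenize.apply(text)) {
                    out.add(t.toLowerCase(Locale.ROOT));
                }
                return out;
            });
        }
    }

    public static void main(String[] args) {
        SketchAnalyzer analyzer = new LowercasingWrapper(new WhitespaceSketch());
        System.out.println(analyzer.analyze("body", "Hello  Lucene World"));
    }
}
```

In this model the wrapper never tokenizes text itself; it only decorates whatever the wrapped analyzer produced, which is the division of labour the wrapComponents description suggests.
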
Modifier and Type | Method and Description |
---|---|
void | DelegatingAnalyzerWrapper.DelegatingReuseStrategy.setReusableComponents(Analyzer analyzer, java.lang.String fieldName, Analyzer.TokenStreamComponents components) |
abstract void | Analyzer.ReuseStrategy.setReusableComponents(Analyzer analyzer, java.lang.String fieldName, Analyzer.TokenStreamComponents components) Stores the given TokenStreamComponents as the reusable components for the field with the given name. |
protected Analyzer.TokenStreamComponents | DelegatingAnalyzerWrapper.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |
protected Analyzer.TokenStreamComponents | AnalyzerWrapper.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) Wraps / alters the given TokenStreamComponents, taken from the wrapped Analyzer, to form new components. |

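
The get/set pair on ReuseStrategy describes a lookup-then-store pattern, which the following stand-alone model tries to make concrete. It is a sketch under stated assumptions: `ReuseSketch`, `Components`, `PerFieldStrategy`, and `componentsFor` are hypothetical names invented here, and the model omits the Analyzer argument that the real method signatures take.

```java
import java.util.HashMap;
import java.util.Map;

/** Stand-in model only; none of these types are Lucene classes. */
final class ReuseSketch {

    /** Hypothetical stand-in for Analyzer.TokenStreamComponents. */
    static final class Components {
        final String fieldName;

        Components(String fieldName) {
            this.fieldName = fieldName;
        }
    }

    /** Hypothetical strategy that keeps one Components instance per field name. */
    static final class PerFieldStrategy {
        private final Map<String, Components> cache = new HashMap<>();

        /** Returns the stored components, or null when nothing was stored yet. */
        Components getReusableComponents(String fieldName) {
            return cache.get(fieldName);
        }

        /** Stores the given components as the reusable ones for the field. */
        void setReusableComponents(String fieldName, Components components) {
            cache.put(fieldName, components);
        }
    }

    private final PerFieldStrategy strategy = new PerFieldStrategy();
    int created = 0; // counts how often new components were actually built

    /** Looks up reusable components first and creates them only on a miss. */
    Components componentsFor(String fieldName) {
        Components components = strategy.getReusableComponents(fieldName);
        if (components == null) {
            components = new Components(fieldName);
            created++;
            strategy.setReusableComponents(fieldName, components);
        }
        return components;
    }

    public static void main(String[] args) {
        ReuseSketch sketch = new ReuseSketch();
        sketch.componentsFor("title");
        sketch.componentsFor("title"); // served from the cache
        sketch.componentsFor("body");
        System.out.println(sketch.created);
    }
}
```

Keying the cache by field name is one possible policy; a strategy could just as well return the same components for every field, and the caller would not need to change.
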
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ArabicAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
Analyzer.TokenStreamComponents | BulgarianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | BengaliAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | BrazilianAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | CatalanAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | CJKAnalyzer.createComponents(java.lang.String fieldName) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | SoraniAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
Analyzer.TokenStreamComponents | SmartChineseAnalyzer.createComponents(java.lang.String fieldName) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | UnicodeWhitespaceAnalyzer.createComponents(java.lang.String fieldName) |
protected Analyzer.TokenStreamComponents | StopAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |
protected Analyzer.TokenStreamComponents | SimpleAnalyzer.createComponents(java.lang.String fieldName) |
protected Analyzer.TokenStreamComponents | WhitespaceAnalyzer.createComponents(java.lang.String fieldName) |
protected Analyzer.TokenStreamComponents | KeywordAnalyzer.createComponents(java.lang.String fieldName) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | CustomAnalyzer.createComponents(java.lang.String fieldName) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | CzechAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | DanishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | GermanAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | GreekAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | EnglishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | SpanishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | BasqueAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | PersianAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | FinnishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | FrenchAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | IrishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | GalicianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | HindiAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | HungarianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ArmenianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | IndonesianAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ItalianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | LithuanianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | LatvianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | LimitTokenCountAnalyzer.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | LimitTokenCountAnalyzer.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | DutchAnalyzer.createComponents(java.lang.String fieldName) Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | NorwegianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | PortugueseAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | QueryAutoStopWordAnalyzer.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | QueryAutoStopWordAnalyzer.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | RomanianAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | RussianAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ShingleAnalyzerWrapper.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ShingleAnalyzerWrapper.wrapComponents(java.lang.String fieldName, Analyzer.TokenStreamComponents components) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ClassicAnalyzer.createComponents(java.lang.String fieldName) |
protected Analyzer.TokenStreamComponents | UAX29URLEmailAnalyzer.createComponents(java.lang.String fieldName) |
protected Analyzer.TokenStreamComponents | StandardAnalyzer.createComponents(java.lang.String fieldName) |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | SwedishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | ThaiAnalyzer.createComponents(java.lang.String fieldName) Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | TurkishAnalyzer.createComponents(java.lang.String fieldName) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |

Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | CollationKeyAnalyzer.createComponents(java.lang.String fieldName) |