Package | Description |
---|---|
org.apache.lucene.analysis |
Text analysis.
|
org.apache.lucene.analysis.ar |
Analyzer for Arabic.
|
org.apache.lucene.analysis.bg |
Analyzer for Bulgarian.
|
org.apache.lucene.analysis.bn |
Analyzer for Bengali.
|
org.apache.lucene.analysis.br |
Analyzer for Brazilian Portuguese.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
|
org.apache.lucene.analysis.ckb |
Analyzer for Sorani Kurdish.
|
org.apache.lucene.analysis.cn.smart |
Analyzer for Simplified Chinese, which indexes words.
|
org.apache.lucene.analysis.commongrams |
Construct n-grams for frequently occurring terms and phrases.
|
org.apache.lucene.analysis.compound |
A filter that decomposes compound words, common in many Germanic
languages, into their component parts.
|
org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
org.apache.lucene.analysis.cz |
Analyzer for Czech.
|
org.apache.lucene.analysis.de |
Analyzer for German.
|
org.apache.lucene.analysis.el |
Analyzer for Greek.
|
org.apache.lucene.analysis.en |
Analyzer for English.
|
org.apache.lucene.analysis.es |
Analyzer for Spanish.
|
org.apache.lucene.analysis.fa |
Analyzer for Persian.
|
org.apache.lucene.analysis.fi |
Analyzer for Finnish.
|
org.apache.lucene.analysis.fr |
Analyzer for French.
|
org.apache.lucene.analysis.ga |
Analyzer for Irish.
|
org.apache.lucene.analysis.gl |
Analyzer for Galician.
|
org.apache.lucene.analysis.hi |
Analyzer for Hindi.
|
org.apache.lucene.analysis.hu |
Analyzer for Hungarian.
|
org.apache.lucene.analysis.hunspell |
Stemming TokenFilter using a Java implementation of the
Hunspell stemming algorithm.
|
org.apache.lucene.analysis.id |
Analyzer for Indonesian.
|
org.apache.lucene.analysis.in |
Analyzer for Indian languages.
|
org.apache.lucene.analysis.it |
Analyzer for Italian.
|
org.apache.lucene.analysis.lv |
Analyzer for Latvian.
|
org.apache.lucene.analysis.minhash |
MinHash filtering (for LSH).
|
org.apache.lucene.analysis.miscellaneous |
Miscellaneous TokenStreams.
|
org.apache.lucene.analysis.ngram |
Character n-gram tokenizers and filters.
|
org.apache.lucene.analysis.no |
Analyzer for Norwegian.
|
org.apache.lucene.analysis.path |
Analysis components for path-like strings such as filenames.
|
org.apache.lucene.analysis.pattern |
Set of components for pattern-based (regex) analysis.
|
org.apache.lucene.analysis.payloads |
Provides various convenience classes for creating payloads on Tokens.
|
org.apache.lucene.analysis.pt |
Analyzer for Portuguese.
|
org.apache.lucene.analysis.reverse |
Filter to reverse token text.
|
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.shingle |
Word n-gram filters.
|
org.apache.lucene.analysis.snowball |
TokenFilter and Analyzer implementations that use Snowball
stemmers. |
org.apache.lucene.analysis.sr |
Analyzer for Serbian.
|
org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizer.
StandardTokenizer implements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in
Unicode Standard Annex #29. |
org.apache.lucene.analysis.sv |
Analyzer for Swedish.
|
org.apache.lucene.analysis.synonym |
Analysis components for Synonyms.
|
org.apache.lucene.analysis.th |
Analyzer for Thai.
|
org.apache.lucene.analysis.tokenattributes |
General-purpose attributes for text analysis.
|
org.apache.lucene.analysis.tr |
Analyzer for Turkish.
|
org.apache.lucene.analysis.util |
Utility functions for text analysis.
|
org.apache.lucene.analysis.wikipedia |
Tokenizer that is aware of Wikipedia syntax.
|
org.apache.lucene.collation.tokenattributes |
Custom AttributeImpl for indexing collation keys as index terms. |
org.apache.lucene.document |
The logical representation of a Document for indexing and searching. |
org.apache.lucene.search.highlight |
Highlighting search terms.
|
org.apache.lucene.search.uhighlight |
The UnifiedHighlighter: a flexible highlighter that can get offsets from postings, term vectors, or analysis.
|
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
LowerCaseFilter.termAtt |
private CharTermAttribute |
StopFilter.termAtt |
private CharTermAttribute |
Analyzer.StringTokenStream.termAttribute |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ArabicNormalizationFilter.termAtt |
private CharTermAttribute |
ArabicStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
BulgarianStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
BengaliNormalizationFilter.termAtt |
private CharTermAttribute |
BengaliStemFilter.termAttribute |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
BrazilianStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
CJKWidthFilter.termAtt |
private CharTermAttribute |
CJKBigramFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
SoraniStemFilter.termAtt |
private CharTermAttribute |
SoraniNormalizationFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
HMMChineseTokenizer.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
CommonGramsFilter.termAttribute |
Modifier and Type | Field and Description |
---|---|
protected CharTermAttribute |
CompoundWordTokenFilterBase.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
KeywordTokenizer.termAtt |
private CharTermAttribute |
DecimalDigitFilter.termAtt |
private CharTermAttribute |
UpperCaseFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
CzechStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
GermanStemFilter.termAtt |
private CharTermAttribute |
GermanLightStemFilter.termAtt |
private CharTermAttribute |
GermanMinimalStemFilter.termAtt |
private CharTermAttribute |
GermanNormalizationFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
GreekLowerCaseFilter.termAtt |
private CharTermAttribute |
GreekStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
PorterStemFilter.termAtt |
private CharTermAttribute |
EnglishPossessiveFilter.termAtt |
private CharTermAttribute |
EnglishMinimalStemFilter.termAtt |
private CharTermAttribute |
KStemFilter.termAttribute |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
SpanishLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
PersianNormalizationFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
FinnishLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
FrenchLightStemFilter.termAtt |
private CharTermAttribute |
FrenchMinimalStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
IrishLowerCaseFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
GalicianMinimalStemFilter.termAtt |
private CharTermAttribute |
GalicianStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
HindiStemFilter.termAtt |
private CharTermAttribute |
HindiNormalizationFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
HungarianLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
HunspellStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
IndonesianStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
IndicNormalizationFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ItalianLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
LatvianStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
MinHashFilter.termAttribute |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ScandinavianNormalizationFilter.charTermAttribute |
private CharTermAttribute |
ScandinavianFoldingFilter.charTermAttribute |
private CharTermAttribute |
ConcatenateGraphFilter.charTermAttribute |
private CharTermAttribute |
DelimitedTermFrequencyTokenFilter.termAtt |
private CharTermAttribute |
LengthFilter.termAtt |
private CharTermAttribute |
ASCIIFoldingFilter.termAtt |
private CharTermAttribute |
PatternKeywordMarkerFilter.termAtt |
private CharTermAttribute |
SetKeywordMarkerFilter.termAtt |
private CharTermAttribute |
KeepWordFilter.termAtt |
private CharTermAttribute |
TrimFilter.termAtt |
private CharTermAttribute |
ProtectedTermFilter.termAtt |
private CharTermAttribute |
DateRecognizerFilter.termAtt |
private CharTermAttribute |
CodepointCountFilter.termAtt |
private CharTermAttribute |
StemmerOverrideFilter.termAtt |
private CharTermAttribute |
CapitalizationFilter.termAtt |
private CharTermAttribute |
TypeAsSynonymFilter.termAtt |
private CharTermAttribute |
WordDelimiterFilter.termAttribute
Deprecated.
|
private CharTermAttribute |
RemoveDuplicatesTokenFilter.termAttribute |
private CharTermAttribute |
TruncateTokenFilter.termAttribute |
private CharTermAttribute |
FingerprintFilter.termAttribute |
private CharTermAttribute |
HyphenatedWordsFilter.termAttribute |
private CharTermAttribute |
WordDelimiterGraphFilter.termAttribute |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
EdgeNGramTokenFilter.termAtt |
private CharTermAttribute |
NGramTokenFilter.termAtt |
private CharTermAttribute |
NGramTokenizer.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
NorwegianMinimalStemFilter.termAtt |
private CharTermAttribute |
NorwegianLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ReversePathHierarchyTokenizer.termAtt |
private CharTermAttribute |
PathHierarchyTokenizer.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
PatternCaptureGroupTokenFilter.charTermAttr |
private CharTermAttribute |
SimplePatternSplitTokenizer.termAtt |
private CharTermAttribute |
SimplePatternTokenizer.termAtt |
private CharTermAttribute |
PatternTokenizer.termAtt |
private CharTermAttribute |
PatternReplaceFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
DelimitedPayloadTokenFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
PortugueseStemFilter.termAtt |
private CharTermAttribute |
PortugueseLightStemFilter.termAtt |
private CharTermAttribute |
PortugueseMinimalStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ReverseStringFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
RussianLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
FixedShingleFilter.buffer |
private CharTermAttribute |
ShingleFilter.termAtt |
(package private) CharTermAttribute |
ShingleFilter.InputWindowToken.termAtt |
private CharTermAttribute |
FixedShingleFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
SnowballFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
SerbianNormalizationRegularFilter.termAtt |
private CharTermAttribute |
SerbianNormalizationFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ClassicTokenizer.termAtt |
private CharTermAttribute |
ClassicFilter.termAtt |
private CharTermAttribute |
UAX29URLEmailTokenizer.termAtt |
private CharTermAttribute |
StandardTokenizer.termAtt |
Modifier and Type | Method and Description |
---|---|
void |
UAX29URLEmailTokenizerImpl.getText(CharTermAttribute t)
Fills CharTermAttribute with the current token text.
|
void |
ClassicTokenizerImpl.getText(CharTermAttribute t)
Fills CharTermAttribute with the current token text.
|
void |
StandardTokenizerImpl.getText(CharTermAttribute t)
Fills CharTermAttribute with the current token text.
|
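The `getText(CharTermAttribute t)` methods above fill a caller-supplied attribute rather than returning a new string. A minimal sketch of that pattern follows, assuming a whitespace scanner and using `StringBuilder` as a stand-in for the attribute. `ScannerSketch` and its fields are hypothetical names for illustration and are not taken from the Lucene tokenizer implementations.

```java
/**
 * Hypothetical whitespace scanner (not Lucene code) that copies the current
 * token into a reusable, caller-owned buffer.
 */
final class ScannerSketch {
    private final char[] input;
    private int tokenStart = 0; // inclusive start of the current token
    private int tokenEnd = 0;   // exclusive end of the current token

    ScannerSketch(String text) {
        input = text.toCharArray();
    }

    /** Advances to the next whitespace-delimited token; returns false at end of input. */
    boolean next() {
        int i = tokenEnd;
        while (i < input.length && Character.isWhitespace(input[i])) {
            i++;
        }
        if (i == input.length) {
            return false;
        }
        tokenStart = i;
        while (i < input.length && !Character.isWhitespace(input[i])) {
            i++;
        }
        tokenEnd = i;
        return true;
    }

    /** Overwrites target with the current token text; no intermediate String is created. */
    void getText(StringBuilder target) {
        target.setLength(0);
        target.append(input, tokenStart, tokenEnd - tokenStart);
    }
}
```

Because the same target is reused for every token, a consumer that needs to keep a token's text beyond the next call has to copy it out first.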
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
SwedishLightStemFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
SynonymGraphFilter.termAtt |
private CharTermAttribute |
SynonymFilter.termAtt
Deprecated.
|
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
ThaiTokenizer.termAtt |
Modifier and Type | Class and Description |
---|---|
class |
CharTermAttributeImpl
Default implementation of CharTermAttribute. |
class |
PackedTokenAttributeImpl
Default implementation of the common attributes used by Lucene:
CharTermAttribute, TypeAttribute, PositionIncrementAttribute, PositionLengthAttribute, OffsetAttribute, and TermFrequencyAttribute.
|
Modifier and Type | Method and Description |
---|---|
CharTermAttribute |
CharTermAttributeImpl.append(char c) |
CharTermAttribute |
CharTermAttribute.append(char c) |
CharTermAttribute |
CharTermAttributeImpl.append(java.lang.CharSequence csq) |
CharTermAttribute |
CharTermAttribute.append(java.lang.CharSequence csq) |
CharTermAttribute |
CharTermAttributeImpl.append(java.lang.CharSequence csq,
int start,
int end) |
CharTermAttribute |
CharTermAttribute.append(java.lang.CharSequence csq,
int start,
int end) |
CharTermAttribute |
CharTermAttributeImpl.append(CharTermAttribute ta) |
CharTermAttribute |
CharTermAttribute.append(CharTermAttribute termAtt)
Appends the contents of the other CharTermAttribute to this character sequence. |
CharTermAttribute |
CharTermAttributeImpl.append(java.lang.String s) |
CharTermAttribute |
CharTermAttribute.append(java.lang.String s)
Appends the specified String to this character sequence. |
CharTermAttribute |
CharTermAttributeImpl.append(java.lang.StringBuilder s) |
CharTermAttribute |
CharTermAttribute.append(java.lang.StringBuilder sb)
Appends the specified StringBuilder to this character sequence. |
private CharTermAttribute |
CharTermAttributeImpl.appendNull() |
CharTermAttribute |
CharTermAttributeImpl.setEmpty() |
CharTermAttribute |
CharTermAttribute.setEmpty()
Sets the length of the termBuffer to zero.
|
CharTermAttribute |
CharTermAttributeImpl.setLength(int length) |
CharTermAttribute |
CharTermAttribute.setLength(int length)
Sets the number of valid characters (length of the term) in
the termBuffer array.
|
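Every method in the table above returns `CharTermAttribute`, which allows calls to be chained in the style of `StringBuilder`. The sketch below models that shape in plain Java so it runs without Lucene on the classpath. `TermBufferSketch` and `TermBufferDemo` are hypothetical stand-ins, and the growth policy and bounds checks are assumptions rather than the behavior of `CharTermAttributeImpl`.

```java
import java.util.Arrays;

/**
 * Simplified stand-in for a reusable term buffer (hypothetical, not Lucene code).
 * Each mutator returns this, which is what makes chaining possible.
 */
final class TermBufferSketch implements CharSequence, Appendable {
    private char[] buffer = new char[16]; // backing array, grown on demand
    private int length = 0;               // number of valid chars in buffer

    /** Resets the logical length to zero; the backing array is kept for reuse. */
    public TermBufferSketch setEmpty() {
        length = 0;
        return this;
    }

    /** Sets the number of valid characters; it must fit in the current array. */
    public TermBufferSketch setLength(int newLength) {
        if (newLength < 0 || newLength > buffer.length) {
            throw new IllegalArgumentException("length " + newLength + " is outside the buffer");
        }
        length = newLength;
        return this;
    }

    @Override
    public TermBufferSketch append(char c) {
        ensureCapacity(length + 1);
        buffer[length++] = c;
        return this;
    }

    @Override
    public TermBufferSketch append(CharSequence csq) {
        CharSequence s = (csq == null) ? "null" : csq; // Appendable contract for null
        return append(s, 0, s.length());
    }

    @Override
    public TermBufferSketch append(CharSequence csq, int start, int end) {
        CharSequence s = (csq == null) ? "null" : csq;
        if (start < 0 || end > s.length() || start > end) {
            throw new IndexOutOfBoundsException();
        }
        ensureCapacity(length + (end - start));
        for (int i = start; i < end; i++) {
            buffer[length++] = s.charAt(i);
        }
        return this;
    }

    private void ensureCapacity(int needed) {
        if (needed > buffer.length) {
            buffer = Arrays.copyOf(buffer, Math.max(needed, buffer.length * 2));
        }
    }

    @Override
    public int length() {
        return length;
    }

    @Override
    public char charAt(int index) {
        if (index < 0 || index >= length) {
            throw new IndexOutOfBoundsException();
        }
        return buffer[index];
    }

    @Override
    public CharSequence subSequence(int start, int end) {
        if (start < 0 || end > length || start > end) {
            throw new IndexOutOfBoundsException();
        }
        return new String(buffer, start, end - start);
    }

    @Override
    public String toString() {
        return new String(buffer, 0, length);
    }
}

final class TermBufferDemo {
    public static void main(String[] args) {
        TermBufferSketch term = new TermBufferSketch();
        term.setEmpty().append("token").append('-').append("stream", 0, 3);
        System.out.println(term); // prints "token-str"
        term.setLength(5);        // truncate to the first five characters
        System.out.println(term); // prints "token"
    }
}
```

Starting each token with `setEmpty()` and then appending lets one buffer be reused across tokens instead of allocating a new string for each.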
Modifier and Type | Method and Description |
---|---|
CharTermAttribute |
CharTermAttributeImpl.append(CharTermAttribute ta) |
CharTermAttribute |
CharTermAttribute.append(CharTermAttribute termAtt)
Appends the contents of the other CharTermAttribute to this character sequence. |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
TurkishLowerCaseFilter.termAtt |
private CharTermAttribute |
ApostropheFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
CharTokenizer.termAtt |
private CharTermAttribute |
ElisionFilter.termAtt |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
WikipediaTokenizer.termAtt |
Modifier and Type | Method and Description |
---|---|
(package private) void |
WikipediaTokenizerImpl.getText(CharTermAttribute t)
Fills Lucene token with the current token text.
|
Modifier and Type | Class and Description |
---|---|
class |
CollatedTermAttributeImpl
Extension of CharTermAttributeImpl that encodes the term
text as a binary Unicode collation key instead of as UTF-8 bytes. |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
FeatureField.FeatureTokenStream.termAttribute |
private CharTermAttribute |
Field.StringTokenStream.termAttribute |
Modifier and Type | Field and Description |
---|---|
private CharTermAttribute |
QueryScorer.termAtt |
private CharTermAttribute |
QueryTermScorer.termAtt |
private CharTermAttribute |
SimpleSpanFragmenter.termAtt |
private CharTermAttribute |
TokenGroup.termAtt |
private CharTermAttribute |
TokenStreamFromTermVector.termAttribute |
Modifier and Type | Field and Description |
---|---|
(package private) CharTermAttribute |
TokenStreamOffsetStrategy.TokenStreamOffsetsEnum.charTermAtt |