Package | Description |
---|---|
org.apache.lucene.analysis |
Text analysis.
|
org.apache.lucene.analysis.compound |
A filter that decomposes compound words you find in many Germanic
languages into the word parts.
|
org.apache.lucene.analysis.sinks |
TeeSinkTokenFilter and implementations
of TeeSinkTokenFilter.SinkFilter that
might be useful. |
org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizers.
|
org.apache.lucene.analysis.standard.std40 |
Backwards-compatible implementation to match
Version.LUCENE_4_0 |
org.apache.lucene.analysis.tokenattributes |
General-purpose attributes for text analysis.
|
org.apache.lucene.collation.tokenattributes |
Custom
AttributeImpl for indexing collation keys as index terms. |
Modifier and Type | Class and Description |
---|---|
class |
MockUTF16TermAttributeImpl
Extension of
CharTermAttributeImpl that encodes the term
text as UTF-16 bytes instead of as UTF-8 bytes. |
class |
Token
Deprecated.
This class is outdated and no longer used since Lucene 2.9. Nuke it finally!
|
Modifier and Type | Field and Description |
---|---|
protected CharTermAttribute |
CompoundWordTokenFilterBase.termAtt |
protected CharTermAttribute |
Lucene43CompoundWordTokenFilterBase.termAtt
Deprecated.
|
Modifier and Type | Field and Description |
---|---|
protected CharTermAttribute |
DateRecognizerSinkFilter.termAtt |
Modifier and Type | Method and Description |
---|---|
void |
UAX29URLEmailTokenizerImpl.getText(CharTermAttribute t)
Fills CharTermAttribute with the current token text.
|
void |
StandardTokenizerImpl.getText(CharTermAttribute t)
Fills CharTermAttribute with the current token text.
|
Modifier and Type | Method and Description |
---|---|
void |
StandardTokenizerImpl40.getText(CharTermAttribute t)
Deprecated.
Fills CharTermAttribute with the current token text.
|
void |
UAX29URLEmailTokenizerImpl40.getText(CharTermAttribute t)
Deprecated.
Fills CharTermAttribute with the current token text.
|
Modifier and Type | Class and Description |
---|---|
class |
CharTermAttributeImpl
Default implementation of
CharTermAttribute . |
class |
PackedTokenAttributeImpl
Default implementation of the common attributes used by Lucene:
CharTermAttribute
TypeAttribute
PositionIncrementAttribute
PositionLengthAttribute
OffsetAttribute
|
Modifier and Type | Method and Description |
---|---|
CharTermAttribute |
CharTermAttribute.append(char c) |
CharTermAttribute |
CharTermAttributeImpl.append(char c) |
CharTermAttribute |
CharTermAttribute.append(CharSequence csq) |
CharTermAttribute |
CharTermAttributeImpl.append(CharSequence csq) |
CharTermAttribute |
CharTermAttribute.append(CharSequence csq,
int start,
int end) |
CharTermAttribute |
CharTermAttributeImpl.append(CharSequence csq,
int start,
int end) |
CharTermAttribute |
CharTermAttribute.append(CharTermAttribute termAtt)
Appends the contents of the other
CharTermAttribute to this character sequence. |
CharTermAttribute |
CharTermAttributeImpl.append(CharTermAttribute ta) |
CharTermAttribute |
CharTermAttribute.append(String s)
Appends the specified
String to this character sequence. |
CharTermAttribute |
CharTermAttributeImpl.append(String s) |
CharTermAttribute |
CharTermAttribute.append(StringBuilder sb)
Appends the specified
StringBuilder to this character sequence. |
CharTermAttribute |
CharTermAttributeImpl.append(StringBuilder s) |
CharTermAttribute |
CharTermAttribute.setEmpty()
Sets the length of the termBuffer to zero.
|
CharTermAttribute |
CharTermAttributeImpl.setEmpty() |
CharTermAttribute |
CharTermAttribute.setLength(int length)
Set number of valid characters (length of the term) in
the termBuffer array.
|
CharTermAttribute |
CharTermAttributeImpl.setLength(int length) |
Modifier and Type | Method and Description |
---|---|
CharTermAttribute |
CharTermAttribute.append(CharTermAttribute termAtt)
Appends the contents of the other
CharTermAttribute to this character sequence. |
CharTermAttribute |
CharTermAttributeImpl.append(CharTermAttribute ta) |
Modifier and Type | Class and Description |
---|---|
class |
CollatedTermAttributeImpl
Extension of
CharTermAttributeImpl that encodes the term
text as a binary Unicode collation key instead of as UTF-8 bytes. |
class |
ICUCollatedTermAttributeImpl
Extension of
CharTermAttributeImpl that encodes the term
text as a binary Unicode collation key instead of as UTF-8 bytes. |
Copyright © 2000–2015 The Apache Software Foundation. All rights reserved.