Package | Description |
---|---|
org.apache.lucene.analysis | Text analysis. |
org.apache.lucene.analysis.cjk | Analyzer for Chinese, Japanese, and Korean, which indexes bigrams. |
org.apache.lucene.analysis.cn.smart | Analyzer for Simplified Chinese, which indexes words. |
org.apache.lucene.analysis.commongrams | Construct n-grams for frequently occurring terms and phrases. |
org.apache.lucene.analysis.compound | A filter that decomposes the compound words found in many Germanic languages into their word parts. |
org.apache.lucene.analysis.core | Basic, general-purpose analysis components. |
org.apache.lucene.analysis.minhash | MinHash filtering (for LSH). |
org.apache.lucene.analysis.miscellaneous | Miscellaneous TokenStreams. |
org.apache.lucene.analysis.ngram | Character n-gram tokenizers and filters. |
org.apache.lucene.analysis.path | Analysis components for path-like strings such as filenames. |
org.apache.lucene.analysis.pattern | Set of components for pattern-based (regex) analysis. |
org.apache.lucene.analysis.payloads | Provides various convenience classes for creating payloads on Tokens. |
org.apache.lucene.analysis.shingle | Word n-gram filters. |
org.apache.lucene.analysis.standard | Fast, general-purpose grammar-based tokenizer. StandardTokenizer implements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. |
org.apache.lucene.analysis.synonym | Analysis components for Synonyms. |
org.apache.lucene.analysis.th | Analyzer for Thai. |
org.apache.lucene.analysis.tokenattributes | General-purpose attributes for text analysis. |
org.apache.lucene.analysis.util | Utility functions for text analysis. |
org.apache.lucene.analysis.wikipedia | Tokenizer that is aware of Wikipedia syntax. |
org.apache.lucene.document | The logical representation of a Document for indexing and searching. |
org.apache.lucene.index | Code to maintain and access indices. |
org.apache.lucene.search.highlight | Highlighting search terms. |
org.apache.lucene.search.uhighlight | The UnifiedHighlighter, a flexible highlighter that can get offsets from postings, term vectors, or analysis. |
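
To make the "indexes bigrams" description and the role of offsets concrete, here is a minimal sketch in plain Java with no Lucene dependency. The `BigramOffsets` class, its `Token` type, and the `bigrams` method are hypothetical names invented for this illustration. They only model the idea that each emitted token carries a start and end character offset into the original text, which is the information an `OffsetAttribute` holds. Lucene's real CJK analysis does more (script detection, handling of isolated characters, and so on), none of which is modelled here.

```java
import java.util.ArrayList;
import java.util.List;

public class BigramOffsets {
    /** Hypothetical token: text plus [startOffset, endOffset) into the input. */
    static final class Token {
        final String text;
        final int startOffset;
        final int endOffset;

        Token(String text, int startOffset, int endOffset) {
            this.text = text;
            this.startOffset = startOffset;
            this.endOffset = endOffset;
        }
    }

    /**
     * Emits overlapping two-character tokens. Offsets are Java char indices,
     * so this sketch assumes no surrogate pairs; inputs shorter than two
     * characters produce no tokens.
     */
    static List<Token> bigrams(String input) {
        List<Token> tokens = new ArrayList<>();
        for (int i = 0; i + 2 <= input.length(); i++) {
            tokens.add(new Token(input.substring(i, i + 2), i, i + 2));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Three CJK characters, written as escapes to stay encoding-neutral.
        for (Token t : bigrams("\u6771\u4EAC\u90FD")) {
            System.out.println(t.text + " [" + t.startOffset + "," + t.endOffset + ")");
        }
    }
}
```

For a three-character input the sketch yields two overlapping tokens, covering offsets 0 to 2 and 1 to 3. Keeping offsets alongside the token text is what lets later stages, such as highlighting, map a matched term back to its position in the source string.
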

Fields in org.apache.lucene.analysis declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | GraphTokenFilter.offsetAtt |
private OffsetAttribute | Analyzer.StringTokenStream.offsetAttribute |

Fields in org.apache.lucene.analysis.cjk declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | CJKBigramFilter.offsetAtt |

Fields in org.apache.lucene.analysis.cn.smart declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | HMMChineseTokenizer.offsetAtt |

Fields in org.apache.lucene.analysis.commongrams declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | CommonGramsFilter.offsetAttribute |

Fields in org.apache.lucene.analysis.compound declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
protected OffsetAttribute | CompoundWordTokenFilterBase.offsetAtt |

Fields in org.apache.lucene.analysis.core declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | KeywordTokenizer.offsetAtt |
private OffsetAttribute | FlattenGraphFilter.offsetAtt |

Fields in org.apache.lucene.analysis.minhash declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | MinHashFilter.offsetAttribute |

Fields in org.apache.lucene.analysis.miscellaneous declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | ConditionalTokenFilter.OneTimeWrapper.offsetAtt |
private OffsetAttribute | ConcatenatingTokenStream.offsetAtt |
private OffsetAttribute | FixBrokenOffsetsFilter.offsetAtt (deprecated) |
private OffsetAttribute | FingerprintFilter.offsetAtt |
private OffsetAttribute | ConcatenateGraphFilter.offsetAtt |
private OffsetAttribute | LimitTokenOffsetFilter.offsetAttrib |
private OffsetAttribute | WordDelimiterFilter.offsetAttribute (deprecated) |
private OffsetAttribute | HyphenatedWordsFilter.offsetAttribute |
private OffsetAttribute | WordDelimiterGraphFilter.offsetAttribute |
private OffsetAttribute[] | ConcatenatingTokenStream.sourceOffsets |

Fields in org.apache.lucene.analysis.ngram declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | NGramTokenizer.offsetAtt |

Fields in org.apache.lucene.analysis.path declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | ReversePathHierarchyTokenizer.offsetAtt |
private OffsetAttribute | PathHierarchyTokenizer.offsetAtt |

Fields in org.apache.lucene.analysis.pattern declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | SimplePatternSplitTokenizer.offsetAtt |
private OffsetAttribute | SimplePatternTokenizer.offsetAtt |
private OffsetAttribute | PatternTokenizer.offsetAtt |

Fields in org.apache.lucene.analysis.payloads declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | TokenOffsetPayloadTokenFilter.offsetAtt |

Fields in org.apache.lucene.analysis.shingle declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | ShingleFilter.offsetAtt |
(package private) OffsetAttribute | ShingleFilter.InputWindowToken.offsetAtt |
private OffsetAttribute | FixedShingleFilter.offsetAtt |

Fields in org.apache.lucene.analysis.standard declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | ClassicTokenizer.offsetAtt |
private OffsetAttribute | UAX29URLEmailTokenizer.offsetAtt |
private OffsetAttribute | StandardTokenizer.offsetAtt |

Fields in org.apache.lucene.analysis.synonym declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | SynonymGraphFilter.offsetAtt |
private OffsetAttribute | SynonymFilter.offsetAtt (deprecated) |

Fields in org.apache.lucene.analysis.th declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | ThaiTokenizer.offsetAtt |

Classes in org.apache.lucene.analysis.tokenattributes that implement OffsetAttribute

Modifier and Type | Class and Description |
---|---|
class | OffsetAttributeImpl: Default implementation of OffsetAttribute. |
class | PackedTokenAttributeImpl: Default implementation of the common attributes used by Lucene: CharTermAttribute, TypeAttribute, PositionIncrementAttribute, PositionLengthAttribute, OffsetAttribute, TermFrequencyAttribute. |

Fields in org.apache.lucene.analysis.util declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | CharTokenizer.offsetAtt |
private OffsetAttribute | SegmentingTokenizerBase.offsetAtt |

Fields in org.apache.lucene.analysis.wikipedia declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | WikipediaTokenizer.offsetAtt |

Fields in org.apache.lucene.document declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | Field.StringTokenStream.offsetAttribute |

Fields in org.apache.lucene.index declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
(package private) OffsetAttribute | TermVectorsConsumerPerField.offsetAttribute |
(package private) OffsetAttribute | FreqProxTermsWriterPerField.offsetAttribute |
(package private) OffsetAttribute | FieldInvertState.offsetAttribute |

Fields in org.apache.lucene.search.highlight declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | SimpleFragmenter.offsetAtt |
private OffsetAttribute | SimpleSpanFragmenter.offsetAtt |
private OffsetAttribute | TokenGroup.offsetAtt |
private OffsetAttribute | OffsetLimitTokenFilter.offsetAttrib |
private OffsetAttribute | LimitTokenOffsetFilter.offsetAttrib |
private OffsetAttribute | TokenStreamFromTermVector.offsetAttribute |

Fields in org.apache.lucene.search.uhighlight declared as OffsetAttribute

Modifier and Type | Field and Description |
---|---|
private OffsetAttribute | AnalysisOffsetStrategy.MultiValueTokenStream.offsetAtt |
(package private) OffsetAttribute | TokenStreamOffsetStrategy.TokenStreamOffsetsEnum.offsetAtt |
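
The highlighting classes listed above keep an offset attribute because highlighting needs to know where in the original text each matched token starts and ends. The following is a rough sketch of that idea in plain Java, not the Lucene highlighter API. `OffsetHighlightSketch` and its `highlight` method are hypothetical, and the spans are assumed to be sorted, non-overlapping `[start, end)` character ranges that some earlier analysis step has already produced.

```java
import java.util.Arrays;
import java.util.List;

public class OffsetHighlightSketch {
    /**
     * Wraps each [start, end) span of the text in <b>...</b>.
     * Assumes spans are sorted by start and do not overlap.
     */
    static String highlight(String text, List<int[]> spans) {
        StringBuilder out = new StringBuilder();
        int pos = 0;
        for (int[] span : spans) {
            out.append(text, pos, span[0]);   // unmatched text before this span
            out.append("<b>").append(text, span[0], span[1]).append("</b>");
            pos = span[1];
        }
        out.append(text, pos, text.length()); // unmatched text after the last span
        return out.toString();
    }

    public static void main(String[] args) {
        String text = "fast grammar-based tokenizer";
        List<int[]> spans = Arrays.asList(new int[] {0, 4}, new int[] {19, 28});
        System.out.println(highlight(text, spans));
    }
}
```

Because the spans index into the original string rather than into the analyzed token text, the output preserves the source exactly, including characters that an analyzer might have lowercased or stripped.
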