Package | Description |
---|---|
org.apache.lucene.analysis.cn.smart |
Analyzer for Simplified Chinese, which indexes words.
|
org.apache.lucene.analysis.cn.smart.hhmm |
SmartChineseAnalyzer Hidden Markov Model package.
|
Modifier and Type | Field and Description |
---|---|
private java.util.Iterator<SegToken> |
HMMChineseTokenizer.tokens |
Modifier and Type | Method and Description |
---|---|
SegToken |
WordSegmenter.convertSegToken(SegToken st,
java.lang.String sentence,
int sentenceStartOffset)
Process a
SegToken so that it is ready for indexing. |
Modifier and Type | Method and Description |
---|---|
java.util.List<SegToken> |
WordSegmenter.segmentSentence(java.lang.String sentence,
int startOffset)
Segment a sentence into words with
HHMMSegmenter |
Modifier and Type | Method and Description |
---|---|
SegToken |
WordSegmenter.convertSegToken(SegToken st,
java.lang.String sentence,
int sentenceStartOffset)
Process a
SegToken so that it is ready for indexing. |
Modifier and Type | Field and Description |
---|---|
private java.util.List<SegToken> |
BiSegGraph.segTokenList |
private java.util.Map<java.lang.Integer,java.util.ArrayList<SegToken>> |
SegGraph.tokenListTable
Map of start offsets to ArrayList of tokens at that position
|
Modifier and Type | Method and Description |
---|---|
SegToken |
SegTokenFilter.filter(SegToken token)
Filter an input
SegToken |
Modifier and Type | Method and Description |
---|---|
java.util.List<SegToken> |
BiSegGraph.getShortPath()
Find the shortest path with the Viterbi algorithm.
|
java.util.List<SegToken> |
SegGraph.getStartList(int s)
Get the list of tokens at the specified start offset
|
java.util.List<SegToken> |
SegGraph.makeIndex()
Set the
index for each token, based upon its order by startOffset. |
java.util.List<SegToken> |
HHMMSegmenter.process(java.lang.String sentence)
Return a list of
SegToken representing the best segmentation of a sentence |
java.util.List<SegToken> |
SegGraph.toTokenList()
Return a
List of all tokens in the map, ordered by startOffset. |
Modifier and Type | Method and Description |
---|---|
void |
SegGraph.addToken(SegToken token)
Add a
SegToken to the mapping, creating a new mapping at the token's startOffset if one does not exist. |
SegToken |
SegTokenFilter.filter(SegToken token)
Filter an input
SegToken |