public class WikipediaTokenizerFactory extends TokenizerFactory
WikipediaTokenizer
.
<fieldType name="text_wiki" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.WikipediaTokenizerFactory"/> </analyzer> </fieldType>
Modifier and Type | Field and Description |
---|---|
static java.lang.String |
TOKEN_OUTPUT |
protected int |
tokenOutput |
static java.lang.String |
UNTOKENIZED_TYPES |
protected java.util.Set<java.lang.String> |
untokenizedTypes |
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
Constructor and Description |
---|
WikipediaTokenizerFactory(java.util.Map<java.lang.String,java.lang.String> args)
Creates a new WikipediaTokenizerFactory
|
Modifier and Type | Method and Description |
---|---|
WikipediaTokenizer |
create(AttributeFactory factory)
Creates a TokenStream of the specified input using the given AttributeFactory
|
availableTokenizers, create, forName, lookupClass, reloadTokenizers
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
public static final java.lang.String TOKEN_OUTPUT
public static final java.lang.String UNTOKENIZED_TYPES
protected final int tokenOutput
protected java.util.Set<java.lang.String> untokenizedTypes
public WikipediaTokenizerFactory(java.util.Map<java.lang.String,java.lang.String> args)
public WikipediaTokenizer create(AttributeFactory factory)
TokenizerFactory
create
in class TokenizerFactory