public class ShingleFilterFactory extends TokenFilterFactory
ShingleFilter
.
<fieldType name="text_shingle" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.ShingleFilterFactory" minShingleSize="2" maxShingleSize="2" outputUnigrams="true" outputUnigramsIfNoShingles="false" tokenSeparator=" " fillerToken="_"/> </analyzer> </fieldType>
Modifier and Type | Field and Description |
---|---|
private java.lang.String |
fillerToken |
private int |
maxShingleSize |
private int |
minShingleSize |
private boolean |
outputUnigrams |
private boolean |
outputUnigramsIfNoShingles |
private java.lang.String |
tokenSeparator |
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
Constructor and Description |
---|
ShingleFilterFactory(java.util.Map<java.lang.String,java.lang.String> args)
Creates a new ShingleFilterFactory
|
Modifier and Type | Method and Description |
---|---|
ShingleFilter |
create(TokenStream input)
Transform the specified input TokenStream
|
availableTokenFilters, forName, lookupClass, normalize, reloadTokenFilters
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
private final int minShingleSize
private final int maxShingleSize
private final boolean outputUnigrams
private final boolean outputUnigramsIfNoShingles
private final java.lang.String tokenSeparator
private final java.lang.String fillerToken
public ShingleFilterFactory(java.util.Map<java.lang.String,java.lang.String> args)
public ShingleFilter create(TokenStream input)
TokenFilterFactory
create
in class TokenFilterFactory