public class CzechStemmer
extends java.lang.Object
Implements the algorithm described in: Indexing and stemming approaches for the Czech language http://portal.acm.org/citation.cfm?id=1598600
Constructor and Description |
---|
CzechStemmer() |
Modifier and Type | Method and Description |
---|---|
private int |
normalize(char[] s,
int len) |
private int |
removeCase(char[] s,
int len) |
private int |
removePossessives(char[] s,
int len) |
int |
stem(char[] s,
int len)
Stem an input buffer of Czech text.
|
public int stem(char[] s, int len)
s
- input bufferlen
- length of input bufferNOTE: Input is expected to be in lowercase, but with diacritical marks
private int removeCase(char[] s, int len)
private int removePossessives(char[] s, int len)
private int normalize(char[] s, int len)