org.apache.lucene.analysis.nl
Class DutchStemFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.nl.DutchStemFilter
public final class DutchStemFilter
- extends org.apache.lucene.analysis.TokenFilter
A TokenFilter that stems Dutch words.
It supports a table of words that should not be stemmed at all. The stemmer
can be replaced at runtime, after the filter object is created, as long as
the replacement is a DutchStemmer.
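The exclusion table and the replaceable stemmer described above can be modelled without Lucene. The following is a minimal sketch under stated assumptions, not the filter's actual implementation: the class name ExclusionSketch, the use of a Function in place of DutchStemmer, and the toy "drop a trailing en" rule are all hypothetical and chosen only to keep the example self-contained.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import java.util.function.Function;

// Standalone model of the described behaviour; it does not use Lucene classes.
class ExclusionSketch {
    // Hypothetical stand-in for DutchStemmer: any function from term to stem.
    private Function<String, String> stemmer;
    // Words that must pass through unchanged; null means "no exclusions".
    private final Set<String> exclusions;

    ExclusionSketch(Function<String, String> stemmer, Set<String> exclusions) {
        this.stemmer = stemmer;
        this.exclusions = exclusions;
    }

    // Mirrors the idea behind setStemmer: swap the stemmer after construction.
    void setStemmer(Function<String, String> stemmer) {
        this.stemmer = stemmer;
    }

    // Excluded words are returned as-is; everything else goes to the stemmer.
    String process(String term) {
        if (exclusions != null && exclusions.contains(term)) {
            return term;
        }
        return stemmer.apply(term);
    }

    // Toy rule for illustration only (assumption): drop a trailing "en".
    // This is NOT the Dutch stemming algorithm.
    static String toyStem(String term) {
        return term.endsWith("en") && term.length() > 3
                ? term.substring(0, term.length() - 2)
                : term;
    }

    public static void main(String[] args) {
        Set<String> keep = new HashSet<>(Arrays.asList("boeken"));
        ExclusionSketch filter = new ExclusionSketch(ExclusionSketch::toyStem, keep);
        System.out.println(filter.process("boeken"));  // prints boeken (excluded)
        System.out.println(filter.process("fietsen")); // prints fiets (toy rule)
        filter.setStemmer(String::toUpperCase);        // replace the stemmer at runtime
        System.out.println(filter.process("fietsen")); // prints FIETSEN
    }
}
```

The exclusion check runs before the stemmer is consulted, so replacing the stemmer never affects excluded words.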
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
Fields inherited from class org.apache.lucene.analysis.TokenFilter
input
Constructor Summary
DutchStemFilter(org.apache.lucene.analysis.TokenStream _in)
DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable)
Builds a DutchStemFilter that uses an exclusion table.
DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable,
Map stemdictionary)
Methods inherited from class org.apache.lucene.analysis.TokenFilter
close, end, reset
Methods inherited from class org.apache.lucene.analysis.TokenStream
getOnlyUseNewAPI, next, next, setOnlyUseNewAPI
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString
DutchStemFilter
public DutchStemFilter(org.apache.lucene.analysis.TokenStream _in)
DutchStemFilter
public DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable)
- Builds a DutchStemFilter that uses an exclusion table.
DutchStemFilter
public DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable,
Map stemdictionary)
- Parameters:
stemdictionary
- Dictionary of word-stem pairs that overrule the algorithm
incrementToken
public boolean incrementToken()
throws IOException
- Advances the stream to the next token. Returns true if a token is available, or false at the end of the stream.
- Overrides:
incrementToken
in class org.apache.lucene.analysis.TokenStream
- Throws:
IOException
setStemmer
public void setStemmer(DutchStemmer stemmer)
- Set an alternative/custom DutchStemmer for this filter.
setExclusionTable
public void setExclusionTable(HashSet exclusiontable)
- Set an alternative exclusion table for this filter.
setStemDictionary
public void setStemDictionary(HashMap dict)
- Set the dictionary for stemming. This dictionary overrules the algorithm,
so you can correct a particular unwanted word-stem pair.
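To illustrate how a dictionary can overrule an algorithmic stem, here is a minimal sketch that uses only java.util collections. It is an assumption about the lookup order, not the library's code: the class DictionaryOverrideSketch, its stem method, and the toy "drop a trailing en" rule are hypothetical names introduced for this example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

// Standalone model of "dictionary overrules algorithm"; it does not use Lucene classes.
class DictionaryOverrideSketch {
    // Returns the dictionary's stem when the word has an entry;
    // otherwise falls back to the algorithmic stemmer.
    static String stem(String word, Map<String, String> dictionary,
                       Function<String, String> algorithm) {
        if (dictionary != null) {
            String override = dictionary.get(word);
            if (override != null) {
                return override;
            }
        }
        return algorithm.apply(word);
    }

    // Toy rule for illustration only (assumption): drop a trailing "en".
    // This is NOT the Dutch stemming algorithm.
    static String toyAlgorithm(String word) {
        return word.endsWith("en") && word.length() > 3
                ? word.substring(0, word.length() - 2)
                : word;
    }

    public static void main(String[] args) {
        // "steden" is the plural of "stad"; the toy rule alone would yield "sted".
        Map<String, String> dictionary = new HashMap<>();
        dictionary.put("steden", "stad");

        Function<String, String> algorithm = DictionaryOverrideSketch::toyAlgorithm;
        System.out.println(stem("steden", dictionary, algorithm));  // prints stad
        System.out.println(stem("fietsen", dictionary, algorithm)); // prints fiets
    }
}
```

Words without a dictionary entry still go through the algorithm, so the dictionary only needs to list the exceptions.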
Copyright © 2000-2010 Apache Software Foundation. All Rights Reserved.