See: Description
Class | Description |
---|---|
BLDutchAnalyzer |
A simple analyzer that isn't limited to Latin.
|
BLDutchTokenFilter |
The token filter to accompany BLDutchTokenizer.
|
BLDutchTokenizer |
A simple tokenizer for Dutch texts.
|
BLNonTokenizer |
A tokenizer that doesn't tokenize (returns the whole field value as one
token)
|
BLNonTokenizingAnalyzer |
Analyzer that doesn't tokenize but returns a single token.
|
BLStandardAnalyzer |
A simple analyzer based on StandardTokenizer that isn't limited to Latin.
|
BLWhitespaceAnalyzer |
Simple whitespace analyzer.
|
DesensitizeFilter |
Lowercases and/or removes any accents from the input.
|
RemoveAllAccentsFilter |
Removes any accents from the input.
|
RemovePunctuationFilter |
Replaces punctuation with space.
|
Analyzer implementations, including Tokenizers and Filters.
Copyright © 2020 Instituut voor Nederlandse Taal (INT). All rights reserved.