Title: | Unine Light Stemmer |
---|---|
Description: | Implementation of "light" stemmers for French, German, Italian, Spanish, Portuguese, Finnish, Swedish. They are based on the same work as the "light" stemmers found in 'SolR' <https://lucene.apache.org/solr/> or 'ElasticSearch' <https://www.elastic.co/fr/products/elasticsearch>. A "light" stemmer consists in removing inflections only for noun and adjectives. Indexing verbs for these languages is not of primary importance compared to nouns and adjectives. The stemming procedure for French is described in (Savoy, 1999) <doi:10.1002/(SICI)1097-4571(1999)50:10%3C944::AID-ASI9%3E3.3.CO;2-H>. |
Authors: | Michaël Benesty [aut, cre, cph], Jacques Savoy [cph] |
Maintainer: | Michaël Benesty <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.2.0 |
Built: | 2024-11-19 04:55:29 UTC |
Source: | https://github.com/pommedeterresautee/unine |
Stemmer for Finnish words
finnish_stemmer(words)
finnish_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
finnish_stemmer(c("taivas"))
finnish_stemmer(c("taivas"))
Finnish stop list
finnish_stoplist
finnish_stoplist
character containing the list of words
data("finnish_stoplist")
data("finnish_stoplist")
Stemmer for French words
french_stemmer(words)
french_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
french_stemmer(c("tester", "testament", "clients"))
french_stemmer(c("tester", "testament", "clients"))
French stop list
french_stoplist
french_stoplist
character containing the list of words
data("french_stoplist")
data("french_stoplist")
Stemmer for German words
german_stemmer(words)
german_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
german_stemmer(c("kinder"))
german_stemmer(c("kinder"))
German stop list
german_stoplist
german_stoplist
character containing the list of words
data("german_stoplist")
data("german_stoplist")
Stemmer for Italian words
italian_stemmer(words)
italian_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
italian_stemmer(c("arrivederci"))
italian_stemmer(c("arrivederci"))
Italian stop list
italian_stoplist
italian_stoplist
character containing the list of words
data("italian_stoplist")
data("italian_stoplist")
Stemmer for Portuguese words
portuguese_stemmer(words)
portuguese_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
portuguese_stemmer(c("adeus"))
portuguese_stemmer(c("adeus"))
Portuguese stop list
portuguese_stoplist
portuguese_stoplist
character containing the list of words
data("portuguese_stoplist")
data("portuguese_stoplist")
Stemmer for Spanish words
spanish_stemmer(words)
spanish_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
spanish_stemmer(c("perros"))
spanish_stemmer(c("perros"))
Spanish stop list
spanish_stoplist
spanish_stoplist
character containing the list of words
data("spanish_stoplist")
data("spanish_stoplist")
Stemmer for Swedish words
swedish_stemmer(words)
swedish_stemmer(words)
words |
a character containing the original words. |
character with stemmed words.
swedish_stemmer(c("stiga"))
swedish_stemmer(c("stiga"))
Swedish stop list
swedish_stoplist
swedish_stoplist
character containing the list of words
data("swedish_stoplist")
data("swedish_stoplist")
Implementation of "light" stemmers for French, German, Italian, Spanish, Portuguese, Finnish, Swedish. They are based on the same work as the "light" stemmers found in 'SolR' <https://lucene.apache.org/solr/> or 'ElasticSearch' <https://www.elastic.co/fr/products/elasticsearch>. A "light" stemmer consists in removing inflections only for noun and adjectives. Indexing verbs for these languages is not of primary importance compared to nouns and adjectives. The stemming procedure for French is described in (Savoy, 1999) <doi:10.1002/(SICI)1097-4571(1999)50:10
Maintainer: Michaël Benesty [email protected] [copyright holder]
Other contributors:
Jacques Savoy [email protected] [copyright holder]
Useful links:
Report bugs at https://github.com/pommedeterresautee/unine/issues