Package 'unine'

Title: Unine Light Stemmer
Description: Implementation of "light" stemmers for French, German, Italian, Spanish, Portuguese, Finnish, Swedish. They are based on the same work as the "light" stemmers found in 'SolR' <https://lucene.apache.org/solr/> or 'ElasticSearch' <https://www.elastic.co/fr/products/elasticsearch>. A "light" stemmer consists in removing inflections only for noun and adjectives. Indexing verbs for these languages is not of primary importance compared to nouns and adjectives. The stemming procedure for French is described in (Savoy, 1999) <doi:10.1002/(SICI)1097-4571(1999)50:10%3C944::AID-ASI9%3E3.3.CO;2-H>.
Authors: Michaël Benesty [aut, cre, cph], Jacques Savoy [cph]
Maintainer: Michaël Benesty <[email protected]>
License: MIT + file LICENSE
Version: 0.2.0
Built: 2024-09-19 04:13:32 UTC
Source: https://github.com/pommedeterresautee/unine

Help Index


Stem Finnish words

Description

Stemmer for Finnish words

Usage

finnish_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

finnish_stemmer(c("taivas"))

Finnish stop list

Description

Finnish stop list

Usage

finnish_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("finnish_stoplist")

Stem French words

Description

Stemmer for French words

Usage

french_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

french_stemmer(c("tester", "testament", "clients"))

French stop list

Description

French stop list

Usage

french_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("french_stoplist")

Stem German words

Description

Stemmer for German words

Usage

german_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

german_stemmer(c("kinder"))

German stop list

Description

German stop list

Usage

german_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("german_stoplist")

Stem Italian words

Description

Stemmer for Italian words

Usage

italian_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

italian_stemmer(c("arrivederci"))

Italian stop list

Description

Italian stop list

Usage

italian_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("italian_stoplist")

Stem Portuguese words

Description

Stemmer for Portuguese words

Usage

portuguese_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

portuguese_stemmer(c("adeus"))

Portuguese stop list

Description

Portuguese stop list

Usage

portuguese_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("portuguese_stoplist")

Stem Spanish words

Description

Stemmer for Spanish words

Usage

spanish_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

spanish_stemmer(c("perros"))

Spanish stop list

Description

Spanish stop list

Usage

spanish_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("spanish_stoplist")

Stem Swedish words

Description

Stemmer for Swedish words

Usage

swedish_stemmer(words)

Arguments

words

a character containing the original words.

Value

character with stemmed words.

Examples

swedish_stemmer(c("stiga"))

Swedish stop list

Description

Swedish stop list

Usage

swedish_stoplist

Format

character containing the list of words

Source

Unine website

Examples

data("swedish_stoplist")

unine: Unine Light Stemmer

Description

Implementation of "light" stemmers for French, German, Italian, Spanish, Portuguese, Finnish, Swedish. They are based on the same work as the "light" stemmers found in 'SolR' <https://lucene.apache.org/solr/> or 'ElasticSearch' <https://www.elastic.co/fr/products/elasticsearch>. A "light" stemmer consists in removing inflections only for noun and adjectives. Indexing verbs for these languages is not of primary importance compared to nouns and adjectives. The stemming procedure for French is described in (Savoy, 1999) <doi:10.1002/(SICI)1097-4571(1999)50:10

Author(s)

Maintainer: Michaël Benesty [email protected] [copyright holder]

Other contributors:

See Also

Useful links: