core.language
Attributes
core.language.LANG_MAP
module-attribute
LANG_MAP = {
    "en": "english",
    "fr": "french",
    "de": "german",
    "es": "spanish",
    "it": "italian",
    "pt": "portuguese",
    "nl": "dutch",
    "sv": "swedish",
    "no": "norwegian",
    "da": "danish",
    "fi": "finnish",
    "ru": "russian",
    "ro": "romanian",
    "hu": "hungarian",
    "tr": "turkish",
}
Maps the ISO 639-1 codes of supported languages to their full names, as used by pre-trained corpora.
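A lookup against this mapping might be wrapped as in the sketch below. It is an illustration only, not part of core.language: `resolve_language` is a hypothetical helper, and the dictionary is a shortened local copy so that the snippet runs on its own.

```python
# Shortened local copy of LANG_MAP so the sketch runs without the package.
LANG_MAP = {"en": "english", "fr": "french", "de": "german"}


def resolve_language(code):
    """Hypothetical helper: return the full name for an ISO 639-1 code."""
    key = code.strip().lower()  # tolerate stray whitespace and upper case
    if key not in LANG_MAP:
        raise ValueError(f"Unsupported language code: {code!r}")
    return LANG_MAP[key]


print(resolve_language("fr"))
```

Raising on an unknown code, rather than returning a default, is one possible design choice; it keeps an unsupported language from silently selecting the wrong corpus.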
core.language.LANG_MAP_REVERSE
module-attribute
LANG_MAP_REVERSE = {v: k for k, v in (LANG_MAP.items())}
Maps the full names of supported languages, as used by pre-trained corpora, to their ISO 639-1 codes.
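Inverting a dictionary with a comprehension is lossless only when its values are unique, which is the case for the full names listed in LANG_MAP. The sketch below makes that assumption explicit and checks the round trip. `invert_mapping` is a hypothetical helper, and the dictionary is a shortened local copy.

```python
# Shortened local copy of LANG_MAP so the sketch runs without the package.
LANG_MAP = {"en": "english", "fr": "french", "de": "german"}


def invert_mapping(mapping):
    """Hypothetical helper: swap keys and values, refusing duplicate values."""
    reverse = {value: key for key, value in mapping.items()}
    if len(reverse) != len(mapping):
        # Two keys shared a value, so one of them was overwritten.
        raise ValueError("Duplicate values: mapping cannot be inverted losslessly")
    return reverse


LANG_MAP_REVERSE = invert_mapping(LANG_MAP)

# Round trip: code -> full name -> code returns the original code.
round_trip_ok = all(LANG_MAP_REVERSE[LANG_MAP[code]] == code for code in LANG_MAP)
print(round_trip_ok)
```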
core.language.STOPWORDS_DICT
module-attribute
STOPWORDS_DICT = {
    language: (set(STOPWORDS_DICT[language])) for language in STOPWORDS_DICT
}
Dictionary mapping full language names (keys) to sets of stopwords (values).
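Storing the stopwords as sets makes each membership test a constant-time hash lookup on average, which matters when filtering long token lists. The sketch below shows one way such a dictionary might be used. The stopword entries are a tiny made-up sample, not the real contents of STOPWORDS_DICT, and `remove_stopwords` is a hypothetical helper.

```python
# Made-up sample with the same shape as STOPWORDS_DICT:
# full language name -> set of stopwords.
STOPWORDS_DICT = {
    "english": {"the", "a", "of"},
    "french": {"le", "la", "de"},
}


def remove_stopwords(tokens, language):
    """Hypothetical helper: drop tokens that are stopwords in `language`."""
    # Unknown languages fall back to an empty set, so nothing is removed.
    stopwords = STOPWORDS_DICT.get(language, set())
    return [token for token in tokens if token.lower() not in stopwords]


print(remove_stopwords(["The", "history", "of", "Rome"], "english"))
```

Because the keys are full language names, a caller holding an ISO 639-1 code would first translate it through LANG_MAP.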