class Cadmium::Tokenizer::Word
Defined in:
cadmium/tokenizer/word.cr

Constant Summary
REGEX_PATTERN = /[^A-Za-zА-Яа-я0-9_]+/
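
Because the pattern is a negated character class, it matches runs of separator characters rather than the words themselves: anything that is not a Latin letter, a Cyrillic letter in the range А-я, a digit, or an underscore. The sketch below shows what splitting on it yields. It assumes the tokenizer splits the input on this pattern, which this page does not confirm; the constant is copied locally and the sample string is made up, so the snippet runs with only the standard library.

```crystal
# Copied from the Constant Summary; redefined here so the sketch
# does not require the cadmium shard.
REGEX_PATTERN = /[^A-Za-zА-Яа-я0-9_]+/

# Hypothetical sample input mixing Latin, Cyrillic, digits and underscore.
text = "Hello, world! Привет, мир_2"

# Assumption: tokens are the pieces left after splitting on the pattern.
# remove_empty drops empty strings from leading or trailing separators.
tokens = text.split(REGEX_PATTERN, remove_empty: true)

puts tokens.inspect
```

Note that the ranges А-Я and а-я do not include Ё/ё, and accented Latin letters such as é are not in the class either, so on their own they would be treated as separators.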
Cadmium::Tokenizer::Regex
Cadmium::Tokenizer::Base
Cadmium::Tokenizer::Diacritics
Cadmium::Tokenizer::StopWords