class Cadmium::WordPunctuationTokenizer
Defined in:
cadmium/tokenizer/word_punctuation_tokenizer.crConstant Summary
-
REGEX_PATTERN =
/(\w+|[а-я0-9_]+|\.|\!|\'|\"")/i
/(\w+|[а-я0-9_]+|\.|\!|\'|\"")/i
Cadmium::RegexTokenizer
Cadmium::RegexTokenizer
Cadmium::Tokenizer