class Linguist::NGramsTokenizer
- Linguist::NGramsTokenizer
- Cadmium::Tokenizer::Base
- Reference
- Object
Defined in:
linguist/tokenizer.crConstant Summary
-
SYMBOL_RANGES =
[0..64, 91..96]
Constructors
Instance Method Summary
- #downcase?(string)
- #max_size : Int32
- #max_size=(max_size : Int32)
- #symbol?(char)
- #tokenize(string : String) : Array(String)