module Cadmium
Overview
Cadmium is a Natrual Language Processing (NLP) library for Crystal. It includes several modules and classes for processing sentences and splitting them into digestable pieces.
Every module in Cadmium is accessable via a convenience method directly from the
Cadmium module. This allows you to type Cadmium.bayes_classifier
instead of
the more explicit Cadmium::BayesClassifier
.
Extended Modules
Defined in:
cadmium.crcadmium/classifier.cr
cadmium/classifier/bayes.cr
cadmium/classifier/logistic_regression.cr
cadmium/distance/jaro_winkler.cr
cadmium/distance/levenshtein.cr
cadmium/distance/pair.cr
cadmium/graph.cr
cadmium/graph/edge_weighted_digraph.cr
cadmium/graph/shortest_path.cr
cadmium/inflector/count_inflector.cr
cadmium/inflector/form_set.cr
cadmium/inflector/noun_inflector.cr
cadmium/inflector/present_verb_inflector.cr
cadmium/inflector/tense_inflector.cr
cadmium/inflectors.cr
cadmium/ngrams.cr
cadmium/normalizers/remove_diacritics.cr
cadmium/phonetics.cr
cadmium/phonetics/metaphone.cr
cadmium/phonetics/phonetics.cr
cadmium/phonetics/soundex.cr
cadmium/readability.cr
cadmium/sentiment.cr
cadmium/stemmer.cr
cadmium/stemmer/porter_stemmer.cr
cadmium/stemmer/stemmer.cr
cadmium/stemmer/token.cr
cadmium/tfidf.cr
cadmium/tokenizer.cr
cadmium/tokenizer/aggressive_tokenizer.cr
cadmium/tokenizer/case_tokenizer.cr
cadmium/tokenizer/pragmatic/languages.cr
cadmium/tokenizer/pragmatic/languages/bulgarian.cr
cadmium/tokenizer/pragmatic/languages/common.cr
cadmium/tokenizer/pragmatic/languages/czech.cr
cadmium/tokenizer/pragmatic/languages/deutsch.cr
cadmium/tokenizer/pragmatic/languages/english.cr
cadmium/tokenizer/pragmatic/languages/portuguese.cr
cadmium/tokenizer/pragmatic/languages/spanish.cr
cadmium/tokenizer/pragmatic/post_processor.cr
cadmium/tokenizer/pragmatic/pre_processor.cr
cadmium/tokenizer/pragmatic/regex.cr
cadmium/tokenizer/pragmatic_tokenizer.cr
cadmium/tokenizer/regex_tokenizer.cr
cadmium/tokenizer/sentence_tokenizer.cr
cadmium/tokenizer/tokenizer.cr
cadmium/tokenizer/treebank_word_tokenizer.cr
cadmium/tokenizer/visible_char_tokenizer.cr
cadmium/tokenizer/whitespace_tokenizer.cr
cadmium/tokenizer/word_punctuation_tokenizer.cr
cadmium/tokenizer/word_tokenizer.cr
cadmium/transliterator.cr
cadmium/transliterator/char_map.cr
cadmium/trie.cr
cadmium/util/linalg.cr
cadmium/util/paragraph.cr
cadmium/util/sentence.cr
cadmium/util/stop_words.cr
cadmium/util/syllable.cr
cadmium/version.cr
cadmium/wordnet.cr
cadmium/wordnet/db.cr
cadmium/wordnet/lemma.cr
cadmium/wordnet/pointer.cr
cadmium/wordnet/pointers.cr
cadmium/wordnet/synset.cr
Constant Summary
-
VERSION =
"0.1.0"
Instance Method Summary
-
#aggressive_tokenizer
Convenience method for accessing
Cadmium::AggressiveTokenizer
-
#bayes_classifier
Convenience method for accessing
Cadmium::BayesClassifier
-
#case_tokenizer
Convenience method for accessing
Cadmium::CaseTokenizer
-
#count_inflector
Convenience method for accessing
Cadmium::CountInflector
-
#edge_weighted_digraph
Convenience method for accessing
Cadmium::EdgeWeightedDigraph
-
#jaro_winkler
Convenience method for accessing
Cadmium::JaroWinklerDistance
-
#levenshtein
Convenience method for accessing
Cadmium::LevenshteinDistance
-
#metaphone
Convenience method for accessing
Cadmium::Metaphone
-
#ngrams
Convenience method for accessing
Cadmium::NGrams
-
#noun_inflector
Convenience method for accessing
Cadmium::NounInflector
-
#pair_distance
Convenience method for accessing
Cadmium::PairDistance
-
#porter_stemmer
Convenience method for accessing
Cadmium::PorterStemmer
-
#pragmatic_tokenizer
Convenience method for accessing
Cadmium::PragmaticTokenizer
-
#present_tense_inflector
Convenience method for accessing
Cadmium::PresentTenseInflector
-
#readability
Convenience method for accessing
Cadmium::Readability
-
#regex_tokenizer
Convenience method for accessing
Cadmium::RegexTokenizer
-
#sentence_tokenizer
Convenience method for accessing
Cadmium::SentenceTokenizer
-
#sentiment
Convenience method for accessing
Cadmium::Sentiment
-
#sound_ex
Convenience method for accessing
Cadmium::SoundEX
-
#tf_idf
Convenience method for accessing
Cadmium::TfIdf
-
#transliterator
Convenience method for accessing
Cadmium::Transliterator
-
#treebank_word_tokenizer
Convenience method for accessing
Cadmium::TreebankWordTokenizer
-
#trie
Convenience method for accessing
Cadmium::Trie
-
#whitespace_tokenizer
Convenience method for accessing
Cadmium::WhitespaceTokenizer
-
#word_punctuation_tokenizer
Convenience method for accessing
Cadmium::WordPunctuationTokenizer
-
#word_tokenizer
Convenience method for accessing
Cadmium::WordTokenizer
-
#wordnet
Convenience method for accessing
Cadmium::Wordnet