module Cadmium

Overview

Cadmium is a Natural Language Processing (NLP) library for Crystal. It includes several modules and classes for processing sentences and splitting them into digestible pieces.

Every module in Cadmium is accessible via a convenience method directly from the Cadmium module. This allows you to type Cadmium.bayes_classifier instead of the more explicit Cadmium::BayesClassifier.
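
The convenience-method pattern described above can be sketched as follows. This is a minimal, self-contained illustration and not the library's source: the Cadmium module here is a stand-in reduced to a single empty class, and it assumes that each convenience method simply returns the class it names and that the module exposes its methods with extend self.

```crystal
# Stand-in for the real library, reduced to one class so the sketch is
# self-contained. Assumption: each convenience method returns the class itself.
module Cadmium
  class BayesClassifier
  end

  # Make the methods defined below callable directly on the module.
  extend self

  # Convenience method for accessing Cadmium::BayesClassifier
  def bayes_classifier
    BayesClassifier
  end
end

# Both spellings refer to the same class.
puts Cadmium.bayes_classifier == Cadmium::BayesClassifier
```

Under that assumption, Cadmium.bayes_classifier.new and Cadmium::BayesClassifier.new would construct the same kind of object; the short form only saves typing.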

Extended Modules

Defined in:

cadmium.cr
cadmium/classifier.cr
cadmium/classifier/bayes.cr
cadmium/classifier/logistic_regression.cr
cadmium/distance/jaro_winkler.cr
cadmium/distance/levenshtein.cr
cadmium/distance/pair.cr
cadmium/graph.cr
cadmium/graph/edge_weighted_digraph.cr
cadmium/graph/shortest_path.cr
cadmium/inflector/count_inflector.cr
cadmium/inflector/form_set.cr
cadmium/inflector/noun_inflector.cr
cadmium/inflector/present_verb_inflector.cr
cadmium/inflector/tense_inflector.cr
cadmium/inflectors.cr
cadmium/ngrams.cr
cadmium/normalizers/remove_diacritics.cr
cadmium/phonetics.cr
cadmium/phonetics/metaphone.cr
cadmium/phonetics/phonetics.cr
cadmium/phonetics/soundex.cr
cadmium/readability.cr
cadmium/sentiment.cr
cadmium/stemmer.cr
cadmium/stemmer/porter_stemmer.cr
cadmium/stemmer/stemmer.cr
cadmium/stemmer/token.cr
cadmium/tfidf.cr
cadmium/tokenizer.cr
cadmium/tokenizer/aggressive_tokenizer.cr
cadmium/tokenizer/case_tokenizer.cr
cadmium/tokenizer/pragmatic/languages.cr
cadmium/tokenizer/pragmatic/languages/bulgarian.cr
cadmium/tokenizer/pragmatic/languages/common.cr
cadmium/tokenizer/pragmatic/languages/czech.cr
cadmium/tokenizer/pragmatic/languages/deutsch.cr
cadmium/tokenizer/pragmatic/languages/english.cr
cadmium/tokenizer/pragmatic/languages/portuguese.cr
cadmium/tokenizer/pragmatic/languages/spanish.cr
cadmium/tokenizer/pragmatic/post_processor.cr
cadmium/tokenizer/pragmatic/pre_processor.cr
cadmium/tokenizer/pragmatic/regex.cr
cadmium/tokenizer/pragmatic_tokenizer.cr
cadmium/tokenizer/regex_tokenizer.cr
cadmium/tokenizer/sentence_tokenizer.cr
cadmium/tokenizer/tokenizer.cr
cadmium/tokenizer/treebank_word_tokenizer.cr
cadmium/tokenizer/visible_char_tokenizer.cr
cadmium/tokenizer/whitespace_tokenizer.cr
cadmium/tokenizer/word_punctuation_tokenizer.cr
cadmium/tokenizer/word_tokenizer.cr
cadmium/transliterator.cr
cadmium/transliterator/char_map.cr
cadmium/trie.cr
cadmium/util/linalg.cr
cadmium/util/paragraph.cr
cadmium/util/sentence.cr
cadmium/util/stop_words.cr
cadmium/util/syllable.cr
cadmium/version.cr
cadmium/wordnet.cr
cadmium/wordnet/db.cr
cadmium/wordnet/lemma.cr
cadmium/wordnet/pointer.cr
cadmium/wordnet/pointers.cr
cadmium/wordnet/synset.cr

Constant Summary

VERSION = "0.1.0"

Instance Method Detail

def aggressive_tokenizer

Convenience method for accessing Cadmium::AggressiveTokenizer


def bayes_classifier

Convenience method for accessing Cadmium::BayesClassifier


def case_tokenizer

Convenience method for accessing Cadmium::CaseTokenizer


def count_inflector

Convenience method for accessing Cadmium::CountInflector


def edge_weighted_digraph

Convenience method for accessing Cadmium::EdgeWeightedDigraph


def jaro_winkler

Convenience method for accessing Cadmium::JaroWinklerDistance


def levenshtein

Convenience method for accessing Cadmium::LevenshteinDistance


def metaphone

Convenience method for accessing Cadmium::Metaphone


def ngrams

Convenience method for accessing Cadmium::NGrams


def noun_inflector

Convenience method for accessing Cadmium::NounInflector


def pair_distance

Convenience method for accessing Cadmium::PairDistance


def porter_stemmer

Convenience method for accessing Cadmium::PorterStemmer


def pragmatic_tokenizer

Convenience method for accessing Cadmium::PragmaticTokenizer


def present_tense_inflector

Convenience method for accessing Cadmium::PresentTenseInflector


def readability

Convenience method for accessing Cadmium::Readability


def regex_tokenizer

Convenience method for accessing Cadmium::RegexTokenizer


def sentence_tokenizer

Convenience method for accessing Cadmium::SentenceTokenizer


def sentiment

Convenience method for accessing Cadmium::Sentiment


def sound_ex

Convenience method for accessing Cadmium::SoundEX


def tf_idf

Convenience method for accessing Cadmium::TfIdf


def transliterator

Convenience method for accessing Cadmium::Transliterator


def treebank_word_tokenizer

Convenience method for accessing Cadmium::TreebankWordTokenizer


def trie

Convenience method for accessing Cadmium::Trie


def whitespace_tokenizer

Convenience method for accessing Cadmium::WhitespaceTokenizer


def word_punctuation_tokenizer

Convenience method for accessing Cadmium::WordPunctuationTokenizer


def word_tokenizer

Convenience method for accessing Cadmium::WordTokenizer


def wordnet

Convenience method for accessing Cadmium::Wordnet

