Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
Functions for cross validation, shuffle, cartesian product and more
Configurable Naive Bayes Classifier for text with cross-validation support
Configurable BM25 Text Search Engine with simple semantic search support
An Implementation of Jaro Distance Algorithm by Matthew A. Jaro
Fast and Numerically Stable Statistical Analysis Utilities
Decision Tree to predict the value of a continuous target variable
Distance/Similarity functions for Bag of Words, Strings, Numbers, Dates and Vectors.
Accurate and fast sentiment scoring of phrases with emoticons :) & emojis 🎉
Multilingual tokenizer that automatically tags each token with its type
English Part-of-speech (POS) tagger
English lexicon useful in NLP/NLU
Language agnostic named entity recognizer
Multi-class averaged perceptron