"Natural" is a general natural language facility for nodejs. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported.
It's still in the early stages, so we're very interested in bug reports, contributions and the like.
Note that many algorithms from Rob Ellis's node-nltools are being merged into this project and will be maintained from here onward.
At the moment, most of the algorithms are English-specific, but in the long-term, some diversity will be in order. Thanks to Polyakov Vladimir, Russian stemming has been added!, Thanks to David Przybilla, Spanish stemming has been added!.
Aside from this README, the only documentation is this DZone article and here on my blog, which is a bit older.
TABLE OF CONTENTS- Installation
- Tokenizers
- String Distance
- Stemmers
- Classifiers
- Phonetics
- Inflectors
- N-Grams
- tf-idf
- Tries
- EdgeWeightedDigraph
- ShortestPathTree
- LongestPathTree
- WordNet
- Spellcheck
- POS Tagger
- Acknowledgements/references
- Development
- License