This software is a Java library that specializes in machine learning techniques for text analysis. Best of all, it's completely free to use!
One of the most notable features of MALLET is its advanced tools for document classification. The package offers efficient routines for converting text into "features" as well as a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees). Users can also evaluate classifier performance using several commonly used metrics.
In addition to document classification, MALLET provides facilities for information extraction, part-of-speech tagging, noun phrase segmentation, and much more. While the development of the library is quite mature, the tool currently lacks polished front-ends and documentation compared to more established software like Rainbow.
Overall, MALLET is a powerful software package for anyone seeking to delve into natural language processing and machine learning applications. It is important to note that this tool is licensed and released under the terms of the Common Public License.
Version 2.0 RC4: N/A