Determining whether a text in a message or in Web page is spam is a difficult task, as it requires a kind of contextual analysis that is not trivial to emulate in software.
This class provides a solution that can classify text and build a knowledge base of what should be or should not be considered spam based on information that may eventually be provided by a human user.
Since it processes text that can be presented in many formats, it could be used to classify and detect spam on many Internet services like e-mail, Web pages, forums, chat services, etc..
This class can automatically classify text messages to determine whether or not their are considered to be spam.
It can build a knowledge base of known text expressions that can be looked up later to evaluate a factor that expresses the probability of a given text to be spam.
This class could be used in Web mail applications or even in less obvious applications like forums and guest books, acting like an semi-automatic moderator.