SPAM Classification System

Overview

A classic NLP project to identify spam messages. This system uses a Bag of Words model combined with several classification algorithms to accurately filter out spam.

Approach

Text Preprocessing: Tokenization, stop-word removal, and stemming.
Feature Extraction: CountVectorizer (Bag of Words).
Modeling: Trained Random Forest, Decision Tree, and Naive Bayes models.

Results

Naive Bayes showed the best performance for this specific text classification task, balancing precision and recall effectively.