Spam analysis classifier
Develop Prediction Model for webspam and hyperlink analysis designed and trained (with provided data) to achieve certain prediction goals.
We have built model for Spam\No-spam prediction for links analysis company. We have used Big Data methods for data size of 70+ Gb. There were a lot of text features, which were preprocessed by using TF-IDF, Word2Vec and Features Selecting methods. Various data cleaning and ETL methods were applied. As result we have built classifier model, which was deployed as a RESTful web service