Uses a text classifier with SMOTE and SGDClassifier to predict the categories of Malawi News articles.

Latest commit by Abid (3da3308c76): deepnote button added, correct image link added

README.md

Malawi-News-Classification

Uses a text classifier with SMOTE and SGDClassifier to predict the categories of Malawi News articles.

The project code is simple yet competitive. I experimented with a vectorizer and the Porter stemmer for text preprocessing, and used several text-cleaning methods to improve overall model performance. In the end, I used scikit-learn's Stochastic Gradient Descent (SGD) classifier to predict news categories. I also experimented with various neural networks and gradient boosting models, but they all fell short, since a simple logistic regression with minimal hyperparameter tuning works quite well on this data.

To understand the code, read my article on Medium.