RPPP – Reddit Post Popularity Predictor
A project with two goals:
1. Given a Reddit post, predict how popular it's going to be (what it's score will be)
2. Showcasing a remote working file system with DVC

Puneetha Pai d405d7446a
Merge branch 'update_dvc' of puneethp/RPPP into master
3caabdf8f5
Fix: remove unused dvc remote google cache
9 months ago
7a9631b1dd
Refactor: reorder files
9 months ago
c1ea478718
Clean: param and metrics logging
9 months ago
src
58bf4bbc0c
Fix Build: lint error and Jenkins file syntax change
9 months ago
20786d2e70
Fix: unused pytest import
9 months ago
e26c2233ad
Add: Jenkins pipeline definition
9 months ago
e26c2233ad
Add: Jenkins pipeline definition
9 months ago
3ad4fd1944
Update: Pipeline once
9 months ago
e48b1b5101
Remove: unused commands
8 months ago
d9735a5dd8
Add contributing guide
1 year ago
e48b1b5101
Remove: unused commands
8 months ago
710647b99d
Update repo URL to dagshub repo
6 months ago
0aa625a470
Update 'README.md'
1 year ago
2fc1e9512a
Add: final PR reveiw usecase
8 months ago
abe9cefcf2
710647b: Update dvc.lock and metrics
6 months ago
c1ea478718
Clean: param and metrics logging
9 months ago
e48b1b5101
Remove: unused commands
8 months ago
cac18be035
Add 'remote-wfs-setup.md'
1 year ago
b00091c320
Update: dvc to 2.0.6
6 months ago
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

RPPP - Reddit Post Popularity Predictor

This Project attempts to predict whether a reddit submission will be popular or not according to it's features.

We currently provide models for r/MachineLearning only, base on submission title and body.

DVC Remote Working File System

This project is also an exploration of DVC remote WFS workflow. To setup your remote WFS – read here: Remote WFS Setup

Contributing

Contributions Are Very Welcome!

Read the Contribution Guide for more information.

Ideas to work on:

  • Combine textual and numerical classifier into one model!
  • Add UI to test if your post is going to be successful!
  • Add MOAR data! (other subreddits, more from r/ML)
  • Improve model performance (there is a lotttt to improve)!
  • Add memes: Add MOAR MEMES