Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Guy 9598c1935b
Added output "metric.json" to stage "preprocessing.dvc"
3 years ago
c016be9385
init git + dvc, init remote working file system, added data to the project
3 years ago
e9d1904b2e
Added black formatting
3 years ago
1e908bb64b
Updated preprocessing step:
3 years ago
e18a705887
Fixed another issue where because of pandas' infer type some titles and bodies were considered floats instead of strings
3 years ago
9598c1935b
Added output "metric.json" to stage "preprocessing.dvc"
3 years ago
cf7012e4ce
Changed data - removed null columns, added top decile and top percent columns
3 years ago
e9d1904b2e
Added black formatting
3 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File
About

RPPP – Reddit Post Popularity Predictor
A project with two goals:
1. Given a Reddit post, predict how popular it's going to be (what it's score will be)
2. Showcasing a remote working file system with DVC

Collaborators 1

Comments

Loading...