Commit History
Message Author Date
Use only questions (not answers) as data   Tolstoyevsky 4 years ago
Add DS posts to train set only   Tolstoyevsky 4 years ago
Add DS posts to both train and test sets   Tolstoyevsky 4 years ago
Added the "text len" column   Tolstoyevsky 4 years ago
Only numeric columns (no text)   Tolstoyevsky 4 years ago
Removed extra features   Tolstoyevsky 4 years ago
AdaBoost correct AUC   Tolstoyevsky 4 years ago
linear classifier with correct AUC   Tolstoyevsky 4 years ago
Allow logistic regression to converge   Tolstoyevsky 4 years ago
Properly calculate roc_auc and pr_auc   Tolstoyevsky 4 years ago
50000 vocab size   Tolstoyevsky 4 years ago
12000 vocab size   Tolstoyevsky 4 years ago
Convert exponent notation to <num>   Tolstoyevsky 4 years ago
Omitted numeric tokens   Tolstoyevsky 4 years ago
Turn numbers to a special <num> token   Tolstoyevsky 4 years ago
Adaboost with extra features   Tolstoyevsky 4 years ago
Merge branch 'extra-features'   Tolstoyevsky 4 years ago
Added code to predict user-supplied text   Tolstoyevsky 4 years ago
Done with the new pipeline and extra features   Tolstoyevsky 4 years ago
Seemingly successfully ran a ColumnTransformer pipeline   Tolstoyevsky 4 years ago
WIP on sklearn_pandas   Tolstoyevsky 4 years ago
Dummy classifier - most_frequent   Tolstoyevsky 4 years ago
Dummy classifier - stratified   Tolstoyevsky 4 years ago
WIP   Tolstoyevsky 4 years ago
Trigrams   Tolstoyevsky 4 years ago
Fixed text preprocessor pickling   Tolstoyevsky 4 years ago
Fix param logging and pickling of TFIDF   Tolstoyevsky 4 years ago
Try bigrams   Tolstoyevsky 4 years ago
Try random forest with default params   Tolstoyevsky 4 years ago
Try random forest   Tolstoyevsky 4 years ago