Commit History

Message Author SHA1 Date
Use only questions (not answers) as data   Tolstoyevsky 4 months ago
Add DS posts to train set only   Tolstoyevsky 4 months ago
Add DS posts to both train and test sets   Tolstoyevsky 4 months ago
Added the "text len" column   Tolstoyevsky 4 months ago
Only numeric columns (no text)   Tolstoyevsky 4 months ago
Removed extra features   Tolstoyevsky 4 months ago
AdaBoost correct AUC   Tolstoyevsky 5 months ago
linear classifier with correct AUC   Tolstoyevsky 5 months ago
Allow logistic regression to converge   Tolstoyevsky 5 months ago
Properly calculate roc_auc and pr_auc   Tolstoyevsky 5 months ago
50000 vocab size   Tolstoyevsky 5 months ago
12000 vocab size   Tolstoyevsky 5 months ago
Convert exponent notation to <num>   Tolstoyevsky 5 months ago
Omitted numeric tokens   Tolstoyevsky 5 months ago
Turn numbers to a special <num> token   Tolstoyevsky 5 months ago
Adaboost with extra features   Tolstoyevsky 5 months ago
Merge branch 'extra-features'   Tolstoyevsky 5 months ago
Added code to predict user-supplied text   Tolstoyevsky 5 months ago
Done with the new pipeline and extra features   Tolstoyevsky 5 months ago
Seemingly successfully ran a ColumnTransformer pipeline   Tolstoyevsky 5 months ago
WIP on sklearn_pandas   Tolstoyevsky 5 months ago
Dummy classifier - most_frequent   Tolstoyevsky 5 months ago
Dummy classifier - stratified   Tolstoyevsky 5 months ago
WIP   Tolstoyevsky 5 months ago
Trigrams   Tolstoyevsky 5 months ago
Fixed text preprocessor pickling   Tolstoyevsky 5 months ago
Fix param logging and pickling of TFIDF   Tolstoyevsky 5 months ago
Try bigrams   Tolstoyevsky 5 months ago
Try random forest with default params   Tolstoyevsky 5 months ago
Try random forest   Tolstoyevsky 5 months ago