Juan Diego Bermeo b84978d07b modified dvc file 6 hours ago
.dvc 2bf960eafc added raw project 3 weeks ago
.idea 1181c54d42 added script with transformers to generate kmers 2 weeks ago
data
feature_names_to_use
model_metrics
model_selection
training 1698248d1a modified stage by hand to have unix paths 2 days ago
training_data
utils 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
.dvcignore 2bf960eafc added raw project 3 weeks ago
.gitignore 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
Copia de Elia.ipynb 1181c54d42 added script with transformers to generate kmers 2 weeks ago
Feature selection of multiple kmers - Feature importance.ipynb 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
Feature selection of multiple kmers - Variance and correlation filtering.ipynb dda46f4a11 start separating pipeline into scripts 4 days ago
Predictions on test of best performing models in elegans.ipynb 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
Project 2 description (1).pdf 0967070b29 first commit 3 weeks ago
Scale to human DNA dataset - learning curves for RF, SVC, and logreg - Check feature selction.ipynb 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
Scale to human DNA dataset - learning curves for RF, SVC, and logreg - Learning cuves.ipynb 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
Test performance of kmers individually.ipynb 354a5c4fd7 expeiment concatenated kmers 1-6mer 2 weeks ago
best_pipeline_rf.joblib 4147de01fa Added a function to preprocessing and a tunning experiment for max_depth and min_samples split for RF with concatenated kmers 1-6 v3 1 week ago
config.yml 90f5d8f3f6 Generated kmer vectors 2 days ago
create_folds_metrics.json d0ab768ffa attempt to correct pipeline diagram 3 days ago
data.dvc 1e1c811272 Added the latest trained models metric performance plots to DVC 1 week ago
dvc.lock 1774feb496 added generate_kmers_vectors stage 2 days ago
dvc.yaml 1774feb496 added generate_kmers_vectors stage 2 days ago
feature_names_to_use.dvc 5752f9f386 Added notebook to organie final models for elegans. Updated learning curve notebooks 7 hours ago
generate_kmers_metrics.json 90f5d8f3f6 Generated kmer vectors 2 days ago
main.py 0967070b29 first commit 3 weeks ago
metrics.csv 7649579aa6 added experiment for dummy clasifier. Finished obtaining most relevant featurees for rf, svc, and lr according to permutation importance 1 week ago
model_metrics.dvc 1e1c811272 Added the latest trained models metric performance plots to DVC 1 week ago
model_selection.dvc b84978d07b modified dvc file 6 hours ago
params.yml 7649579aa6 added experiment for dummy clasifier. Finished obtaining most relevant featurees for rf, svc, and lr according to permutation importance 1 week ago

Data Pipeline

Legend: DVC Managed File, Git Managed File, Metric, Stage File, External File