Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

preprocessing.yaml 622 B

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
  1. ##################################
  2. ### preprocessing
  3. ##################################
  4. preprocessing:
  5. # input file for the cleanup_dataset.py script
  6. input_file: "raw_data.csv"
  7. # output file for the cleanup_dataset.py script
  8. output_file: "processed_data.csv"
  9. # columns to drop
  10. drop_columns: ["udi","product_id","twf","hdf","pwf","osf","rnf"]
  11. # target column
  12. target_column: "machine_failure"
  13. # missing values strategy
  14. missing_values_strategy: "drop"
  15. # correlation threshold
  16. correlation_threshold: 0.85
  17. # encode categorical variables with drop_first
  18. encode_categorical_drop_first: True
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...