test_process_data.py 892 B

import numpy as np
import pandas as pd
from pandas.testing import assert_frame_equal, assert_series_equal

from src.process_data import split_data, split_X_y


def test_split_X_y():
    # The target column should come back unchanged as y, NaN included.
    data = pd.DataFrame({"X": [1, 2, 3, 4], "AdoptionSpeed": [1, 2, 3, np.nan]})
    X, y = split_X_y(data)
    assert_series_equal(y, pd.Series([1, 2, 3, np.nan], name="AdoptionSpeed"))


def test_split_data():
    data = pd.DataFrame(
        {"a": [1, 2, 3, 4], "b": [1, 2, 3, 4], "AdoptionSpeed": [1, 2, 3, np.nan]}
    )
    # split_data is invoked through its .fn attribute, i.e. the wrapped function.
    out = split_data.fn(data)
    y_train_out = out["y_train"]
    # The row with a missing target keeps its original index, so reset it
    # before comparing against a freshly built frame.
    X_test_out = out["X_test"].reset_index(drop=True)
    y_train_expected = pd.Series([1, 2, 3], name="AdoptionSpeed")
    X_test_expected = pd.DataFrame({"a": [4], "b": [4]})
    # check_dtype=False: the NaN forces the target column to float.
    assert_series_equal(y_train_out, y_train_expected, check_dtype=False)
    assert_frame_equal(X_test_out, X_test_expected, check_dtype=False)
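
For readers without access to src/process_data.py, the following is a minimal sketch of functions that would satisfy the two tests above. It is an assumption inferred only from the assertions, not the repository's actual implementation: the TARGET constant, the labelled mask, and the "X_train" key are hypothetical, and the real split_data is evidently a wrapped object exposing .fn, whereas this sketch uses a plain function.

```python
import numpy as np
import pandas as pd

# Hypothetical constant; the tests only show the column name itself.
TARGET = "AdoptionSpeed"


def split_X_y(data: pd.DataFrame):
    """Separate the feature columns from the target column."""
    X = data.drop(columns=TARGET)
    y = data[TARGET]
    return X, y


def split_data(data: pd.DataFrame) -> dict:
    """Treat rows with a known target as training data and the rest as test data.

    Assumption: the split is driven by missing target values, which is what
    the expected y_train ([1, 2, 3]) and X_test (the last row) suggest.
    """
    X, y = split_X_y(data)
    labelled = y.notna()
    return {
        "X_train": X[labelled],
        "y_train": y[labelled],
        "X_test": X[~labelled],
    }


if __name__ == "__main__":
    frame = pd.DataFrame(
        {"a": [1, 2, 3, 4], "b": [1, 2, 3, 4], TARGET: [1, 2, 3, np.nan]}
    )
    result = split_data(frame)
    print(result["y_train"])
    print(result["X_test"])
```

Under this reading, the single row whose AdoptionSpeed is NaN becomes the test set, which is why the test resets the index of X_test before comparing.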