data_validation.yml 1.9 KB

name: Data validation with deepchecks

# Run only when the data, the data-processing scripts, or the DVC lock file change.
on:
  push:
    branches: [ main ]
    paths:
      - data/processed/**
      - data/augmented/**
      - src/data_preparation.py
      - src/data_augmentation.py
      - dvc.lock
  pull_request:
    branches: [ main ]
    paths:
      - data/processed/**
      - data/augmented/**
      - src/data_preparation.py
      - src/data_augmentation.py
      - dvc.lock

# Write access is needed to commit the reports and to post the commit comment.
permissions: write-all

jobs:
  run_data_checks:
    runs-on: ubuntu-latest
    env:
      DVC_USERNAME: ${{ secrets.DVC_USERNAME }}
      DVC_PASSWORD: ${{ secrets.DVC_PASSWORD }}
    steps:
      - name: Checkout
        uses: actions/checkout@v3
      - name: Set up Python 3.10
        uses: actions/setup-python@v3
        with:
          python-version: "3.10"
          cache: pip
      - name: Cache dependencies
        uses: actions/cache@v2
        with:
          path: ~/.cache/pip
          key: ${{ runner.os }}-pip-${{ hashFiles('**/dev-requirements.txt') }}
          restore-keys: ${{ runner.os }}-pip-
      - name: Install dependencies
        run: pip install -r dev-requirements.txt
      # Configure basic auth for the DVC remote from repository secrets, then pull the data.
      - name: Get data
        run: |
          dvc remote modify origin --local auth basic
          dvc remote modify origin --local user "$DVC_USERNAME"
          dvc remote modify origin --local password "$DVC_PASSWORD"
          dvc pull -v -r origin
      - name: Remove the old validation reports
        run: |
          rm -rf data_validation
          mkdir data_validation
      # Regenerate the processed and augmented data, then run the validation script.
      - name: Validate data with deepchecks
        run: |
          python src/data_preparation.py
          python src/data_augmentation.py
          python src/data_validation.py
      # Commit the regenerated reports back to the branch.
      - name: Commit the validation report file
        uses: stefanzweifel/git-auto-commit-action@v4
        with:
          commit_message: Added data validation reports
      - name: Create a comment
        uses: peter-evans/commit-comment@v1
        with:
          body: |
            Please check data validation reports.