|
update doc per feedback from Engineer Pang and Zhang Chao
|
Li Chao
|
|
5 years ago |
|
fix some errors in pyspark impl
|
Li Chao
|
|
5 years ago |
|
pyspark implementation of deviation operator
|
Li Chao
|
|
5 years ago |
|
initial Spark scala implementation
|
Li Chao
|
|
5 years ago |
|
add a dvc pipeline of app.py
|
Li Chao
|
|
5 years ago |
|
add dvc
|
Li Chao
|
|
5 years ago |
|
fix a bug in deviation operator
|
Li Chao
|
|
5 years ago |
|
naive implementation of string deviation
|
Li Chao
|
|
5 years ago |
|
update the dispersion rate operator doc
|
Li Chao
|
|
5 years ago |
|
update doc & add joining table scripts
|
Li Chao
|
|
5 years ago |
|
updated the doc defining the dispersion calculation workflow
|
Li Chao
|
|
5 years ago |
|
update doc
|
Li Chao
|
|
5 years ago |
|
naive dask implementation
|
Li Chao
|
|
5 years ago |
|
PySpark job verified: python sparkjob.py
|
Li Chao
|
|
5 years ago |
|
improve output format
|
Li Chao
|
|
5 years ago |
|
single process algorithm verified: pytest -s test_stations.py
|
Li Chao
|
|
5 years ago |
|
add multi-station implementation
|
Li Chao
|
|
5 years ago |
|
verified on spark cluster
|
Li Chao
|
|
5 years ago |
|
some refactor and comments
|
Li Chao
|
|
5 years ago |
|
use dataframe instead of dict
|
Li Chao
|
|
5 years ago |
|
modify the cluster threshold parameter from 0.5 to 0.2
|
Li Chao
|
|
5 years ago |
|
matrix-median and max-current operators verified
|
Li Chao
|
|
5 years ago |
|
update doc
|
Li Chao
|
|
5 years ago |
|
update doc
|
Li Chao
|
|
5 years ago |
|
add embed function for debugging
|
Li Chao
|
|
5 years ago |
|
Merge branch 'master' of github.com:leetschau/max-current
|
Li Chao
|
|
5 years ago |
|
change the test method depending on whether the dataframe is wide or narrow
|
Li Chao
|
|
5 years ago |
|
change the algorithm for calculating thresholds
|
Li Chao
|
|
5 years ago |
|
verified on data ycz6502
|
Li Chao
|
|
5 years ago |
|
status checker finished, all tests passed
|
Li Chao
|
|
5 years ago |