Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Emmanuel M. Smith 4199ea8438
feat: extract lines plot into module
1 year ago
24250a95bd
feat: refactor entire pipeline and improve performance/readability
2 years ago
e03c904531
fix: resolve data and modelling issues
2 years ago
4199ea8438
feat: extract lines plot into module
1 year ago
4199ea8438
feat: extract lines plot into module
1 year ago
4e6fcf5ef0
ci: add pre-commit validation stage
2 years ago
dcac11f61d
Add DVC for management of data/model files
2 years ago
2563642887
Update dependencies and prepare files for LFS removal
2 years ago
9e0a78cc45
Update pre-commit config and pass all checks
2 years ago
a9925a13fb
ci: fix dag command misconfiguration to ensure pipeline passes
2 years ago
b1e0908a3f
Readd submodules
2 years ago
037e1a8d5d
feat: start investigating data issues
2 years ago
886f9bc8c8
Update dependencies to latest versions.
2 years ago
67dba604a3
feat: finish running entire pipeline
1 year ago
4bed5d2d9e
feat: add addional moments for error surface generation
2 years ago
24250a95bd
feat: refactor entire pipeline and improve performance/readability
2 years ago
2d50202703
feat: refactor entire pipeline to remove data redundency across cores
2 years ago
2d50202703
feat: refactor entire pipeline to remove data redundency across cores
2 years ago
d08edfef46
Refactor logs and convert stored object dtypes to categories
2 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

readme.md

You have to be logged in to leave a comment. Sign In

Doctoral Thesis

🎓 Undertaking a doctoral thesis from the University of the West of Englands's PhD programme.

🔧 Usage

This project is managed with pyenv for environments, poetry for dependencies, and dvc for reproducability. If you're having issues, both .python-version and pyproject.toml should be enough for manual replication. To install the dependencies run:

# Install pyenv.
curl https://pyenv.run | bash

# Install poetry.
curl -sSL https://install.python-poetry.org | python3 -

# Restart shell to finish installations.
exec $SHELL

If you have both dependencies installed, simply run:

# Setup environment via pyenv and Poetry.
pyenv install $(cat .python-version)
poetry install --no-dev
poetry shell

You should then have an active shell environment with everything you need to replicate our results. To actually go about doing so, run:

dvc repro --pull

If you have any issues please contact sinclair.smith@uwe.ac.uk.

📝 Reporting

Almost all of the outputs of this research were written in LaTeX. If you want access to the original files:

git submodule update --init --recursive

Please note, as of writing the submodules are password protected. Due to the specific tooling used for the writing process, this will remain the case until the research concludes. Upon sucessful completion, these files will be made asscesible and read-only.

🧪 Testing

GitLab runs automatic tests whenever changes are pushed to the repository. It checks to see if the remote dataset cache is available, and tries to replicate the results using the process described above.

If you want to reproduce the testing process locally, run:

gitlab-runner exec docker replicate

This requires that you have both Docker and GitLab Runner installed. Installation instructions are outside the scope of this project.

Tip!

Press p or to see the previous file or, n or to see the next file

About

No description

Collaborators 1

Comments

Loading...