Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Integration:  dvc git github
lmshk 384f472ddc
Update to DVC v2.8.2
2 years ago
1450f94d49
Setup poetry, dvc and toolbox
3 years ago
157ad0dc1a
Refactor pipeline to use DVC v2.0 style templating
3 years ago
src
e4f20ff416
Make Genie3 output loader sort rows and columns by name, like the others
2 years ago
1450f94d49
Setup poetry, dvc and toolbox
3 years ago
aebe1d242c
Add noisyR tarball stage, and update toolbox
3 years ago
1450f94d49
Setup poetry, dvc and toolbox
3 years ago
cd18e249bc
Genesis
3 years ago
28d7935321
Add link to noisyR paper
3 years ago
384f472ddc
Update to DVC v2.8.2
2 years ago
9283e02749
Update toolbox to get noisyR v1.0
3 years ago
384f472ddc
Update to DVC v2.8.2
2 years ago
779eb62eae
Update to DVC v2.0
3 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.adoc

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
  1. = Network Inference Pipeline
  2. This project uses the https://github.com/drostlab/network-inference-toolbox[Network Inference Toolbox], a collection of Singularity containers packaging various pre-existing tools for Gene Regulatory Network Inference, to infer and evaluate some networks for specific datasets.
  3. == Dependencies
  4. * https://www.python.org/[Python] v3.9 with https://python-poetry.org/[Poetry] set up
  5. * https://sylabs.io/[Singularity] (tested with v3.7)
  6. * `git` (optional)
  7. == Overview
  8. This project is a https://dvc.org/[DVC] pipeline that glues data preprocessing, tools from `toolbox/bin` and network evaluation together. After setup as described below, you should be able to `dvc repro` the results.
  9. NOTE: To package up materials used for the https://doi.org/10.1101/2021.01.17.427026[`noisyR` paper] (Moutsopoulos et al.), run `dvc repro artifacts/noisyR-paper/dvc.yaml`, which will also implicitly generate its dependencies. The resulting tarball will be placed in `artifacts/noisyR-paper`.
  10. == Setup
  11. Clone (or download and extract) this repository and enter its directory. You then need to setup both Python dependencies and the previously mentioned Network Inference Toolbox.
  12. === Python Dependencies
  13. The recommended (but optional) way to setup DVC is to install it into a virtual environment. (E.g., call `poetry shell` and then always run `dvc` from there.) Then just run `poetry install`.
  14. === Network Inference Toolbox
  15. The recommended way to setup the Toolbox is to pull the Git submodule. In the repository root, execute:
  16. [source,sh]
  17. ----
  18. git submodule init
  19. git submodule update
  20. ----
  21. After setting up the submodule, you can build the containers by entering the `toolbox` directory and then running `make` (or, if the containers were already built somewhere else, just copy them to `toolbox/bin`).
  22. The only parts of the Network Inference Toolbox that are strictly necessary to run the pipeline are the assembled Singularity containers (i.e. `.sif` files), which the pipeline expects to find in `toolbox/bin`. If you are sure you just want to _run_ the pipeline (and not change it), you alternatively could copy the containers there, or symlink `toolbox/bin` to another location where the containers are available, without setting up the submodule as described above.
Tip!

Press p or to see the previous file or, n or to see the next file

About

Applying network-inference-toolbox to RNAseq data

Collaborators 1

Comments

Loading...