Data Engine (Dataset Management)

Datasources

The main class used to interact with the Data Engine is Datasource.

Here are functions that you can use to get and create datasources on your repository:

Datasets

Datasets are “save states” of Datasources with an already preapplied query. They can be stored on DagsHub and retrieved later by you or anybody else.

To save a dataset, apply a query to a datasource then call save_dataset().

Data Engine Structures